Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for cost-sensitive, high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities including audio input, RAG snippet ranking, translation, data extraction, and code completion. It supports full thinking levels (minimal/low/medium/high) for fine-grained cost-performance trade-offs, and is priced at half the cost of Gemini 3 Flash.
Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for cost-sensitive, high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities including audio input, RAG snippet ranking, translation, data extraction, and code completion. It supports full thinking levels (minimal/low/medium/high) for fine-grained cost-performance trade-offs, and is priced at half the cost of Gemini 3 Flash.