Gemini 3.1 Flash Lite
budgetGoogle•Released on 2026-03-03
Google's fastest and most cost-efficient Gemini 3 series model. 2.5X faster Time to First Token and 45% faster output than 2.5 Flash. Designed for high-volume workloads including translation, content moderation, UI generation, and simulations. Supports adjustable thinking levels.
71
Overall Score
Core Specs
1000K
Context Window
NaNK
Max Output
✓
Reasoning
✗
Open Source
Multimodal Support
textimage
Scenario Scores
User Feedback Highlights
Based on community feedback. Hover to see original reviews.
+ Great for high-volume tasks+ Cheapest Gemini 3 model ($0.25/$1.50)+ 2.5X faster TTFT than 2.5 Flash+ 45% faster output speed+ Free tier via AI Studio− Preview stage only− New model, limited community feedback− Not optimized for complex coding tasks+ Adjustable thinking levels
Sentiment:👍 0%😐 0%👎 0%
Pros & Cons
Pros
- +Cheapest Gemini 3 model ($0.25/$1.50)
- +2.5X faster TTFT than 2.5 Flash
- +45% faster output speed
- +Adjustable thinking levels
- +Great for high-volume tasks
- +Free tier via AI Studio
Cons
- −New model, limited community feedback
- −Not optimized for complex coding tasks
- −Preview stage only
Reliability
Pricing
Input (per 1M tokens)$0.25
Output (per 1M tokens)$1.50
Free trial available
Updated on
Compare with Others
Benchmarks
gpqaDiamond86.9%
mmmuPro76.8%
arenaElo1432%