Gemini 3.1 Flash Lite
budgetGoogle•Released on 2026-03-03
Google's fastest and most cost-efficient Gemini 3 series model. 2.5X faster Time to First Token and 45% faster output than 2.5 Flash. Designed for high-volume workloads including translation, content moderation, UI generation, and simulations. Supports adjustable thinking levels.
71
Overall Score
Voice of the community
sample 78“Gemini 3.1 Flash Lite is the dumbest Google model released so far. There is a widely reported bug from March 4 where it returns Finish_reason=STOP prematurely during multi-step tool use.”
“3 times as expensive as 2.5 Flash Lite and real world performance is horrible compared to what you're paying”
“Gemini Flash Lite is insanely fast for batch processing (like sorting thousands of photos), which was almost too expensive on older models”
Core Specs
1000K
Context Window
NaNK
Max Output
ReasoningOpen Sourcetextimage
Scenario Scores
Pros & Cons
Sentiment25% +20% ·55% −
Pros
- +Cheapest Gemini 3 model ($0.25/$1.50)
- +2.5X faster TTFT than 2.5 Flash
- +45% faster output speed
- +Adjustable thinking levels
- +Great for high-volume tasks
- +Free tier via AI Studio
Cons
- −New model, limited community feedback
- −Not optimized for complex coding tasks
- −Preview stage only
Pricing
Input (per 1M tokens)$0.25
Output (per 1M tokens)$1.50
Free trial available
Updated on 2026-03-26
Get Started
1Visit the provider's website
2Create an account
3Start using the model
Benchmarks
gpqaDiamond86.9%
mmmuPro76.8%
arenaElo1432%