Skip to content
Google

Gemini 3.1 Flash Lite

budget

GoogleReleased on 2026-03-03

Google's fastest and most cost-efficient Gemini 3 series model. 2.5X faster Time to First Token and 45% faster output than 2.5 Flash. Designed for high-volume workloads including translation, content moderation, UI generation, and simulations. Supports adjustable thinking levels.

71
Overall Score

Voice of the community

sample 78

Gemini 3.1 Flash Lite is the dumbest Google model released so far. There is a widely reported bug from March 4 where it returns Finish_reason=STOP prematurely during multi-step tool use.

r/Bard2026-03-09

3 times as expensive as 2.5 Flash Lite and real world performance is horrible compared to what you're paying

r/Bard2026-03-04

Gemini Flash Lite is insanely fast for batch processing (like sorting thousands of photos), which was almost too expensive on older models

r/OpenAI2026-03-05

Core Specs

1000K
Context Window
NaNK
Max Output
ReasoningOpen Sourcetextimage

Pros & Cons

Sentiment25% +20% ·55% −

Pros

  • +Cheapest Gemini 3 model ($0.25/$1.50)
  • +2.5X faster TTFT than 2.5 Flash
  • +45% faster output speed
  • +Adjustable thinking levels
  • +Great for high-volume tasks
  • +Free tier via AI Studio

Cons

  • New model, limited community feedback
  • Not optimized for complex coding tasks
  • Preview stage only

Pricing

Input (per 1M tokens)$0.25
Output (per 1M tokens)$1.50
Free trial available
Updated on 2026-03-26

Get Started

1Visit the provider's website
2Create an account
3Start using the model

Benchmarks

gpqaDiamond86.9%
mmmuPro76.8%
arenaElo1432%

Reliability

SLA99.5%
Incidents (30d)1
View Status Page →