M
MAI-Image-2
imageMicrosoft•Released on 2026-03-20
Microsoft's latest text-to-image model, ranked #3 on Arena.ai leaderboard. Excels at photorealistic images with natural lighting, accurate skin tones, and lived-in environments. Standout feature: consistent text generation within images for infographics, slides, and diagrams.
88
Overall Score
Voice of the community
sample 50“MAI-Image-2 is best used to generate photorealistic images with elements like natural light, accurate skin tones, environments that feel lived-in.”
Core Specs
0K
Context Window
0K
Max Output
ReasoningOpen Sourcetextimage
Pros & Cons
Sentiment75% +20% ·5% −
Pros
- +Excellent photorealism with natural lighting
- +Accurate text rendering in images
- +Free access via Copilot/Bing
- +Arena.ai top 3 ranking
- +Good for creative/professional work
Cons
- −Limited API access (enterprise only for now)
- −Rate limits on free tier (15 images/24h)
- −No public pricing announced
- −30-second cooldown between generations
Pricing
Input (per 1M tokens)$0.00
Output (per 1M tokens)$0.00
Free trial available
Updated on 2026-03-21
Get Started
1Visit the provider's website
2Create an account
3Start using the model
Benchmarks
arenaRank3%
noteRanked #3 on Arena.ai text-to-image leaderboard%
Reliability
Incidents (30d)0