MAI-Image-2

image

Microsoft · Released on 2026-03-20

Microsoft's latest text-to-image model, ranked #3 on the Arena.ai text-to-image leaderboard. It excels at photorealistic images with natural lighting, accurate skin tones, and lived-in environments. Standout feature: consistent text rendering within images, which suits infographics, slides, and diagrams.

Overall Score: 88

Voice of the community

Sample: 50

MAI-Image-2 is best used to generate photorealistic images with elements like natural light, accurate skin tones, environments that feel lived-in.

Microsoft AI, 2026-03-20

Core Specs

Context Window: 0K
Max Output: 0K
Reasoning · Open Source · text · image

Pros & Cons

Sentiment: 75% positive · 20% neutral · 5% negative

Pros

  • Excellent photorealism with natural lighting
  • Accurate text rendering in images
  • Free access via Copilot/Bing
  • Arena.ai top 3 ranking
  • Good for creative/professional work

Cons

  • Limited API access (enterprise only for now)
  • Rate limits on free tier (15 images/24h)
  • No public pricing announced
  • 30-second cooldown between generations
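
The two free-tier limits listed above (15 images per 24 hours and a 30-second cooldown) can be respected on the client side with a small throttle. The following is only a sketch under stated assumptions: it treats the 24-hour limit as a rolling window, which the listing does not confirm, and the class and method names are hypothetical. It calls no Microsoft API and only tracks timestamps.

```python
from collections import deque

# Limits taken from the Cons list; how the service enforces them is not documented here.
DAILY_LIMIT = 15                 # images per 24 hours
WINDOW_SECONDS = 24 * 60 * 60    # assumed to be a rolling window
COOLDOWN_SECONDS = 30            # minimum gap between generations


class GenerationThrottle:
    """Hypothetical helper: remembers past generation times and reports the wait."""

    def __init__(self):
        self._times = deque()  # timestamps of recorded generations, oldest first

    def seconds_until_allowed(self, now: float) -> float:
        # Forget generations that have aged out of the 24-hour window.
        while self._times and now - self._times[0] >= WINDOW_SECONDS:
            self._times.popleft()
        wait = 0.0
        if self._times:
            # Cooldown: measured from the most recent generation.
            wait = max(wait, self._times[-1] + COOLDOWN_SECONDS - now)
        if len(self._times) >= DAILY_LIMIT:
            # Quota: wait until the oldest generation leaves the window.
            wait = max(wait, self._times[0] + WINDOW_SECONDS - now)
        return max(wait, 0.0)

    def record(self, now: float) -> None:
        # Call this right after a generation request has been sent.
        self._times.append(now)
```

In practice `now` could come from `time.monotonic()`; passing it in explicitly keeps the helper easy to test without sleeping.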

Pricing

Input (per 1M tokens): $0.00
Output (per 1M tokens): $0.00
Free trial available
Updated on 2026-03-21

Get Started

1. Visit the provider's website
2. Create an account
3. Start using the model

Benchmarks

Arena rank: 3
Note: Ranked #3 on the Arena.ai text-to-image leaderboard

Reliability

Incidents (30d): 0