Skip to content
M

MAI-Image-2

image

MicrosoftReleased on 2026-03-20

Microsoft's latest text-to-image model, ranked #3 on Arena.ai leaderboard. Excels at photorealistic images with natural lighting, accurate skin tones, and lived-in environments. Standout feature: consistent text generation within images for infographics, slides, and diagrams.

88
Overall Score

Core Specs

0K
Context Window
0K
Max Output
Reasoning
Open Source
Multimodal Support
textimage

User Feedback Highlights

Based on community feedback. Hover to see original reviews.

+ Free access via Copilot/Bing+ Good for creative/professional work Limited API access (enterprise only for now)+ Accurate text rendering in images 30-second cooldown between generations+ Excellent photorealism with natural lighting Rate limits on free tier (15 images/24h)+ Arena.ai top 3 ranking No public pricing announced
Sentiment:👍 75%😐 20%👎 5%

Pros & Cons

Pros

  • +Excellent photorealism with natural lighting
  • +Accurate text rendering in images
  • +Free access via Copilot/Bing
  • +Arena.ai top 3 ranking
  • +Good for creative/professional work

Cons

  • Limited API access (enterprise only for now)
  • Rate limits on free tier (15 images/24h)
  • No public pricing announced
  • 30-second cooldown between generations

Reliability

Incidents (30d)0

Pricing

Input (per 1M tokens)$0.00
Output (per 1M tokens)$0.00
Free trial available
Updated on 2026-03-21

Get Started

1Visit the provider's website
2Create an account
3Start using the model

Benchmarks

arenaRank3%
noteRanked #3 on Arena.ai text-to-image leaderboard%