MAI-Thinking-1

Name: MAI-Thinking-1
Brand: Microsoft
Rating: 4.2 (422 reviews)

reasoning

Microsoft•Released on 2026-06-02

Microsoft's first in-house flagship reasoning model, unveiled at Build 2026. A ~35B active-parameter sparse Mixture-of-Experts model trained on commercially licensed data (Microsoft states it was trained without OpenAI data), with a 256K-token context window, function calling, and developer instruction support. Microsoft reports 97.0% on AIME 2025 and 94.5% on AIME 2026, and says it matches Claude Opus 4.6 on SWE-Bench Pro while being preferred over Claude Sonnet 4.6 in blind side-by-side evaluations run by its human-rating partner Surge. Available in private preview through Microsoft Foundry, with availability announced for OpenRouter, Fireworks AI, and Baseten. Public pricing is not yet finalized, and the benchmark claims have not yet been independently reproduced.

Overall Score

Voice of the community

“MAI-Thinking-1 reaches 97.0% on AIME 2025 and 94.5% on AIME 2026, and on SWE-Bench Pro Microsoft says it matches Claude Opus 4.6 on coding tasks.”
TechTimes2026-06-02

“Microsoft has published a preprint describing the evaluation methodology, but full reproduction of the results by independent labs has not yet occurred, leaving those benchmark claims open to challenge until confirmed externally.”
TechTimes2026-06-02

Core Specs

256K

Context Window

32K

Max Output

ReasoningOpen Sourcetext

Scenario Scores

Pros & Cons

Sentiment50% +50% ·0% −

Pros

+Strong reported math reasoning (AIME 2025 97.0%, AIME 2026 94.5%)
+Microsoft says it matches Claude Opus 4.6 on SWE-Bench Pro for its weight class
+256K-token context window with function calling and developer instructions
+Trained on commercially licensed data — lower IP/provenance risk
+Coming to third-party inference (OpenRouter, Fireworks, Baseten)

Cons

−Private preview only at launch — limited access via Microsoft Foundry
−Public pricing not yet finalized
−Benchmark claims not yet independently reproduced (preprint methodology only)
−Text-only at launch (no native multimodal input)
−No community track record yet

Pricing

Input (per 1M tokens)$0.00

Output (per 1M tokens)$0.00

Updated on 2026-06-03

Get Started

1Visit the provider's website

2Create an account

3Start using the model

Benchmarks

userRating%

aiIndex%

noteMicrosoft-reported: AIME 2025 97.0%, AIME 2026 94.5%; SWE-Bench Pro matching Claude Opus 4.6. Not yet independently reproduced.%

Reliability

Incidents (30d)0

Compare with Others

MAI-Thinking-1 vs Claude Fable 5 →MAI-Thinking-1 vs Nex-N2-Pro →MAI-Thinking-1 vs Nemotron 3 Ultra →