MiniMax M3 Free Coding Guide: GPT-5.5-Level Coding for $20/Month
Bottom line: MiniMax M3, released June 2026, scores 59% on SWE-Bench Pro — beating GPT-5.5 and Gemini 3.1 Pro, approaching Claude Opus 4.7. The $20/month Token Plan gives you 1.7 billion tokens, roughly 10x what Claude Pro offers at the same price. New accounts get ¥15 free credits, and OpenRouter has a free tier.
What is MiniMax M3
MiniMax M3 is the latest flagship model from MiniMax (稀宇科技), a Chinese AI company founded in 2022. It is not a general chatbot — it is specifically designed for coding and agentic tasks. Three things make it stand out:
- MSA (MiniMax Sparse Attention): A novel attention architecture with 1M token context window. Per-token compute at 1M context is just 1/20 of the previous generation.
- Native multimodality: Trained from Step 0 with interleaved text, image, speech, and music data — not stitched together after the fact.
- Open weights: MiniMax has committed to releasing model weights within 10 days of launch. M3 is the first and only open-weight model combining frontier coding, 1M context, and native multimodality.
Coding Benchmarks: Beats GPT-5.5
Here is how M3 performs on internationally recognized coding benchmarks:
- SWE-Bench Pro: 59.0% — Surpasses GPT-5.5 and Gemini 3.1 Pro, approaches Claude Opus 4.7
- Terminal-Bench 2.1: 66.0% — Terminal operation capability
- MCP Atlas: 74.2% — Tool calling ability
- KernelBench Hard: 28.8% — CUDA kernel optimization
- SVG-Bench — Outperforms Opus 4.7 on SVG generation
These are not self-reported numbers. MiniMax ran SWE-Bench with Claude Code as scaffolding (4 runs, averaged), and Terminal-Bench with the official Terminus 2 evaluation framework.
Real-World Tasks: 12 Hours of Autonomous Paper Reproduction
Benchmarks are one thing. Real tasks are another. MiniMax gave M3 an ICLR 2025 Outstanding Paper and asked it to reproduce the results independently:
- Ran autonomously for nearly 12 hours, producing 18 commits and 23 experimental files
- Successfully matched the prediction-probability trends during SFT
- Clearly observed the squeezing effect in DPO experiments
- Required multimodal understanding (curves and formulas), long context (paper + code + logs simultaneously), and strong coding skills
Another test: M3 optimized an FP8 matrix multiplication CUDA kernel on NVIDIA Hopper GPUs. In 24 hours, it completed 147 benchmark submissions and 1,959 tool calls, improving hardware peak utilization from 7.6% to 71.3% — a 9.4x speedup. Most other models stopped making progress within the first 30 submissions.
Pricing: $20 = 10x Claude Pro
This is the most exciting part. MiniMax Token Plan tiers:
- Plus $20/month: ~1.7 billion tokens/month of M3 usage
- Max $50/month: ~5.1 billion tokens/month
- Ultra $120/month: ~9.8 billion tokens/month
For comparison, Claude Pro is also $20/month but gives you roughly one-tenth the token quota. Text, image, speech, and music all share the same usage pool. Among subscription plans at comparable price points, MiniMax Token Plan offers one of the highest token quotas globally.
API pricing on OpenRouter: $0.30 per million input tokens, $1.20 per million output tokens. 1M context window, max output 512K tokens.
How to Try M3 for Free
Three ways to experience M3 at zero or low cost:
- ¥15 signup credits: Register at platform.minimaxi.com and get ¥15 in trial credits automatically
- OpenRouter free tier: Call M3 through OpenRouter API — new users get free credits
- MiniMax Code desktop app: Download from agent.minimaxi.com/download, works with Token Plan
MiniMax Code: Purpose-Built for M3
MiniMax Code is the official agent coding product, built on the open-source projects OpenCode and Pi. Key features:
- Agent Team: Breaks large tasks into multi-stage, concurrent, dynamically adjustable workflows, advanced collaboratively by a cluster of agents
- Producer + Verifier: One agent writes code, another verifies it, forming a self-correcting loop
- Computer Use: Can operate your computer — e.g., "Open the local ERP client and batch-enter invoice data from this Excel spreadsheet"
- Skills and Memories: Learns your work habits and turns repetitive tasks into custom skills
MiniMax Code's Agent Team is similar in direction to Claude Code's recently released Dynamic Workflows. Claude Code emphasizes fixed orchestration based on JS code; MiniMax Code focuses more on deep reflection and continuous error correction.
M3 vs Top Coding Models
- M3 vs Claude Opus 4.7: M3 approaches Opus 4.7 on SWE-Bench Pro but costs a fraction of the price. Opus 4.7 is $5/$25 per million tokens; M3 is $0.30/$1.20
- M3 vs GPT-5.5: M3 surpasses GPT-5.5 on SWE-Bench Pro. GPT-5.5 scored 83.4% via Codex CLI (different scaffolding); M3 scored 59% via Claude Code (official evaluation)
- M3 vs Gemini 3.1 Pro: M3 outperforms Gemini 3.1 Pro on multiple benchmarks including coding and multimodal tasks
- M3 vs M2.7: M3 is a comprehensive upgrade over M2.7 in coding, context length, and multimodality
MSA Architecture: Why 1M Context Matters
MSA (MiniMax Sparse Attention) is the core technical innovation in M3. Compared to DSA and MoBA sparse attention approaches, MSA partitions KV blocks more precisely, achieving higher effective context coverage.
What this means in practice: a 1M token context window lets you feed an entire codebase, research paper, and experiment logs into the model simultaneously. For coding agents, this determines whether they can handle real-world large projects.
At 1M context length, M3's per-token compute is just 1/20 of the previous generation model. This is architectural innovation, not brute-force scaling.
Limitations
Some honest caveats:
- M3's SWE-Bench Pro score (59%) beats GPT-5.5 but still has a gap with Opus 4.7
- The open weights promise ("within 10 days") has not been fulfilled yet as of June 8, 2026
- MiniMax is a relatively young company — ecosystem and community are less mature than OpenAI/Anthropic
- API stability needs time to prove; the official site mentions "continuously improving model serving stability"
- Free credits are limited (¥15); heavy usage requires payment
Who Should Use M3
- Budget-conscious developers: $20/month for 1.7B tokens is unbeatable value
- Long-context projects: 1M context window for large codebase analysis
- Agent/automation use cases: M3 excels on agentic benchmarks
- Teams wanting open-source models: Weights are coming soon, enabling local deployment