MiniMax M3 Free Coding Guide: GPT-5.5-Level Coding for $20/Month

Q: When will M3 open weights be released?

The official commitment is within 10 days of launch. As of June 8, 2026, weights have not yet been released.

Bottom line: MiniMax M3, released June 2026, scores 59% on SWE-Bench Pro — beating GPT-5.5 and Gemini 3.1 Pro, approaching Claude Opus 4.7. The $20/month Token Plan gives you 1.7 billion tokens, roughly 10x what Claude Pro offers at the same price. New accounts get ¥15 free credits, and OpenRouter has a free tier.

What is MiniMax M3

MiniMax M3 is the latest flagship model from MiniMax (稀宇科技), a Chinese AI company founded in 2022. It is not a general chatbot — it is specifically designed for coding and agentic tasks. Three things make it stand out:

MSA (MiniMax Sparse Attention): A novel attention architecture with 1M token context window. Per-token compute at 1M context is just 1/20 of the previous generation.
Native multimodality: Trained from Step 0 with interleaved text, image, speech, and music data — not stitched together after the fact.
Open weights: MiniMax has committed to releasing model weights within 10 days of launch. M3 is the first and only open-weight model combining frontier coding, 1M context, and native multimodality.

Coding Benchmarks: Beats GPT-5.5

Here is how M3 performs on internationally recognized coding benchmarks:

SWE-Bench Pro: 59.0% — Surpasses GPT-5.5 and Gemini 3.1 Pro, approaches Claude Opus 4.7
Terminal-Bench 2.1: 66.0% — Terminal operation capability
MCP Atlas: 74.2% — Tool calling ability
KernelBench Hard: 28.8% — CUDA kernel optimization
SVG-Bench — Outperforms Opus 4.7 on SVG generation

These are not self-reported numbers. MiniMax ran SWE-Bench with Claude Code as scaffolding (4 runs, averaged), and Terminal-Bench with the official Terminus 2 evaluation framework.

Real-World Tasks: 12 Hours of Autonomous Paper Reproduction

Benchmarks are one thing. Real tasks are another. MiniMax gave M3 an ICLR 2025 Outstanding Paper and asked it to reproduce the results independently:

Ran autonomously for nearly 12 hours, producing 18 commits and 23 experimental files
Successfully matched the prediction-probability trends during SFT
Clearly observed the squeezing effect in DPO experiments
Required multimodal understanding (curves and formulas), long context (paper + code + logs simultaneously), and strong coding skills

Another test: M3 optimized an FP8 matrix multiplication CUDA kernel on NVIDIA Hopper GPUs. In 24 hours, it completed 147 benchmark submissions and 1,959 tool calls, improving hardware peak utilization from 7.6% to 71.3% — a 9.4x speedup. Most other models stopped making progress within the first 30 submissions.

Pricing: $20 = 10x Claude Pro

This is the most exciting part. MiniMax Token Plan tiers:

Plus $20/month: ~1.7 billion tokens/month of M3 usage
Max $50/month: ~5.1 billion tokens/month
Ultra $120/month: ~9.8 billion tokens/month

For comparison, Claude Pro is also $20/month but gives you roughly one-tenth the token quota. Text, image, speech, and music all share the same usage pool. Among subscription plans at comparable price points, MiniMax Token Plan offers one of the highest token quotas globally.

API pricing on OpenRouter: $0.30 per million input tokens, $1.20 per million output tokens. 1M context window, max output 512K tokens.

How to Try M3 for Free

Three ways to experience M3 at zero or low cost:

¥15 signup credits: Register at platform.minimaxi.com and get ¥15 in trial credits automatically
OpenRouter free tier: Call M3 through OpenRouter API — new users get free credits
MiniMax Code desktop app: Download from agent.minimaxi.com/download, works with Token Plan

MiniMax Code: Purpose-Built for M3

MiniMax Code is the official agent coding product, built on the open-source projects OpenCode and Pi. Key features:

Agent Team: Breaks large tasks into multi-stage, concurrent, dynamically adjustable workflows, advanced collaboratively by a cluster of agents
Producer + Verifier: One agent writes code, another verifies it, forming a self-correcting loop
Computer Use: Can operate your computer — e.g., "Open the local ERP client and batch-enter invoice data from this Excel spreadsheet"
Skills and Memories: Learns your work habits and turns repetitive tasks into custom skills

MiniMax Code's Agent Team is similar in direction to Claude Code's recently released Dynamic Workflows. Claude Code emphasizes fixed orchestration based on JS code; MiniMax Code focuses more on deep reflection and continuous error correction.

M3 vs Top Coding Models

M3 vs Claude Opus 4.7: M3 approaches Opus 4.7 on SWE-Bench Pro but costs a fraction of the price. Opus 4.7 is $5/$25 per million tokens; M3 is $0.30/$1.20
M3 vs GPT-5.5: M3 surpasses GPT-5.5 on SWE-Bench Pro. GPT-5.5 scored 83.4% via Codex CLI (different scaffolding); M3 scored 59% via Claude Code (official evaluation)
M3 vs Gemini 3.1 Pro: M3 outperforms Gemini 3.1 Pro on multiple benchmarks including coding and multimodal tasks
M3 vs M2.7: M3 is a comprehensive upgrade over M2.7 in coding, context length, and multimodality

MSA Architecture: Why 1M Context Matters

MSA (MiniMax Sparse Attention) is the core technical innovation in M3. Compared to DSA and MoBA sparse attention approaches, MSA partitions KV blocks more precisely, achieving higher effective context coverage.

What this means in practice: a 1M token context window lets you feed an entire codebase, research paper, and experiment logs into the model simultaneously. For coding agents, this determines whether they can handle real-world large projects.

At 1M context length, M3's per-token compute is just 1/20 of the previous generation model. This is architectural innovation, not brute-force scaling.

Limitations

Some honest caveats:

M3's SWE-Bench Pro score (59%) beats GPT-5.5 but still has a gap with Opus 4.7
The open weights promise ("within 10 days") has not been fulfilled yet as of June 8, 2026
MiniMax is a relatively young company — ecosystem and community are less mature than OpenAI/Anthropic
API stability needs time to prove; the official site mentions "continuously improving model serving stability"
Free credits are limited (¥15); heavy usage requires payment

Who Should Use M3

Budget-conscious developers: $20/month for 1.7B tokens is unbeatable value
Long-context projects: 1M context window for large codebase analysis
Agent/automation use cases: M3 excels on agentic benchmarks
Teams wanting open-source models: Weights are coming soon, enabling local deployment

❓ FAQ

Q: Is MiniMax M3 really free?

A: New accounts get ¥15 in trial credits. OpenRouter has a free tier. The Token Plan starts at $20/month with the highest token quota at that price point. API is pay-per-use at $0.30/million input tokens.

Q: M3 vs Claude Code — which is better?

A: M3 approaches Opus 4.7 on SWE-Bench Pro at a fraction of the cost. Claude Code has a more mature ecosystem. Choose M3 for budget, Claude Code for stability.

Q: Does M3 support Chinese?

A: Yes. M3 is a native multimodal model supporting both Chinese and English. MiniMax is a Chinese company, so Chinese language capability is well-supported.

Q: When will M3 open weights be released?

A: The official commitment is "within 10 days of launch." As of June 8, 2026, weights have not yet been released. Check MiniMax official channels for updates.

Q: How can I try M3 at the lowest cost?

A: Register at platform.minimaxi.com for ¥15 free credits, or use OpenRouter's free tier to call the M3 API.