
MiniMax M2.1

MiniMax M2.1 is MiniMax's second-generation model, focused on coding accuracy, tool use, instruction following, and long-horizon planning. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.

Reasoning · Tool Use · Implicit Caching
index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2.1',
  prompt: 'Why is the sky blue?',
})

What To Consider When Choosing a Provider

  • Throughput: Teams whose requests run asynchronously (background jobs, scheduled pipelines, queued reviews) gain nothing from a throughput premium. Standard MiniMax M2.1 at the baseline rate is the right choice for those patterns.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
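As an illustration of the API-key path, a minimal setup might look like the sketch below. The `createGateway` helper from `@ai-sdk/gateway` and the `AI_GATEWAY_API_KEY` environment variable are assumptions based on the AI SDK's gateway provider, not details confirmed by this page; adjust to your deployment.

```typescript
// Sketch: explicit API-key configuration for AI Gateway requests.
// Assumes the `@ai-sdk/gateway` provider and an AI_GATEWAY_API_KEY
// environment variable; on Vercel deployments an OIDC token can be
// used instead, so no key needs to be set there.
import { createGateway } from '@ai-sdk/gateway'
import { streamText } from 'ai'

const gateway = createGateway({
  apiKey: process.env.AI_GATEWAY_API_KEY,
})

const result = streamText({
  model: gateway('minimax/minimax-m2.1'),
  prompt: 'Summarize this changelog.',
})
```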

When to Use MiniMax M2.1

Best For

  • Polyglot codebases: Projects using Go, C++, JavaScript, C#, TypeScript, Rust, Java, Kotlin, or Objective-C
  • Asynchronous engineering pipelines: CI review bots, nightly audit scripts, and queued refactoring
  • Long tool-call chains: Sequences of four or more steps that demand sequential fidelity
  • Upgrading from M2: Teams that need measurably better code without changing the cost envelope
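To make "sequential fidelity" concrete, the sketch below shows the client-side shape of a long tool-call chain: a small tool registry and an executor that applies the model's tool calls strictly in the order they were emitted. The tool names and the `ToolCall` shape are illustrative stand-ins, not part of the MiniMax API.

```typescript
// Illustrative sketch: executing a model's tool calls strictly in order.
// The tools and the call list are hypothetical examples of what a model
// like MiniMax M2.1 would emit over a multi-step chain.
type ToolCall = { tool: string; args: Record<string, unknown> }

const tools: Record<string, (args: Record<string, unknown>) => string> = {
  readFile: (args) => `contents of ${args.path}`,
  runTests: () => 'all tests passed',
  commit: (args) => `committed: ${args.message}`,
}

// Apply each call in the order the model produced it. A dropped or
// reordered step here is exactly the failure mode described for M2.
function runChain(calls: ToolCall[]): string[] {
  return calls.map(({ tool, args }) => {
    const fn = tools[tool]
    if (!fn) throw new Error(`unknown tool: ${tool}`)
    return fn(args)
  })
}

const results = runChain([
  { tool: 'readFile', args: { path: 'src/index.ts' } },
  { tool: 'runTests', args: {} },
  { tool: 'commit', args: { message: 'fix: null check' } },
])
```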

Consider Alternatives When

  • Real-time latency sensitivity: When end users watch response tokens arrive live and perceive delays, M2.1 Lightning is the better fit
  • Architectural planning needed: M2.5 introduced a plan-then-code capability
  • Vision input required: M2.1 is text-only; route image-bearing requests to a multimodal model instead
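The routing guidance above can be sketched as a small helper. Only `minimax/minimax-m2.1` is an identifier this page documents; the Lightning variant's ID and the multimodal fallback are placeholders to substitute with the IDs your gateway exposes.

```typescript
// Sketch of per-request model routing per the guidance above.
// The lightning and multimodal identifiers are hypothetical placeholders.
type RequestShape = { hasImages: boolean; userFacingStream: boolean }

function pickModel(req: RequestShape): string {
  if (req.hasImages) return 'some-provider/multimodal-model' // placeholder
  if (req.userFacingStream) return 'minimax/minimax-m2.1-lightning' // placeholder ID
  return 'minimax/minimax-m2.1'
}
```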

Conclusion

MiniMax M2.1 fixed the rough edges that kept M2 out of production engineering workflows. It delivers cleaner multilingual output, reliable multi-step execution, and Interleaved Thinking at the standard rate. It serves as the foundation of MiniMax's second generation for teams that prioritize correctness over velocity.

Frequently Asked Questions

  • What specific programming tasks improved from M2 to MiniMax M2.1?

    Refactoring accuracy, feature scaffolding structure, bug-fix precision, and automated code-review adherence all improved. The gains show most on multi-file tasks that require sustained instruction fidelity.

  • Does MiniMax M2.1 support Interleaved Thinking?

    Yes. The 2.1 generation introduced this capability, letting the model alternate between reasoning and action during complex instruction sequences.

  • How does MiniMax M2.1 handle a five-step tool-call chain?

    MiniMax M2.1 executes steps sequentially without reordering or omission. M2 occasionally dropped or shuffled later steps in long chains.

  • What is the migration path from M2?

    Swap the model identifier to minimax/minimax-m2.1 in your API calls. The request and response formats are unchanged.
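    The swap is a one-line change. Assuming the predecessor's identifier was minimax/minimax-m2 (an assumption; verify against your gateway's model list), a migration helper for existing configs could look like:

    ```typescript
    // Sketch: map the assumed M2 identifier to M2.1, leaving other
    // model IDs untouched. Request/response formats are unchanged,
    // so no other code needs to move.
    function migrateModelId(id: string): string {
      return id === 'minimax/minimax-m2' ? 'minimax/minimax-m2.1' : id
    }
    ```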

  • Is MiniMax M2.1 appropriate for a CI bot that reviews every pull request?

    Yes, it's a fit. Automated review is asynchronous, so the baseline inference rate carries no penalty. MiniMax M2.1's improved instruction adherence means review criteria apply consistently across every PR.

  • When should I look past the 2.1 generation entirely?

    When your workflow benefits from the plan-then-code architecture that M2.5 introduced, or the multi-agent coordination in M2.7. Those later generations address different design patterns.