xiaomi/mimo-v2.5-pro
1,048,576 context · $1.00/M input tokens · $3.00/M output tokens
MiMo-V2.5-Pro is Xiaomi’s flagship open model for advanced agentic workflows, complex software engineering, and long-horizon task execution. Built on a sparse Mixture-of-Experts architecture with 1.02T total parameters and 42B active parameters, it supports a 1M-token context window and is optimized for autonomous coding agents, large codebase reasoning, tool-use workflows, and multi-step problem solving. It delivers strong performance on agentic and software engineering benchmarks such as ClawEval, GDPVal, and SWE-bench Pro, with an emphasis on token-efficient long-context execution.
Pay-per-use
No upfront costs, pay only for what you use
Use the following code examples to integrate with our API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="xiaomi/mimo-v2.5-pro",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)MiMo-V2.5-Pro is Xiaomi’s flagship open model for advanced agentic workflows, complex software engineering, and long-horizon task execution. Built on a sparse Mixture-of-Experts architecture with 1.02T total parameters and 42B active parameters, it is optimized for autonomous coding agents, large codebase reasoning, tool use, and multi-step problem solving.
| Specification | Value |
|---|---|
| Provider | xiaomi |
| Model Type | Chat Completions model |
| Architecture | Sparse Mixture-of-Experts |
| Parameters | 1.02T total / 42B active |
| Context Window | 1,048,576 tokens |
| Max Input | 1,032,192 tokens |
| Max Output | 16,384 tokens |
| Input | Text |
| Output | Text |
| Vision | Not listed |
| Function Calling | Supported |
| Structured Outputs | Supported |
| Primary Use Cases | Agentic coding, complex software engineering, long-horizon tasks, tool use |
| Token Type | Cost |
|---|---|
| Input | $1.00 per million tokens |
| Output | $3.00 per million tokens |
| Cached Input | $0.20 per million tokens |
Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: xiaomi/mimo-v2.5-pro
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="xiaomi/mimo-v2.5-pro",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "xiaomi/mimo-v2.5-pro",
"messages": [{"role": "user", "content": "Hello!"}]
}'
xiaomi/mimo-v2.5-pro
MiMo-V2.5-Pro is Xiaomi’s flagship open model for advanced agentic workflows, complex software engineering, and long-horizon task execution. Built on a sparse Mixture-of-Experts architecture with 1.02T total parameters and 42B active parameters, it supports a 1M-token context window and is optimized for autonomous coding agents, large codebase reasoning, tool-use workflows, and multi-step problem solving. It delivers strong performance on agentic and software engineering benchmarks such as ClawEval, GDPVal, and SWE-bench Pro, with an emphasis on token-efficient long-context execution.
Input
$1 /M
Output
$3 /M
Context
1049K
Max Output
16K
Tool Use
Supported
Access Mimo V2.5 Pro through our unified API — OpenAI-compatible, no cold starts, transparent pricing.
Pricing on WaveSpeedAI: $1.00 per million input tokens and $3.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce effective cost on long, repetitive workloads.
Mimo V2.5 Pro supports up to 1049K tokens of context with up to 16K tokens of output per request.
Yes. WaveSpeedAI exposes Mimo V2.5 Pro through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key — no other code changes required.
Sign in to WaveSpeedAI, create an API key in Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with model id set to the value shown above. New accounts receive free credits to evaluate Mimo V2.5 Pro before paying per token.