mistralai/mistral-small-24b-instruct-2501
32,768 context · $0.05/M input tokens · $0.08/M output tokens
Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...
Pay-per-Use
Keine Vorabkosten, zahlen Sie nur, was Sie nutzen
Verwenden Sie die folgenden Codebeispiele zur Integration mit unserer API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="mistralai/mistral-small-24b-instruct-2501",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks
Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed for efficient local deployment.
The model achieves 81% accuracy on the MMLU benchmark and performs competitively with larger models like Llama 3.3 70B and Qwen 32B, while operating at three times the speed on equivalent hardware. Read the blog post about the model here.
| Specification | Value |
|---|---|
| Provider | Mistralai |
| Model Type | Large Language Model (LLM) |
| Architecture | N/A |
| Context Window | 32768 tokens |
| Max Output | 16384 tokens |
| Input | Text |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Token Type | Cost per Million Tokens |
|---|---|
| Input | $0.0 |
| Output | $0.1 |
Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: mistralai/mistral-small-24b-instruct-2501
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="mistralai/mistral-small-24b-instruct-2501",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "mistralai/mistral-small-24b-instruct-2501",
"messages": [{"role": "user", "content": "Hello!"}]
}'
mistralai/mistral-small-24b-instruct-2501
Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...
Eingabe
$0.05 /M
Ausgabe
$0.08 /M
Kontext
33K
Max. Ausgabe
16K
Tool-Nutzung
Unterstützt
Zugriff auf Mistral Small 24b Instruct 2501 über unsere einheitliche API — OpenAI-kompatibel, keine Kaltstarts, transparente Preise.
Preise auf WaveSpeedAI: $0.05 pro Million Input-Tokens und $0.08 pro Million Output-Tokens. Prompt-Caching und Batch-Verarbeitung werden separat berechnet und reduzieren die effektiven Kosten bei langen, sich wiederholenden Workloads.
Mistral Small 24b Instruct 2501 unterstützt bis zu 33K Kontext-Tokens und bis zu 16K Output-Tokens pro Anfrage.
Ja. WaveSpeedAI stellt Mistral Small 24b Instruct 2501 über einen OpenAI-kompatiblen Endpunkt unter https://llm.wavespeed.ai/v1 bereit. Richten Sie das offizielle OpenAI SDK mit Ihrem WaveSpeedAI-API-Schlüssel auf diese Base-URL — keine weiteren Codeänderungen erforderlich.
Bei WaveSpeedAI anmelden, in Access Keys einen API-Schlüssel erstellen und eine Anfrage an https://llm.wavespeed.ai/v1/chat/completions mit der oben angezeigten Model-ID senden. Neue Konten erhalten kostenlose Credits, um Mistral Small 24b Instruct 2501 zu testen.