baidu/ernie-4.5-300b-a47b
123,000 context · $0.28/M input tokens · $1.10/M output tokens
Pay-per-use
No upfront cost; pay only for what you use.
Use the following code example to integrate our API:
from openai import OpenAI

# Point the official OpenAI SDK at the WaveSpeedAI endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="baidu/ernie-4.5-300b-a47b",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first completion.
print(response.choices[0].message.content)

ERNIE-4.5-300B-A47B is a 300B-parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in both English and Chinese. Optimized for high-throughput inference and efficient scaling, it uses a heterogeneous MoE structure with advanced routing and quantization strategies, including FP8 and 2-bit formats. This version is fine-tuned for language-only tasks and supports reasoning, tool parameters, and extended context lengths of up to 131k tokens; on this endpoint the context window is 123,000 tokens. It is suitable for general-purpose LLM applications with high reasoning and throughput demands.
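
To make "activates 47B parameters per token" more concrete, here is a minimal sketch of generic top-k expert routing, the basic idea behind MoE layers. It is not Baidu's router: the expert count, the scores, and the function names `softmax` and `route_token` are all hypothetical and chosen only for illustration.

```python
import math

def softmax(scores):
    """Convert raw router scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(router_scores, k):
    """Pick the k highest-scoring experts and renormalize their weights.

    Returns (expert_index, weight) pairs; only these experts would run
    for the token, while the remaining experts stay idle.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    kept = sum(probs[i] for i in top)
    return [(i, probs[i] / kept) for i in top]

# Hypothetical scores for 8 toy experts; not taken from the real model.
scores = [0.1, 2.0, -1.0, 0.5, 1.5, -0.3, 0.0, 0.9]
chosen = route_token(scores, k=2)
print(chosen)  # two (index, weight) pairs

# Share of the 300B total parameters that is active for each token.
print(f"{47 / 300:.1%} of parameters active per token")
```

Because only the selected experts run, per-token compute tracks the 47B active parameters rather than the full 300B.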
| Specification | Value |
|---|---|
| Provider | Baidu |
| Model Type | Large Language Model (LLM) |
| Architecture | Mixture-of-Experts (MoE), 300B total / 47B active parameters |
| Context Window | 123,000 tokens |
| Max Output | 12,000 tokens |
| Input | Text |
| Output | Text |
| Vision | Not supported |
| Function Calling | Supported |

| Token Type | Cost per Million Tokens (USD) |
|---|---|
| Input | $0.28 |
| Output | $1.10 |
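
The specification table lists function calling as supported. The sketch below shows one way this might look, assuming the endpoint accepts the OpenAI-style `tools` parameter (not confirmed by this page). The `get_weather` tool and the `run_tool_call` helper are hypothetical names invented for the example.

```python
import json

# Hypothetical tool, described in the OpenAI "tools" schema format.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

def get_weather(city):
    """Stand-in implementation; a real one would query a weather service."""
    return {"city": city, "forecast": "sunny"}

# Maps tool names the model may request to local Python functions.
LOCAL_FUNCTIONS = {"get_weather": get_weather}

def run_tool_call(name, arguments_json):
    """Execute one tool call given a function name and JSON-encoded arguments."""
    args = json.loads(arguments_json)
    return LOCAL_FUNCTIONS[name](**args)

# Request side (needs the `openai` package and a valid key):
#   response = client.chat.completions.create(
#       model="baidu/ernie-4.5-300b-a47b",
#       messages=[{"role": "user", "content": "Weather in Rome?"}],
#       tools=[WEATHER_TOOL],
#   )
#   call = response.choices[0].message.tool_calls[0]
#   result = run_tool_call(call.function.name, call.function.arguments)

print(run_tool_call("get_weather", '{"city": "Rome"}'))
```

Keeping the dispatcher separate from the network call makes it easy to test the tool logic without spending tokens.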
Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: baidu/ernie-4.5-300b-a47b
curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "baidu/ernie-4.5-300b-a47b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
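
The curl call returns raw JSON rather than an SDK object. As a rough sketch, the reply text can be pulled out as shown below, assuming the body follows the OpenAI chat completion shape implied by `response.choices[0].message.content`. The `extract_reply` helper, the sample body, and the `"error"` key handling are illustrative assumptions, not captured API output.

```python
import json

def extract_reply(raw_body):
    """Return the assistant text from a chat/completions JSON response body."""
    data = json.loads(raw_body)
    if "error" in data:
        # Assumption: failures come back as a JSON object under an "error" key.
        raise RuntimeError(f"API error: {data['error']}")
    return data["choices"][0]["message"]["content"]

# Hand-written sample body for illustration, not captured from the live API.
sample = '{"choices": [{"message": {"role": "assistant", "content": "Hi there!"}}]}'
print(extract_reply(sample))  # prints: Hi there!
```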
Access Ernie 4.5 300b A47b through our unified API: OpenAI-compatible, no cold starts, transparent pricing.
Pricing on WaveSpeedAI: $0.28 per million input tokens and $1.10 per million output tokens. Prompt caching and batch processing are billed separately and reduce the effective cost on long, repetitive workloads.
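
For a quick sense of what these rates mean per request, the following sketch estimates cost from token counts. It uses only the two per-million prices quoted above and ignores prompt caching and batch processing; `estimate_cost` is a hypothetical helper name.

```python
INPUT_USD_PER_M = 0.28   # USD per million input tokens, as quoted above
OUTPUT_USD_PER_M = 1.10  # USD per million output tokens, as quoted above

def estimate_cost(input_tokens, output_tokens):
    """Estimate the USD cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000

# Example: a 50,000-token prompt with a 2,000-token answer.
cost = estimate_cost(50_000, 2_000)
print(f"${cost:.4f}")  # prints: $0.0162
```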
Ernie 4.5 300b A47b supports up to 123K tokens of context and up to 12K output tokens per request.
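
These two limits can be turned into a simple pre-flight check. The sketch below picks a value for the `max_tokens` request parameter under the assumption that prompt and output tokens share the same 123K window, which this page does not state explicitly. `output_budget` is a hypothetical helper, and counting the prompt's tokens is left to the caller.

```python
CONTEXT_LIMIT = 123_000  # tokens of context, as stated above
MAX_OUTPUT = 12_000      # output tokens per request, as stated above

def output_budget(prompt_tokens, requested_output=MAX_OUTPUT):
    """Return a safe max_tokens value, or raise if the prompt cannot fit.

    Assumes prompt and output tokens share the context window.
    """
    if prompt_tokens >= CONTEXT_LIMIT:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = CONTEXT_LIMIT - prompt_tokens
    return min(requested_output, MAX_OUTPUT, remaining)

print(output_budget(100_000))  # prints: 12000
print(output_budget(118_000))  # prints: 5000
```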
WaveSpeedAI exposes Ernie 4.5 300b A47b through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key; no other code changes are needed.
Sign in to WaveSpeedAI, create an API key under Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with the model ID shown above. New accounts receive free credits to test Ernie 4.5 300b A47b.
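
For setups without the OpenAI SDK, the same request can be assembled with the Python standard library alone. This is a sketch under assumptions: `WAVESPEED_API_KEY` is a hypothetical environment variable name chosen here, and `build_chat_request` only builds the request so it can be inspected before anything is sent.

```python
import json
import os
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"
MODEL_ID = "baidu/ernie-4.5-300b-a47b"

def build_chat_request(prompt, api_key=None):
    """Build (but do not send) a POST request for the chat/completions endpoint."""
    # WAVESPEED_API_KEY is a hypothetical variable name chosen for this sketch.
    key = api_key or os.environ.get("WAVESPEED_API_KEY", "YOUR_API_KEY")
    body = {"model": MODEL_ID, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {key}",
        },
        method="POST",
    )

req = build_chat_request("Hello!", api_key="YOUR_API_KEY")
print(req.get_method(), req.full_url)

# To send it with a real key:
#   with urllib.request.urlopen(req) as resp:
#       print(resp.read().decode("utf-8"))
```

Reading the key from the environment keeps it out of source control; the placeholder fallback is there only so the sketch runs as written.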