openai/gpt-4o-audio-preview
128,000 context · $2.50/M input tokens · $10.00/M output tokens
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
Pay-per-use
Nessun costo iniziale, paga solo per ciò che usi
Usa i seguenti esempi di codice per integrare la nostra API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="openai/gpt-4o-audio-preview",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)The gpt-4o-audio-preview model adds support for audio inputs as prompts
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
| Specification | Value |
|---|---|
| Provider | Openai |
| Model Type | Large Language Model (LLM) |
| Architecture | N/A |
| Context Window | 128000 tokens |
| Max Output | 16384 tokens |
| Input | Text |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Token Type | Cost per Million Tokens |
|---|---|
| Input | $2.5 |
| Output | $10.0 |
Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: openai/gpt-4o-audio-preview
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="openai/gpt-4o-audio-preview",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "openai/gpt-4o-audio-preview",
"messages": [{"role": "user", "content": "Hello!"}]
}'
openai/gpt-4o-audio-preview
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
Input
$2.5 /M
Output
$10 /M
Contesto
128K
Output max
16K
Uso strumenti
Supportato
Accedi a GPT 4o Audio Preview tramite la nostra API unificata — compatibile con OpenAI, senza cold start, prezzi trasparenti.
Prezzi su WaveSpeedAI: $2.50 per milione di token in input e $10.00 per milione di token in output. Prompt caching e batch processing sono fatturati separatamente e riducono il costo effettivo su carichi lunghi e ripetitivi.
GPT 4o Audio Preview supporta fino a 128K token di contesto e fino a 16K token di output per richiesta.
Sì. WaveSpeedAI espone GPT 4o Audio Preview tramite un endpoint compatibile con OpenAI all'indirizzo https://llm.wavespeed.ai/v1. Punta l'SDK ufficiale di OpenAI a questa base URL con la tua API key WaveSpeedAI — senza altre modifiche al codice.
Accedi a WaveSpeedAI, crea una API key in Access Keys, poi invia una richiesta a https://llm.wavespeed.ai/v1/chat/completions con il model id mostrato sopra. I nuovi account ricevono crediti gratuiti per testare GPT 4o Audio Preview.