就绪
您的请求将花费 $0.035 每次运行。
使用 $1 您可以运行此模型大约 28 次。
Inworld Realtime TTS 2 converts text into natural-sounding speech with low-latency generation and flexible voice controls. It supports multiple output audio formats and lets you adjust speaking rate and temperature for different delivery styles.
Low-latency text-to-speech Generate speech quickly for interactive apps, assistants, and real-time voice experiences.
Natural voice output Create smooth, human-like speech from plain text with selectable voices.
Flexible voice controls Adjust speaking rate and temperature to better match tone, pacing, and delivery style.
Multiple output formats
Export audio in MP3, LINEAR16, OGG_OPUS, FLAC, or WAV depending on your workflow.
Production-ready API Access the model through a realtime-friendly API for apps, agents, games, and voice products.
| Parameter | Required | Description |
|---|---|---|
| text | Yes | Input text to convert into speech. |
| voice_id | No | Voice selection for the generated speech, such as Julia. |
| speaking_rate | No | Controls how fast the voice speaks. Default: 1. |
| temperature | No | Controls variation and expressiveness in the generated speech. Default: 1. |
| output_format | No | Output audio format: MP3, LINEAR16, OGG_OPUS, FLAC, or WAV. |
MP3, LINEAR16, OGG_OPUS, FLAC, or WAV.Welcome to our product demo. Today we will walk through the key features, explain how the workflow operates, and show how quickly you can integrate voice output into your application.
| Text Length | Cost |
|---|---|
| 1–1000 chars | $0.035 |
| 1001–2000 chars | $0.070 |
| 2001–3000 chars | $0.105 |
| 3001–4000 chars | $0.140 |
| 4001–5000 chars | $0.175 |
text.1,000-character block.1,000 characters adds $0.035.voice_id, speaking_rate, temperature, and output_format do not affect pricing.speaking_rate to match the use case, such as slower for tutorials and faster for assistants.temperature when you want more variation in delivery style.MP3 for broad compatibility, and use lossless formats like WAV or FLAC when audio quality matters more.text is the only required field.MP3, LINEAR16, OGG_OPUS, FLAC, and WAV.