Qwen3 Tts Text To Speech
Playground
Try it on WavespeedAI!Qwen3 TTS: Multi-language, multi-voice text-to-speech synthesis with style control. Supports 11 languages and 9 voice characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Features
Qwen3-TTS Text-to-Speech
Qwen3-TTS Text-to-Speech is a high-quality text-to-speech model with a curated selection of preset voices. Choose from 9 distinct voices spanning different genders and speaking styles, with optional style instructions to fine-tune the delivery.
Why Choose This?
-
Curated voice library 9 preset voices with distinct personalities — from professional narrators to friendly conversational tones.
-
Style instruction support Guide the speaking style with natural language instructions for customized delivery.
-
Auto language detection Set language to “auto” and the model intelligently detects the language from your text.
-
Simple and fast Straightforward interface — select a voice, enter text, and generate.
Parameters
| Parameter | Required | Description |
|---|---|---|
| text | Yes | The text to convert to speech |
| language | Yes | Language code or “auto” for automatic detection |
| voice | Yes | Preset voice to use (see Available Voices below) |
| style_instruction | No | Natural language guidance for speaking style |
Available Voices
| Voice | Description |
|---|---|
| Vivian | Female voice |
| Serena | Female voice |
| Ono_Anna | Female voice |
| Sohee | Female voice |
| Uncle_Fu | Male voice |
| Dylan | Male voice |
| Eric | Male voice |
| Ryan | Male voice |
| Aiden | Male voice |
Style Instruction Examples
- “Speak slowly and calmly, like a meditation guide”
- “Energetic and enthusiastic, like a sports announcer”
- “Professional and clear, suitable for corporate presentations”
- “Warm and friendly, like talking to a close friend”
How to Use
- Enter your text — write or paste the content you want to convert to speech.
- Select language — choose the target language or use “auto” for automatic detection.
- Choose a voice — select from the 9 available preset voices.
- Add style instruction (optional) — describe how you want the voice to sound.
- Run — submit and download your audio file.
Pricing
| Text Length | Cost |
|---|---|
| Under 1,000 chars | $0.02 |
| 1,000+ chars | $0.02 per 1,000 characters |
Billing Rules
- Minimum charge: $0.02 (for texts under 1,000 characters)
- For longer texts: $0.02 × (character count / 1,000)
Best Use Cases
- Video Voiceovers — Generate professional narration for YouTube, ads, or explainer videos.
- Audiobook Production — Convert manuscripts into natural-sounding narration.
- Podcasts & Broadcasting — Create consistent voice content without recording equipment.
- E-learning & Training — Produce clear, engaging audio for educational materials.
- Accessibility — Convert written content to audio for visually impaired users.
Pro Tips
- Try different voices to find the best match for your content type.
- Use style_instruction to adjust tone without changing the voice itself.
- Match female voices (Vivian, Serena, Ono_Anna, Sohee) for softer content; male voices (Uncle_Fu, Dylan, Eric, Ryan, Aiden) for authoritative content.
- Test with short text first to preview how the voice sounds before generating longer content.
Related Models
- Qwen3-TTS Voice Clone — Clone any voice from a short audio sample.
- Qwen3-TTS Voice Design — Design custom voices using natural language descriptions.
Notes
- All 9 voices are optimized for natural, clear speech output.
- Style instructions work best when they describe emotion, pace, or tone rather than technical audio settings.
- For best quality, match the language parameter to your text content.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen3-tts/text-to-speech" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"language": "auto",
"voice": "Vivian"
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| text | string | Yes | - | - | The text content to convert into speech |
| language | string | Yes | auto | auto, Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, Russian | Language of the speech output (use 'auto' for automatic detection) |
| voice | string | Yes | Vivian | Vivian, Serena, Ono_Anna, Sohee, Uncle_Fu, Dylan, Eric, Ryan, Aiden | Voice character to use for speech synthesis |
| style_instruction | string | No | - | - | Optional instruction to control the speaking style, tone, or emotion |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |