Alibaba Qwen3 Tts Flash

Playground

Alibaba Qwen3 TTS Flash: Low-latency Text-to-Speech for English and Chinese with multiple voices, ideal for real-time dialogue. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Alibaba Qwen3 TTS Flash — Fast Text-to-Speech

Qwen3 TTS Flash is Alibaba’s low-latency, natural-sounding Text-to-Speech model that supports English and Chinese with multiple voice styles. It is designed for real-time conversations, product narration, and short-form video dubbing.

Highlights

Low latency / high concurrency for real-time interaction
Multi-language / multi-style voices (English/Chinese priority)
Parameter control: speed, pitch, volume, speaker (voice_id), emotion
Production-ready: stable output, easy integration, common audio formats

Input & Parameters

text (string, required): The text to synthesize (recommended < 2000 characters per request)
voice_id (string, optional): Voice style ID (e.g., qwen-female-1, qwen-male-1; see platform docs for the full list)
language (string, optional): Language code (en, zh)
speed (number, optional): Speaking rate, default 1.0 (range 0.5–2.0)
pitch (number, optional): Pitch adjustment, default 0
volume (number, optional): Output gain, default 0
emotion (string, optional): Voice emotion/style, e.g., neutral, happy, sad
sample_rate (int, optional): Sample rate, default 22050 (e.g., 16000/22050/24000/44100)
format (string, optional): Output format, default mp3 (supports mp3, wav, ogg)

Note: The available speakers and parameter ranges depend on the platform configuration.

Pricing

Formula: total_price = base_price * text_length / 1000
Current base_price: 1000 (unit depends on platform configuration)

Example

{ “model”: “alibaba/qwen3-tts-flash”, “input”: { “text”: “Hello, welcome to WaveSpeedAI!”, “voice_id”: “qwen-female-1”, “language”: “en”, “speed”: 1.0, “format”: “mp3” } }

Use Cases

Real-time conversational agents / voice replies
Short-form video, advertising, and e-commerce dubbing
App/IoT voice prompts and announcements
Education, customer service, and knowledge base narration

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/qwen-image/translate" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "voice": "Cherry",
    "language_type": "Auto"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
text	string	Yes	-	-	Text to translate
voice	string	Yes	Cherry	Cherry, Ethan, Nofish, Jennifer, Ryan, Katerina, Elias, Jada, Dylan, Sunny, li, Marcus, Roy, Peter, Rocky, Kiki, Eric	Voice name for translation
language_type	string	No	Auto	Auto, Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, Russian, Thai	Language type for translation

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	object	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Alibaba Qwen Image Translate Alibaba Wan 2.1 I2V Plus 720p