WaveSpeedAI APIAlibabaAlibaba Qwen3 Tts Flash

Alibaba Qwen3 Tts Flash

Alibaba Qwen3 Tts Flash

Playground

Try it on WavespeedAI!

Alibaba Qwen3 TTS Flash is a low-latency Text-to-Speech (TTS) model that supports English and Chinese with multiple voice styles. It is ideal for real-time voice interaction, product narration, and short‑form video dubbing.

Features

Alibaba Qwen3 TTS Flash — Fast Text-to-Speech

Qwen3 TTS Flash is Alibaba’s low-latency, natural-sounding Text-to-Speech model that supports English and Chinese with multiple voice styles. It is designed for real-time conversations, product narration, and short-form video dubbing.

Highlights

  • Low latency / high concurrency for real-time interaction
  • Multi-language / multi-style voices (English/Chinese priority)
  • Parameter control: speed, pitch, volume, speaker (voice_id), emotion
  • Production-ready: stable output, easy integration, common audio formats

Input & Parameters

  • text (string, required): The text to synthesize (recommended < 2000 characters per request)
  • voice_id (string, optional): Voice style ID (e.g., qwen-female-1, qwen-male-1; see platform docs for the full list)
  • language (string, optional): Language code (en, zh)
  • speed (number, optional): Speaking rate, default 1.0 (range 0.5–2.0)
  • pitch (number, optional): Pitch adjustment, default 0
  • volume (number, optional): Output gain, default 0
  • emotion (string, optional): Voice emotion/style, e.g., neutral, happy, sad
  • sample_rate (int, optional): Sample rate, default 22050 (e.g., 16000/22050/24000/44100)
  • format (string, optional): Output format, default mp3 (supports mp3, wav, ogg)

Note: The available speakers and parameter ranges depend on the platform configuration.

Pricing

  • Formula: total_price = base_price * text_length / 1000
  • Current base_price: 1000 (unit depends on platform configuration)

Example

{ “model”: “alibaba/qwen3-tts-flash”, “input”: { “text”: “Hello, welcome to WaveSpeedAI!”, “voice_id”: “qwen-female-1”, “language”: “en”, “speed”: 1.0, “format”: “mp3” } }

Use Cases

  • Real-time conversational agents / voice replies
  • Short-form video, advertising, and e-commerce dubbing
  • App/IoT voice prompts and announcements
  • Education, customer service, and knowledge base narration

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/qwen-image/translate" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "voice": "Cherry",
    "language_type": "Auto"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
textstringYes--Text to translate
voicestringYesCherryCherry, Ethan, Nofish, Jennifer, Ryan, Katerina, Elias, Jada, Dylan, Sunny, li, Marcus, Roy, Peter, Rocky, Kiki, EricVoice name for translation
language_typestringNoAutoAuto, Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, Russian, ThaiLanguage type for translation

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.