ElevenLabs Multilingual V2

ElevenLabs Multilingual V2 is a multilingual text-to-speech model priced at $0.1 per 1,000 characters. It is served through a ready-to-use REST inference API with no cold starts.

Features

ElevenLabs — Multilingual V2 Text-to-Speech

Multilingual V2 converts written text into natural, expressive speech across multiple languages. It delivers clear pronunciation, smooth pacing, and lifelike tone—ideal for voiceovers, narration, learning content, product videos, and global customer support. See the list here.

Key Features

High naturalness with humanlike intonation and timing
Strong multilingual support and improved accent handling
Tunable delivery via similarity and stability
Speaker Boost for clearer English numerals, dates, and units

Pricing

$0.1 per 1,000 characters
Inputs shorter than 1,000 characters are billed as 1,000 characters.
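
The pricing rule above can be sketched as a small estimator. This is a minimal illustration, not official billing logic: it assumes that text longer than 1,000 characters is billed pro rata (the page does not say whether usage is rounded up to the next 1,000), and that a "character" corresponds to one Python string element. `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

PRICE_PER_1000_CHARS = Decimal("0.1")  # USD, from the pricing line above
MIN_BILLED_CHARS = 1000                # shorter inputs are billed as 1,000 characters


def estimate_cost(text: str) -> Decimal:
    """Estimate the cost in USD of synthesizing `text`.

    Assumption: usage above the minimum is billed pro rata, not rounded up.
    """
    billed_chars = max(len(text), MIN_BILLED_CHARS)
    return PRICE_PER_1000_CHARS * billed_chars / 1000


print(estimate_cost("Hello, world!"))  # short input, billed at the 1,000-character minimum
```

Decimal arithmetic is used so that the estimate does not pick up binary floating-point rounding noise.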

How to Use

Enter your script in the text field.
Choose a voice_id from the built-in catalog or your custom voices. See the voice list for options.
Optional controls:
• similarity: 0–1 (higher = closer to the base voice timbre)
• stability: 0–1 (higher = more consistent delivery)
• use_speaker_boost: improves English number and unit reading
Click Run to synthesize and preview your audio.

Notes

Use clear punctuation and split very long text into shorter segments for the most stable prosody.
voice_id must be valid; if you see a voice-ID error, pick one from the official list linked above.
Speaker Boost is especially helpful for financial, time, and measurement reads in English.
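
The advice to split very long text can be sketched as follows. This is one possible approach, not a method prescribed by the service: it splits on sentence-ending punctuation and packs sentences into segments of at most `max_chars` characters. The function name `split_text` and the default limit of 1,000 are assumptions chosen for illustration.

```python
import re
from typing import List


def split_text(text: str, max_chars: int = 1000) -> List[str]:
    """Split text into segments of at most max_chars, breaking at sentence ends.

    A single sentence longer than max_chars is kept whole rather than cut mid-sentence.
    """
    # Break after '.', '!' or '?' when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


print(split_text("One. Two. Three.", max_chars=9))
```

Because inputs shorter than 1,000 characters are billed as 1,000 characters, very small segments raise the total cost, so the segment size is a trade-off between prosody and price.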

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/multilingual-v2" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "text": "Hello, this is a test of the text-to-speech API.",
    "voice_id": "Alice",
    "similarity": 1,
    "stability": 0.5,
    "use_speaker_boost": true
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
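
The same submit-then-poll flow can be sketched in Python with only the standard library. The endpoint URLs and the `data.id`, `data.status`, `data.outputs`, and `data.error` fields come from this page; the function names, the one-second polling interval, the 60-poll limit, and the injectable `request` parameter are assumptions made for illustration.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def http_request(method, url, api_key, body=None):
    """Send a JSON request with a Bearer token and return the decoded JSON reply."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    req = urllib.request.Request(url, data=data, method=method)
    req.add_header("Authorization", f"Bearer {api_key}")
    if data is not None:
        req.add_header("Content-Type", "application/json")
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def submit_task(payload, api_key, request=http_request):
    """Submit a synthesis task and return its task ID (data.id)."""
    reply = request("POST", f"{API_BASE}/elevenlabs/multilingual-v2", api_key, payload)
    return reply["data"]["id"]


def wait_for_result(task_id, api_key, request=http_request, interval=1.0, max_polls=60):
    """Poll the result endpoint until the task completes, fails, or polling runs out."""
    url = f"{API_BASE}/predictions/{task_id}/result"
    for _ in range(max_polls):
        data = request("GET", url, api_key)["data"]
        if data["status"] == "completed":
            return data["outputs"]
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(interval)  # still 'created' or 'processing'
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")


# Example usage (performs real network calls, so it is left commented out):
# import os
# key = os.environ["WAVESPEED_API_KEY"]
# task_id = submit_task({"text": "Hello.", "voice_id": "Alice"}, key)
# print(wait_for_result(task_id, key))
```

Passing the HTTP function in as `request` keeps the polling logic testable without network access.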

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
text	string	Yes	-	-	Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
voice_id	string	Yes	Alice	Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, Bill	The voice to use for speech generation
similarity	number	No	1	0.00 ~ 1.00	Higher values boost overall voice clarity and similarity to the target speaker. Very high values can cause artifacts, so experiment to find the best setting.
stability	number	No	0.5	0.00 ~ 1.00	Voice stability. Higher values produce more consistent delivery.
use_speaker_boost	boolean	No	true	-	Enables English text normalization, which improves the reading of numbers.
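
As a rough sketch, the request parameters in the table above can be checked client-side before a task is submitted. `build_payload` and `pause` are hypothetical helper names; the ranges and defaults are taken from the table, while formatting the pause duration with two decimals is an assumption. The voice ID is not checked against the built-in list because custom voices are also allowed.

```python
def pause(seconds: float) -> str:
    """Return a pause marker of the form <#x#> for use between words.

    Assumption: the duration is written with two decimal places.
    """
    if not 0.01 <= seconds <= 99.99:
        raise ValueError("pause duration must be between 0.01 and 99.99 seconds")
    return f"<#{seconds:.2f}#>"


def build_payload(text, voice_id="Alice", similarity=1.0, stability=0.5,
                  use_speaker_boost=True):
    """Validate the parameters against the documented ranges and build the request body."""
    if not text:
        raise ValueError("text is required")
    if len(text) > 10000:
        raise ValueError("text must be at most 10000 characters")
    if not voice_id:
        raise ValueError("voice_id is required")
    for name, value in (("similarity", similarity), ("stability", stability)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1")
    return {
        "text": text,
        "voice_id": voice_id,
        "similarity": similarity,
        "stability": stability,
        "use_speaker_boost": bool(use_speaker_boost),
    }


print(build_payload(f"Welcome back.{pause(0.5)}Your balance is $1,250.", stability=0.7))
```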

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier of the requested prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
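
A result response with the fields listed above could be turned into a typed object along these lines. The `Prediction` class and `parse_result` function are illustrative names, not part of any official SDK, and the sketch assumes `created_at` uses the ISO format shown in the example value.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Prediction:
    id: str
    status: str
    outputs: List[str]
    error: str
    created_at: Optional[datetime]
    inference_seconds: Optional[float]


def parse_result(response: dict) -> Prediction:
    """Extract the documented fields from a result response."""
    data = response["data"]
    created = data.get("created_at")
    inference_ms = (data.get("timings") or {}).get("inference")
    return Prediction(
        id=data["id"],
        status=data["status"],
        outputs=list(data.get("outputs") or []),
        error=data.get("error") or "",
        # A trailing 'Z' is rewritten as an explicit UTC offset for fromisoformat.
        created_at=datetime.fromisoformat(created.replace("Z", "+00:00")) if created else None,
        # timings.inference is reported in milliseconds.
        inference_seconds=inference_ms / 1000 if inference_ms is not None else None,
    )
```

Converting the timing to seconds and the timestamp to a `datetime` is a convenience choice; the raw values remain available in the original response.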
