ElevenLabs Multilingual V2
ElevenLabs Multilingual V2 is a multilingual text-to-speech model priced at $0.1 per 1,000 characters. It is served through a ready-to-use REST inference API with no cold starts and affordable pricing.
Features
ElevenLabs — Multilingual V2 Text-to-Speech
Multilingual V2 converts written text into natural, expressive speech across multiple languages. It delivers clear pronunciation, smooth pacing, and lifelike tone—ideal for voiceovers, narration, learning content, product videos, and global customer support. See the list here.
Key Features
- High naturalness with humanlike intonation and timing
- Strong multilingual support and improved accent handling
- Tunable delivery via similarity and stability
- Speaker Boost for clearer English numerals, dates, and units
Pricing
- $0.1 per 1,000 characters
- Inputs shorter than 1,000 characters are billed as 1,000 characters.
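As a rough illustration of the pricing rule above, the sketch below estimates the charge for a script. It assumes that cost scales linearly with character count above the 1,000-character minimum; the page states only the rate and the minimum, so treat that as an assumption. `estimate_cost` is a hypothetical helper, not part of the API.

```python
from decimal import Decimal

RATE_PER_1000_CHARS = Decimal("0.1")  # USD, from the pricing list above
MIN_BILLED_CHARS = 1000               # shorter inputs are billed as 1,000 characters


def estimate_cost(text: str) -> Decimal:
    """Estimate the USD charge for synthesizing `text`.

    Assumption: above the minimum, cost is proportional to character count.
    """
    billed_chars = max(len(text), MIN_BILLED_CHARS)
    return RATE_PER_1000_CHARS * billed_chars / 1000


if __name__ == "__main__":
    print(estimate_cost("Hello"))     # billed at the 1,000-character minimum
    print(estimate_cost("a" * 2500))  # billed proportionally under the assumption
```

Decimal arithmetic is used here only to avoid floating-point rounding in currency values.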
How to Use
- Enter your script in the text field.
- Choose a voice_id from the built-in catalog or your custom voices. See the voice list for options.
- Optional controls:
  - similarity: 0–1 (higher = closer to the base voice timbre)
  - stability: 0–1 (higher = more consistent delivery)
  - use_speaker_boost: improves English number and unit reading
- Click Run to synthesize and preview your audio.
Notes
- Use clear punctuation and split very long text into shorter segments for the most stable prosody.
- voice_id must be valid; if you see a voice-ID error, pick one from the official list linked above.
- Speaker Boost is especially helpful for financial, time, and measurement reads in English.
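The advice to split very long text into shorter segments could be sketched as follows. This is a minimal example, not a recommended implementation: `split_script` is a hypothetical helper, the 1,000-character default is an arbitrary choice, and sentence boundaries are detected with a simple punctuation rule that will not suit every language.

```python
import re


def split_script(text: str, max_chars: int = 1000) -> list[str]:
    """Split `text` into segments of at most `max_chars` characters.

    Breaks after sentence-ending punctuation (., !, ?) where possible and
    hard-wraps any single sentence that is longer than `max_chars`.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    segments: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut into fixed-size pieces.
        while len(sentence) > max_chars:
            if current:
                segments.append(current)
                current = ""
            segments.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            segments.append(current)
            current = sentence
    if current:
        segments.append(current)
    return segments
```

Because inputs shorter than 1,000 characters are billed as 1,000 characters, very small segments may raise the total cost, so segment size is a trade-off between prosody and price.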
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/multilingual-v2" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"text": "Hello from the Multilingual V2 model.",
"voice_id": "Alice",
"similarity": 1,
"stability": 0.5,
"use_speaker_boost": true
}'
# Get the result (requestId is the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
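The same two calls can be sketched in Python using only the standard library. Several things here are assumptions rather than documented behavior: that the `data.id` field of the submit response is the request ID used in the result URL, that the result endpoint returns the `data.status`, `data.outputs`, and `data.error` fields listed under Response Parameters, and that polling once per second for up to 60 attempts is adequate. The function names are hypothetical.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def build_submit_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that submits a synthesis task."""
    return urllib.request.Request(
        f"{API_BASE}/elevenlabs/multilingual-v2",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build (but do not send) the GET request that fetches a task's result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def synthesize(api_key: str, payload: dict,
               poll_interval: float = 1.0, max_polls: int = 60) -> list:
    """Submit a task, then poll until it completes; returns the output URLs."""
    task = send(build_submit_request(api_key, payload))["data"]
    for _ in range(max_polls):
        result = send(build_result_request(api_key, task["id"]))["data"]
        if result["status"] == "completed":
            return result["outputs"]
        if result["status"] == "failed":
            raise RuntimeError(result.get("error") or "task failed")
        time.sleep(poll_interval)
    raise TimeoutError("task did not complete within the polling window")


# Example usage (requires a valid key in the WAVESPEED_API_KEY environment variable):
#   import os
#   urls = synthesize(os.environ["WAVESPEED_API_KEY"],
#                     {"text": "Hello.", "voice_id": "Alice"})
```

Separating request construction from sending keeps the network call in one place, which makes the request shape easy to inspect without contacting the service.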
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| text | string | Yes | - | - | Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s). |
| voice_id | string | Yes | Alice | Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, Bill | The voice to use for speech generation |
| similarity | number | No | 1 | 0.00 ~ 1.00 | Higher values boost overall voice clarity and similarity to the target speaker. Very high values can cause artifacts, so experiment to find the best value. |
| stability | number | No | 0.5 | 0.00 ~ 1.00 | Voice stability. Higher values give more consistent delivery. |
| use_speaker_boost | boolean | No | true | - | Enables English text normalization, which improves how numbers and units are read. |
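
The constraints in the table above can be checked on the client before a request is sent. The sketch below is one possible approach; `build_payload` and its `custom_voice` flag are hypothetical, and the service may apply rules beyond those listed here.

```python
BUILT_IN_VOICES = {
    "Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River",
    "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris",
    "Brian", "Daniel", "Lily", "Bill",
}
MAX_TEXT_CHARS = 10000


def build_payload(text: str, voice_id: str = "Alice", similarity: float = 1.0,
                  stability: float = 0.5, use_speaker_boost: bool = True,
                  custom_voice: bool = False) -> dict:
    """Validate the request parameters and return a JSON-serializable payload.

    Set `custom_voice=True` to skip the built-in voice check when using a
    custom voice ID (hypothetical flag for this sketch).
    """
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if not custom_voice and voice_id not in BUILT_IN_VOICES:
        raise ValueError(f"unknown built-in voice_id: {voice_id!r}")
    for name, value in (("similarity", similarity), ("stability", stability)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    return {
        "text": text,
        "voice_id": voice_id,
        "similarity": similarity,
        "stability": stability,
        "use_speaker_boost": use_speaker_boost,
    }
```

Calling `build_payload("Hi")` returns a payload that uses the documented defaults, while an out-of-range value such as `stability=1.5` raises `ValueError` before any network call is made.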
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
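
To show how the fields above might be consumed, the sketch below converts a decoded response into a small typed record. `Prediction` and `parse_prediction` are hypothetical names, and the sample response uses made-up values that only follow the shape of the table; it is not real API output.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Prediction:
    id: str
    status: str            # created, processing, completed, or failed
    outputs: list          # URLs; empty unless status is "completed"
    error: str             # empty if no error occurred
    created_at: datetime
    inference_ms: Optional[int]


def parse_prediction(response: dict) -> Prediction:
    """Convert a decoded JSON response into a Prediction record."""
    data = response["data"]
    # Replace the trailing "Z" so older Python versions can parse the timestamp.
    created = datetime.fromisoformat(data["created_at"].replace("Z", "+00:00"))
    return Prediction(
        id=data["id"],
        status=data["status"],
        outputs=list(data.get("outputs") or []),
        error=data.get("error") or "",
        created_at=created,
        inference_ms=(data.get("timings") or {}).get("inference"),
    )


# Illustrative response with made-up values.
SAMPLE_RESPONSE = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "example-task-id",
        "status": "completed",
        "outputs": ["https://example.com/audio.mp3"],
        "error": "",
        "created_at": "2023-04-01T12:34:56.789Z",
        "timings": {"inference": 1234},
    },
}

if __name__ == "__main__":
    prediction = parse_prediction(SAMPLE_RESPONSE)
    print(prediction.status, prediction.outputs)
```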