ElevenLabs Eleven V3
ElevenLabs eleven-v3 is a text-to-speech model available as a hosted endpoint; requests cost $0.1 per 1,000 characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Features
ElevenLabs — Eleven V3 Text-to-Speech
Eleven V3 converts written text into natural, expressive speech using ElevenLabs’ advanced deep-learning speech synthesis technology. It delivers clear pronunciation, smooth pacing, and lifelike emotion — ideal for voiceovers, narrations, podcasts, and digital content.
🎧 Key Features
- High Naturalness: produces human-like intonation, timing, and articulation.
- Multi-Language Support: generates voices in multiple languages with automatic accent adaptation.
- Customizable Parameters: control tone and realism via the similarity and stability settings.
- Speaker Boost: enhances clarity for English numerals, times, and measurements.
- Wide Voice Library: choose from a rich set of built-in voices (see the voice list).
💰 Pricing
- $0.1 per 1,000 characters.
Billing Rules
- Inputs shorter than 1,000 characters are billed as 1,000 characters.
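The pricing and billing rule above can be sketched as a small cost estimator. This is a rough illustration, not an official billing formula: the function name `estimate_cost` is hypothetical, and it assumes that inputs above the 1,000-character minimum are billed in proportion to their length, which the text does not state.

```python
PRICE_PER_1000_CHARS = 0.1  # USD, from the pricing section
MIN_BILLED_CHARS = 1000     # shorter inputs are billed as 1,000 characters


def estimate_cost(text: str) -> float:
    """Estimate the USD cost of a single request.

    Assumption: above the minimum, the charge scales linearly
    with the character count (no rounding up to the next 1,000).
    """
    billed_chars = max(len(text), MIN_BILLED_CHARS)
    return billed_chars * PRICE_PER_1000_CHARS / 1000
```

Under these assumptions a 20-character prompt and a 1,000-character prompt cost the same, so batching short lines into one request may be cheaper than sending them separately.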
🚀 How to Use
- Enter your text in the `text` field (up to 5,000 characters).
- Select a voice from the `voice_id` dropdown (e.g., Alice, Elli, George).
- Adjust optional parameters:
  - similarity: 0–1 (higher = closer to base voice tone)
  - stability: 0–1 (higher = more consistent delivery)
  - use_speaker_boost: enhances number reading in English.
- Click Run to generate and preview your audio.
📝 Notes
- Audio output is returned in MP3 format.
- Works best for English, but supports multiple languages.
- Long texts may require splitting for stable generation.
- Avoid ambiguous punctuation in the text for better rhythm and tone.
- If the model returns an error such as an incorrect voice ID, change the voice_id in your request to one of the values in the voice list.
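The note about splitting long texts could be handled with a simple chunker like the one below. It is a minimal sketch: the function name `split_text` and the default `max_chars` value are illustrative choices, not limits taken from the service, and the sentence detection is a plain regular expression rather than a full tokenizer.

```python
import re


def split_text(text: str, max_chars: int = 1000) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    # Break after '.', '!' or '?' when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-wrapped.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be submitted as its own request and the resulting audio files joined afterwards.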
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/eleven-v3" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"text": "Welcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation.",
"voice_id": "Alice",
"similarity": 1,
"stability": 0.5,
"use_speaker_boost": true
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
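The same submit-then-query flow can be sketched in Python using only the standard library. The endpoint URLs and the `WAVESPEED_API_KEY` variable come from the curl commands above; the function names, the polling interval, and the timeout are hypothetical, and the code assumes the result endpoint returns the `data.status`, `data.outputs`, and `data.error` fields described in the response table.

```python
import json
import os
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(url, api_key, payload=None):
    """Build a POST request when a JSON payload is given, otherwise a GET."""
    headers = {"Authorization": f"Bearer {api_key}"}
    data = None
    if payload is not None:
        headers["Content-Type"] = "application/json"
        data = json.dumps(payload).encode("utf-8")
    method = "POST" if payload is not None else "GET"
    return urllib.request.Request(url, data=data, headers=headers, method=method)


def call(request):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


def text_to_speech(text, voice_id="Alice", api_key=None,
                   poll_interval=1.0, max_polls=60):
    """Submit a task, then poll until it completes; returns the output URLs."""
    api_key = api_key or os.environ["WAVESPEED_API_KEY"]
    payload = {"text": text, "voice_id": voice_id}
    submitted = call(build_request(f"{API_BASE}/elevenlabs/eleven-v3", api_key, payload))
    request_id = submitted["data"]["id"]
    result_url = f"{API_BASE}/predictions/{request_id}/result"
    for _ in range(max_polls):
        data = call(build_request(result_url, api_key))["data"]
        if data["status"] == "completed":
            return data["outputs"]
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(poll_interval)  # still created or processing
    raise TimeoutError("task did not complete within the polling budget")
```

Calling `text_to_speech("Hello there.")` with the API key set in the environment would return the list of audio URLs once the task completes.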
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| text | string | Yes | Welcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation. | - | Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s). |
| voice_id | string | Yes | Alice | Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, Bill | The voice to use for speech generation |
| similarity | number | No | 1 | 0.00 ~ 1.00 | Higher values boost overall voice clarity and similarity to the target speaker. Very high values can cause artifacts, so experiment to find the best setting. |
| stability | number | No | 0.5 | 0.00 ~ 1.00 | Voice stability. Higher values give more consistent delivery. |
| use_speaker_boost | boolean | No | true | - | Enables English text normalization, which improves how numbers are read. |
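Checking a payload against the table before submitting can catch errors such as an unknown voice ID early. The sketch below mirrors the ranges and voice names listed in the table; the function name `validate_request` and its return format are hypothetical, and the server may apply additional rules not covered here.

```python
VOICES = {
    "Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum",
    "River", "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica",
    "Eric", "Chris", "Brian", "Daniel", "Lily", "Bill",
}
MAX_TEXT_CHARS = 10000  # maximum stated in the text parameter description


def validate_request(params: dict) -> list[str]:
    """Return a list of problems; an empty list means the payload looks valid."""
    problems = []
    text = params.get("text")
    if not isinstance(text, str) or not text:
        problems.append("text is required and must be a non-empty string")
    elif len(text) > MAX_TEXT_CHARS:
        problems.append(f"text exceeds {MAX_TEXT_CHARS} characters")
    if params.get("voice_id") not in VOICES:
        problems.append("voice_id must be one of the listed voices")
    for name in ("similarity", "stability"):
        value = params.get(name)
        if value is None:
            continue  # optional, server default applies
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        if not is_number or not 0 <= value <= 1:
            problems.append(f"{name} must be a number between 0 and 1")
    boost = params.get("use_speaker_boost")
    if boost is not None and not isinstance(boost, bool):
        problems.append("use_speaker_boost must be a boolean")
    return problems
```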
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
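A response shaped like the table above might be interpreted as follows. This is a sketch under the assumption that the fields behave exactly as described; the function name `audio_urls` is hypothetical.

```python
def audio_urls(response: dict) -> list[str]:
    """Return output URLs for a completed task, or an empty list if still pending.

    Raises RuntimeError when the task status is 'failed'.
    """
    data = response.get("data") or {}
    status = data.get("status")
    if status == "failed":
        raise RuntimeError(data.get("error") or "task failed without an error message")
    if status != "completed":
        # 'created' or 'processing': outputs are empty until completion.
        return []
    return list(data.get("outputs") or [])
```

Returning an empty list for pending tasks lets a polling loop treat "not ready yet" and "ready" uniformly, while failures surface as exceptions.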