ElevenLabs Eleven V3

ElevenLabs eleven-v3 is a text-to-speech model available as a hosted endpoint; requests cost $0.10 per 1,000 characters. It is exposed as a ready-to-use REST inference API with no cold starts.

Features

ElevenLabs — Eleven V3 Text-to-Speech

Eleven V3 converts written text into natural, expressive speech using ElevenLabs’ deep-learning speech synthesis technology. It delivers clear pronunciation, smooth pacing, and lifelike emotion, which suits voiceovers, narration, podcasts, and other digital content.

🎧 Key Features

High Naturalness — produces human-like intonation, timing, and articulation.
Multi-Language Support — generate voices in multiple global languages with automatic accent adaptation.
Customizable Parameters — control tone and realism via similarity and stability settings.
Speaker Boost — enhances clarity for English numerals, times, and measurements.
Wide Voice Library — choose from a rich set of built-in voices (see voice list here).

💰 Pricing

$0.10 per 1,000 characters.

Billing Rules

Inputs shorter than 1,000 characters are billed as 1,000 characters.
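The billing rule can be sketched as a small cost estimator. This is only an illustration: `estimate_cost` is a hypothetical helper, it assumes that characters beyond the 1,000-character minimum are billed proportionally (the page does not say whether usage is rounded up to the next thousand), and it assumes the service counts characters the same way Python's `len` does.

```python
PRICE_PER_1000_CHARS = 0.10  # USD, from the pricing section
MIN_BILLED_CHARS = 1000      # shorter inputs are billed as 1,000 characters


def estimate_cost(text: str) -> float:
    """Estimate the cost in USD of synthesizing `text`.

    Assumption: billing above the minimum is proportional to length.
    """
    billed_chars = max(len(text), MIN_BILLED_CHARS)
    return billed_chars / 1000 * PRICE_PER_1000_CHARS


if __name__ == "__main__":
    print(f"{estimate_cost('Hello, world!'):.2f}")  # short input, minimum charge
    print(f"{estimate_cost('x' * 2500):.2f}")       # longer input
```

Under these assumptions a 13-character request costs the same as a 1,000-character one, so batching short lines into a single request may be cheaper.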

🚀 How to Use

Enter your text in the text field (up to 5,000 characters).
Select a voice from the voice_id dropdown (e.g., Alice, Elli, George).
Adjust optional parameters:
- similarity: 0–1 (higher = closer to base voice tone)
- stability: 0–1 (higher = consistent delivery)
- use_speaker_boost: enhances number reading in English.
Click Run to generate and preview your audio.

📝 Notes

Audio output is returned in MP3 format.
Works best for English, but supports multiple languages.
Long texts may require splitting for stable generation.
Ensure text avoids ambiguous punctuation for optimal rhythm and tone.
If the model returns an error such as an incorrect voice ID, change voice_id to one of the supported values (see voice list here).
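For the note about splitting long texts, one possible approach is to cut at sentence boundaries so that each chunk stays under a chosen limit. The helper below is a sketch, not part of the API: `split_text` and its `max_chars` default are assumptions, and the sentence detection is a simple regular expression that will not handle every abbreviation or language.

```python
import re
from typing import List


def split_text(text: str, max_chars: int = 5000) -> List[str]:
    """Split `text` into chunks of at most `max_chars` characters,
    preferring to break after sentence-ending punctuation."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-wrapped.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for chunk in split_text("One. Two. Three.", max_chars=9):
        print(repr(chunk))
```

Each chunk can then be submitted as its own request and the resulting audio files concatenated afterwards.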

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/eleven-v3" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "text": "Welcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation.",
    "voice_id": "Alice",
    "similarity": 1,
    "stability": 0.5,
    "use_speaker_boost": true
}'

# Get the result (set requestId to the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
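The same submit-then-query flow can be sketched in Python using only the standard library. The two URLs and the `WAVESPEED_API_KEY` variable come from the curl example above; the function names, the polling interval, and the timeout are illustrative assumptions rather than documented behavior.

```python
import json
import os
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def result_url(task_id: str) -> str:
    """Build the result URL for a task ID (data.id in the submit response)."""
    return f"{API_BASE}/predictions/{task_id}/result"


def _call(url: str, api_key: str, payload=None) -> dict:
    """POST `payload` as JSON if given, otherwise GET; return the decoded body."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Authorization": f"Bearer {api_key}"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    method = "POST" if data is not None else "GET"
    request = urllib.request.Request(url, data=data, headers=headers, method=method)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def synthesize(text: str, voice_id: str = "Alice",
               poll_interval: float = 1.0, max_wait: float = 120.0) -> list:
    """Submit a task and poll until it completes; return the output URLs.

    poll_interval and max_wait are assumed values, not documented limits.
    """
    api_key = os.environ["WAVESPEED_API_KEY"]
    submitted = _call(f"{API_BASE}/elevenlabs/eleven-v3", api_key,
                      {"text": text, "voice_id": voice_id})
    task_id = submitted["data"]["id"]
    deadline = time.monotonic() + max_wait
    while time.monotonic() < deadline:
        data = _call(result_url(task_id), api_key)["data"]
        if data["status"] == "completed":
            return data["outputs"]
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(poll_interval)
    raise TimeoutError(f"task {task_id} did not finish within {max_wait} seconds")
```

Calling `synthesize("Hello from Eleven V3.")` with a valid key in the environment should return the list described under `data.outputs`.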

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
text	string	Yes	Welcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation.	-	Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
voice_id	string	Yes	Alice	Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, Bill	The voice to use for speech generation
similarity	number	No	1	0.00 ~ 1.00	High enhancement boosts overall voice clarity and target speaker similarity. Very high values can cause artifacts, so adjusting this setting to find the optimal value is encouraged.
stability	number	No	0.5	0.00 ~ 1.00	Voice stability; higher values give more consistent delivery.
use_speaker_boost	boolean	No	true	-	This parameter supports English text normalization, which improves performance in number-reading scenarios.
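The constraints in the table can be checked on the client before a request is sent. The sketch below is an illustration only: `build_payload` and `pause` are hypothetical helpers, the voice names and ranges are copied from the table, and the exact numeric format inside the `<#x#>` pause marker (two decimal places here) is an assumption.

```python
VOICES = {
    "Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River",
    "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris",
    "Brian", "Daniel", "Lily", "Bill",
}
MAX_CHARS = 10000  # maximum stated in the parameter table


def pause(seconds: float) -> str:
    """Return a pause marker of the form <#x#> (number format is assumed)."""
    if not 0.01 <= seconds <= 99.99:
        raise ValueError("pause must be between 0.01 and 99.99 seconds")
    return f"<#{seconds:.2f}#>"


def build_payload(text: str, voice_id: str = "Alice", similarity: float = 1.0,
                  stability: float = 0.5, use_speaker_boost: bool = True) -> dict:
    """Validate the arguments against the table and return the request body."""
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_CHARS:
        raise ValueError(f"text exceeds {MAX_CHARS} characters")
    if voice_id not in VOICES:
        raise ValueError(f"unknown voice_id: {voice_id!r}")
    for name, value in (("similarity", similarity), ("stability", stability)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1")
    return {
        "text": text,
        "voice_id": voice_id,
        "similarity": similarity,
        "stability": stability,
        "use_speaker_boost": use_speaker_boost,
    }


if __name__ == "__main__":
    print(build_payload("Hello " + pause(0.5) + " world", voice_id="George"))
```

Validating locally avoids spending a request on an input the service would reject, such as a misspelled voice name.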

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier of the prediction being retrieved (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
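A result response with the fields listed above could be unpacked as in the following sketch. `TaskResult` and `parse_result` are hypothetical names, and the sample body in the usage block is invented for illustration; only the field names are taken from the table.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TaskResult:
    task_id: str
    status: str
    outputs: List[str] = field(default_factory=list)
    error: str = ""
    inference_ms: Optional[int] = None

    @property
    def finished(self) -> bool:
        """True once the task has reached a terminal status."""
        return self.status in ("completed", "failed")


def parse_result(body: dict) -> TaskResult:
    """Extract the documented fields from a decoded result response."""
    data = body.get("data") or {}
    timings = data.get("timings") or {}
    return TaskResult(
        task_id=data.get("id", ""),
        status=data.get("status", ""),
        outputs=list(data.get("outputs") or []),
        error=data.get("error") or "",
        inference_ms=timings.get("inference"),
    )


if __name__ == "__main__":
    # Invented sample body, shaped like the table above.
    sample = {
        "code": 200,
        "message": "success",
        "data": {
            "id": "task-123",
            "status": "completed",
            "outputs": ["https://example.com/audio.mp3"],
            "error": "",
            "timings": {"inference": 850},
        },
    }
    print(parse_result(sample))
```

Missing or empty fields fall back to neutral defaults, so the same parser can be used while the task is still `created` or `processing`.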
