WaveSpeedAI APIElevenlabsElevenlabs Eleven V3

Elevenlabs Eleven V3

Elevenlabs Eleven V3

Playground

Try it on WavespeedAI!

ElevenLabs eleven-v3 is a text-to-speech model available as a hosted endpoint; requests cost $0.1 per 1000 characters. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

ElevenLabs — Eleven V3 Text-to-Speech

Eleven V3 converts written text into natural, expressive speech using ElevenLabs’ advanced deep-learning speech synthesis technology. It delivers clear pronunciation, smooth pacing, and lifelike emotion — ideal for voiceovers, narrations, podcasts, and digital content.


🎧 Key Features

  • High Naturalness — produces human-like intonation, timing, and articulation.
  • Multi-Language Support — generate voices in multiple global languages with automatic accent adaptation.
  • Customizable Parameters — control tone and realism via similarity and stability settings.
  • Speaker Boost — enhances clarity for English numerals, times, and measurements.
  • Wide Voice Library — choose from a rich set of built-in voices (see voice list here).

💰 Pricing

  • Just $0.1 per 1,000 characters !!!

Billing Rules

  • If the input length is less than 1000 characters, it will be counted as 1000 characters to pay.

🚀 How to Use

  1. Enter your text in the text field (up to 5,000 characters).

  2. Select a voice from the voice_id dropdown (e.g., Alice, Elli, George).

  3. Adjust optional parameters:

    • similarity: 0–1 (higher = closer to base voice tone)
    • stability: 0–1 (higher = consistent delivery)
    • use_speaker_boost: enhances number reading in English.
  4. Click Run to generate and preview your audio.


📝 Notes

  • Audio output is returned in MP3 format.
  • Works best for English, but supports multiple languages.
  • Long texts may require splitting for stable generation.
  • Ensure text avoids ambiguous punctuation for optimal rhythm and tone.
  • If the model returns an error message like incorrect voice ID, please modify the code according to the table mentioned earlier voice list here.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/eleven-v3" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "text": "Welcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation.",
    "voice_id": "Alice",
    "similarity": 1,
    "stability": 0.5,
    "use_speaker_boost": true
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
textstringYesWelcome to our advanced text-to-speech system! Experience high-quality voice synthesis with natural pronunciation and clear articulation.-Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
voice_idstringYesAliceAria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, BillThe voice to use for speech generation
similaritynumberNo10.00 ~ 1.00High enhancement boosts overall voice clarity and target speaker similarity. Very high values can cause artifacts, so adjusting this setting to find the optimal value is encouraged.
stabilitynumberNo0.50.00 ~ 1.00Voice stability (0-1) Default value: 0.5
use_speaker_boostbooleanNotrue-This parameter supports English text normalization, which improves performance in number-reading scenarios.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.