Elevenlabs Voice Changer

Playground

ElevenLabs Voice Changer transforms any audio into speech with a different voice while preserving the original speech patterns and timing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Features

ElevenLabs Voice Changer

ElevenLabs Voice Changer transforms any audio into speech with a different voice. Upload your audio and select a target voice — the model converts the speech while preserving the original timing, emotion, and delivery. Built on ElevenLabs’ industry-leading voice AI with best-in-class quality.

REST inference API, best performance, no cold starts, affordable pricing.

Why Choose This?

High-quality voice conversion Industry-leading voice transformation that maintains natural speech patterns and emotional delivery.
Multiple voice options Choose from a variety of pre-built voices to match your content needs.
Background noise removal Optional noise reduction to clean up audio before conversion.
Fast processing Optimized for quick turnaround with no cold starts.
Production-ready API Reliable REST endpoint with predictable per-minute pricing.

Parameters

Parameter	Required	Description
audio	Yes	Source audio file to transform (upload or URL)
voice_id	No	Target voice for conversion (default: Alice)
remove_background_noise	No	Remove background noise from the audio

How to Use

Upload your audio — drag and drop, paste a URL, or record directly.
Select voice — choose the target voice for conversion.
Enable noise removal (optional) — check to clean up background noise.
Run — submit and download the converted audio.

Pricing

Duration	Cost
Per minute	$0.30
30 seconds	$0.15
5 minutes	$1.50

Best Use Cases

Content Creation — Change voices for podcasts, videos, or audiobooks.
Dubbing — Convert speech to different voices for localization.
Privacy — Anonymize voice recordings while preserving content.
Character Voices — Create distinct character voices for storytelling.
Accessibility — Convert speech to preferred voice styles.

Pro Tips

Use clean, high-quality source audio for best results.
Enable background noise removal if your source has ambient sounds.
Shorter clips process faster — split long audio for parallel processing.
Test with different voices to find the best match for your content.

Notes

Maximum audio duration is 10 minutes per job.
For longer content, split into segments and process separately.
Supported audio formats include MP3, WAV, and other common formats.

ElevenLabs V3 — Generate speech from text with natural-sounding voices.
OpenAI Whisper — Transcribe audio to text with high accuracy.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/voice-changer" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "voice_id": "Alice",
    "remove_background_noise": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
audio	string	Yes	-	-	URL of the audio file to transform
voice_id	string	No	Alice	Alice, Aria, Bill, Brian, Callum, Charlie, Charlotte, Chris, Daniel, Eric, George, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will	Voice to apply to the audio
remove_background_noise	boolean	No	false	-	Remove background noise from the audio

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Elevenlabs Turbo V2.5 ElevenLabs Voice ID

Elevenlabs Voice Changer

Playground

Features

ElevenLabs Voice Changer

Why Choose This?

Parameters

How to Use

Pricing

Best Use Cases

Pro Tips

Notes

Related Models

Authentication

API Endpoints

Submit Task & Query Result

Parameters

Task Submission Parameters

Request Parameters

Response Parameters

Result Request Parameters

Result Response Parameters