Elevenlabs Voice Changer
Playground
Try it on WavespeedAI!ElevenLabs Voice Changer transforms any audio into speech with a different voice while preserving the original speech patterns and timing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Features
ElevenLabs Voice Changer
ElevenLabs Voice Changer transforms any audio into speech with a different voice. Upload your audio and select a target voice — the model converts the speech while preserving the original timing, emotion, and delivery. Built on ElevenLabs’ industry-leading voice AI with best-in-class quality.
REST inference API, best performance, no cold starts, affordable pricing.
Why Choose This?
-
High-quality voice conversion Industry-leading voice transformation that maintains natural speech patterns and emotional delivery.
-
Multiple voice options Choose from a variety of pre-built voices to match your content needs.
-
Background noise removal Optional noise reduction to clean up audio before conversion.
-
Fast processing Optimized for quick turnaround with no cold starts.
-
Production-ready API Reliable REST endpoint with predictable per-minute pricing.
Parameters
| Parameter | Required | Description |
|---|---|---|
| audio | Yes | Source audio file to transform (upload or URL) |
| voice_id | No | Target voice for conversion (default: Alice) |
| remove_background_noise | No | Remove background noise from the audio |
How to Use
- Upload your audio — drag and drop, paste a URL, or record directly.
- Select voice — choose the target voice for conversion.
- Enable noise removal (optional) — check to clean up background noise.
- Run — submit and download the converted audio.
Pricing
| Duration | Cost |
|---|---|
| Per minute | $0.30 |
| 30 seconds | $0.15 |
| 5 minutes | $1.50 |
Best Use Cases
- Content Creation — Change voices for podcasts, videos, or audiobooks.
- Dubbing — Convert speech to different voices for localization.
- Privacy — Anonymize voice recordings while preserving content.
- Character Voices — Create distinct character voices for storytelling.
- Accessibility — Convert speech to preferred voice styles.
Pro Tips
- Use clean, high-quality source audio for best results.
- Enable background noise removal if your source has ambient sounds.
- Shorter clips process faster — split long audio for parallel processing.
- Test with different voices to find the best match for your content.
Notes
- Maximum audio duration is 10 minutes per job.
- For longer content, split into segments and process separately.
- Supported audio formats include MP3, WAV, and other common formats.
Related Models
- ElevenLabs V3 — Generate speech from text with natural-sounding voices.
- OpenAI Whisper — Transcribe audio to text with high accuracy.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/elevenlabs/voice-changer" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"voice_id": "Alice",
"remove_background_noise": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| audio | string | Yes | - | - | URL of the audio file to transform |
| voice_id | string | No | Alice | Alice, Aria, Bill, Brian, Callum, Charlie, Charlotte, Chris, Daniel, Eric, George, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will | Voice to apply to the audio |
| remove_background_noise | boolean | No | false | - | Remove background noise from the audio |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |