AI Vocal Remover

AI Vocal Remover separates vocals from the instrumental backing in any audio track. Upload an audio file and choose whether to extract the vocals or the instrumental. It is available as a ready-to-use REST inference API with no cold starts and affordable pricing.

Features

AI Vocal Remover

AI Vocal Remover separates vocals and instrumentals from any audio track with a single click. Choose what you want to extract — the clean vocal track or the backing instrumental — and get a studio-quality separated file in seconds.

Perfect for karaoke creation, music production, remixing, and content workflows.


Why Choose This?

  • Clean separation: Advanced source separation technology isolates vocals and instrumentals with minimal bleed or artifacts.

  • Two extraction modes: Extract exactly what you need, vocals only or instrumental only, with no extra processing steps.

  • Works on a wide range of audio: Songs, podcasts, live recordings, and mixed tracks are all supported.

  • Fast and affordable: Per-second billing means you only pay for exactly what you process.


Parameters

| Parameter | Required | Description |
|---|---|---|
| audio | Yes | Input audio file to process (URL or file upload). |
| mode | No | What to extract: vocals (default) or instrumental. |

How to Use

  1. Upload your audio: provide the track you want to separate via URL or drag-and-drop.
  2. Select mode: choose vocals to extract the vocal track, or instrumental to extract the backing music.
  3. Submit: run the task, then download your separated audio file once processing completes.

Pricing

$0.001 per second of input audio.
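
As a rough illustration of the per-second rate, the sketch below estimates the charge for a clip of a given length. It assumes billing is strictly linear in duration with no rounding or minimum charge, which this page does not confirm; `estimate_cost_usd` is a hypothetical helper name, not part of the API.

```python
from decimal import Decimal

# Rate quoted in the Pricing section: $0.001 per second of input audio.
PRICE_PER_SECOND_USD = Decimal("0.001")


def estimate_cost_usd(duration_seconds: float) -> Decimal:
    """Estimate the charge for an input clip of the given length.

    Assumption: cost is exactly duration * rate, with no rounding
    and no minimum charge.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Go through str() so float inputs such as 2.5 convert cleanly to Decimal.
    return Decimal(str(duration_seconds)) * PRICE_PER_SECOND_USD


if __name__ == "__main__":
    for seconds in (30, 180, 3600):
        print(f"{seconds:>5} s -> ${estimate_cost_usd(seconds)}")
```

Under that assumption, a three-minute track (180 seconds) comes to $0.18 and an hour of audio to $3.60.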


Best Use Cases

  • Karaoke creation — Strip out vocals to produce a clean instrumental backing track.
  • Music production & remixing — Isolate vocals or instrumentals for sampling, remixing, and mashups.
  • Content creation — Remove background music from recordings or extract a clean vocal for voiceover work.
  • Practice & education — Isolate individual elements of a track to study arrangement or performance.

Pro Tips

  • High-quality, well-mixed audio produces the cleanest separation results.
  • Tracks with a strong stereo mix and clear frequency separation between vocals and instruments work best.
  • Use vocals mode to get a clean a cappella track, and instrumental mode to get a karaoke-ready backing.

Notes

  • audio is the only required field; mode defaults to vocals if not specified.
  • Ensure audio URLs are publicly accessible if using a link rather than a direct upload.
  • Pricing is based on the duration of the input audio at $0.001 per second.
  • Please ensure your content complies with WaveSpeedAI’s usage policies.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


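# Submit the task ("audio" is required; the URL below is a placeholder,
# replace it with a publicly accessible audio file)
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ai-vocal-remover" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "audio": "https://example.com/song.mp3",
    "mode": "vocals"
}'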

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
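
The same two calls can be sketched in Python using only the standard library. The endpoint paths and headers are taken from the curl commands above; the helper names (`build_submit_request`, `build_result_request`, `send`), the 30-second timeout, and the example audio URL are illustrative assumptions rather than part of any official SDK.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def build_submit_request(api_key: str, audio_url: str, mode: str = "vocals") -> urllib.request.Request:
    """Build the POST request that submits a separation task."""
    body = json.dumps({"audio": audio_url, "mode": mode}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/wavespeed-ai/ai-vocal-remover",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build the GET request that fetches the result for a task ID."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs real network calls, so it is left commented out):
#   import os
#   key = os.environ["WAVESPEED_API_KEY"]
#   submitted = send(build_submit_request(key, "https://example.com/song.mp3"))
#   result = send(build_result_request(key, submitted["data"]["id"]))
```

Building the request objects separately from sending them keeps the URL, headers, and body easy to inspect before anything goes over the network.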

Parameters

Task Submission Parameters

Request Parameters

| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| audio | string | Yes | - | - | The URL of the input audio file. |
| mode | string | No | vocals | vocals, instrumental | Output type: vocals (extract vocals) or instrumental (extract accompaniment). |
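
The constraints in the request table can be checked on the client before a task is submitted. The following is a minimal sketch; `build_payload` is a hypothetical helper, and the server may apply further validation that is not documented here.

```python
# Allowed values for "mode", as listed in the Range column.
VALID_MODES = ("vocals", "instrumental")


def build_payload(audio: str, mode: str = "vocals") -> dict:
    """Return a request body that satisfies the documented constraints.

    audio is required; mode is optional and defaults to "vocals".
    """
    if not isinstance(audio, str) or not audio.strip():
        raise ValueError("audio is required and must be a non-empty URL string")
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return {"audio": audio, "mode": mode}


if __name__ == "__main__":
    # The URL is a placeholder, not a real audio file.
    print(build_payload("https://example.com/song.mp3"))
    print(build_payload("https://example.com/song.mp3", mode="instrumental"))
```

Rejecting an unknown mode locally avoids spending a request on a payload the table already rules out.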

Response Parameters

| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
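
To show how these fields fit together, the sketch below pulls the task ID, status, and result URL out of a submission response. The sample payload is invented for illustration and only mirrors the field names in the table; treating any `code` other than 200 as a failure is an assumption based on the "200 for success" example, and `Submission` and `parse_submission` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    task_id: str
    status: str
    result_url: str


def parse_submission(body: dict) -> Submission:
    """Extract the fields needed to poll for the result later."""
    # Assumption: any code other than 200 means the submission failed.
    if body.get("code") != 200:
        raise RuntimeError(f"submission failed: {body.get('message')}")
    data = body["data"]
    return Submission(
        task_id=data["id"],
        status=data["status"],
        result_url=data["urls"]["get"],
    )


if __name__ == "__main__":
    # Made-up response shaped after the table above; values are placeholders.
    sample = {
        "code": 200,
        "message": "success",
        "data": {
            "id": "example-task-id",
            "model": "wavespeed-ai/ai-vocal-remover",
            "outputs": [],
            "urls": {"get": "https://api.wavespeed.ai/api/v3/predictions/example-task-id/result"},
            "status": "created",
            "error": "",
        },
    }
    print(parse_submission(sample))
```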

Result Request Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |

Result Response Parameters

| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction (the task ID passed in the request) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
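
Because `data.status` moves through created and processing before reaching completed or failed, a client typically polls the result endpoint until a terminal status appears. The loop below is one way to sketch that; the polling interval, the timeout, and the `wait_for_result` name are assumptions, and `fetch` stands in for whatever function performs the GET request and returns the decoded JSON.

```python
import time
from typing import Callable


def wait_for_result(
    fetch: Callable[[], dict],
    interval_s: float = 1.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> list:
    """Poll until the task completes and return data.outputs.

    Raises RuntimeError if the task fails and TimeoutError if it does
    not reach a terminal status within timeout_s seconds.
    """
    deadline = clock() + timeout_s
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        # Still "created" or "processing": wait and try again.
        if clock() >= deadline:
            raise TimeoutError(f"task still {status!r} after {timeout_s} s")
        sleep(interval_s)


if __name__ == "__main__":
    # Simulated responses so the example runs without network access.
    fake = iter([
        {"data": {"status": "created", "outputs": []}},
        {"data": {"status": "processing", "outputs": []}},
        {"data": {"status": "completed", "outputs": ["https://example.com/vocals.wav"]}},
    ])
    print(wait_for_result(lambda: next(fake), sleep=lambda s: None))
```

Passing `fetch`, `sleep`, and `clock` as parameters keeps the loop independent of any particular HTTP client and makes it straightforward to exercise with canned responses.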
© 2025 WaveSpeedAI. All rights reserved.