Browse ModelsWavespeed AIAudio Vocal Isolator

Audio Vocal Isolator

Audio Vocal Isolator

Playground

Try it on WavespeedAI!

AI Vocal Remover separates vocals from instrumental in any audio track. Upload an audio file and choose to extract vocals or instrumental. Ready-to-use REST inference API, no coldstarts, affordable pricing.

Features

AI Vocal Remover

AI Vocal Remover separates vocals and instrumentals from any audio track with a single click. Upload an audio file and get both the isolated vocal track and the instrumental track in seconds.

Perfect for karaoke creation, music production, remixing, and content workflows.


Why Choose This?

  • Clean separation Advanced source separation technology isolates vocals and instrumentals with minimal bleed or artifacts.

  • Both tracks at once Get the vocal and instrumental tracks simultaneously from a single request.

  • Works on any audio Songs, podcasts, live recordings, mixed tracks — the model handles a wide range of audio sources.

  • Fast and affordable Per-second billing means you only pay for exactly what you process.


Parameters

ParameterRequiredDescription
audioYesInput audio file to process (URL or file upload).

How to Use

  1. Upload your audio — provide the track you want to separate via URL or drag-and-drop.
  2. Submit — the model processes and separates the audio.
  3. Download — the outputs array returns two URLs: the first is the vocal track, the second is the instrumental track.

Pricing

$0.001 per second of input audio.


Best Use Cases

  • Karaoke creation — Strip out vocals to produce a clean instrumental backing track.
  • Music production & remixing — Isolate vocals or instrumentals for sampling, remixing, and mashups.
  • Content creation — Remove background music from recordings or extract a clean vocal for voiceover work.
  • Practice & education — Isolate individual elements of a track to study arrangement or performance.

Pro Tips

  • High-quality, well-mixed audio produces the cleanest separation results.
  • Tracks with a strong stereo mix and clear frequency separation between vocals and instruments work best.

Notes

  • audio is the only required parameter.
  • The outputs array contains two URLs: the first is the vocal track, the second is the instrumental track.
  • Ensure audio URLs are publicly accessible if using a link rather than a direct upload.
  • Pricing is based on the duration of the input audio at $0.001 per second.
  • Please ensure your content complies with WaveSpeed AI’s usage policies.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/audio-vocal-isolator" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
audiostringYes--The URL of the input audio file.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.