WaveSpeedAI APIWavespeed AIAce Step Audio Inpaint

Ace Step Audio Inpaint

Ace Step Audio Inpaint

Playground

Try it on WavespeedAI!

ACE-Step Audio Inpaint modifies a specific portion of provided audio with new lyrics and/or style while preserving the surrounding audio. Perfect for editing specific sections of a track, fixing mistakes, or trying different variations in the middle of a song without regenerating the entire track.

Features

ACE-Step: Audio Inpaint (Edit)

ACE-Step Audio Inpaint modifies a specific portion of provided audio with new content.

Key Features

  1. Precise Editing: Target specific time ranges in your audio for modification.
  2. Seamless Integration: Modified sections blend naturally with surrounding audio.
  3. Flexible Timing: Specify start and end times relative to the beginning or end of the track.
  4. Variance Control: Adjust how much the edited section differs from the original.

Use Cases

  • Fix mistakes in a specific section of a track
  • Try different lyrics for a verse or chorus
  • Replace instrumental sections with different styles
  • Edit specific parts without regenerating the entire song

Pricing

$0.0002 per second of generated audio.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-inpaint" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "start_time_relative_to": "start",
    "start_time": 0,
    "end_time_relative_to": "start",
    "end_time": 30,
    "lyrics": "",
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
audiostringYes--Audio file to transcribe. Provide an HTTPS URL or upload a file (MP3, WAV, FLAC up to 60 minutes).
tagsstringYes--Comma-separated list of genre tags to control the style.
start_time_relative_tostringNostartstart, endReference point for start time.
start_timenumberNo-0 ~ 240Start time in seconds.
end_time_relative_tostringNostartstart, endReference point for end time.
end_timenumberNo300 ~ 240End time in seconds.
lyricsstringNo--Lyrics to be sung in the audio. Use [inst] or [instrumental] for no vocals.
seedintegerNo-1-1 ~ 2147483647The random seed for reproducibility.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.