WaveSpeedAI APIWavespeed AIAce Step Audio Inpaint

Ace Step Audio Inpaint

Ace Step Audio Inpaint

Playground

Try it on WavespeedAI!

ACE-Step Audio Inpaint edits a specific audio segment to change lyrics or style while preserving the surrounding audio. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

ace-step/audio-inpaint 🎧

ACE-Step Audio Inpaint is a powerful music and audio editing model developed by WaveSpeedAI. It enables precise, intelligent modification of selected time ranges within an existing audio clip — perfect for fixing, remixing, or creatively reimagining tracks without re-generating the whole piece.


🌟 Key Features

  • 🎯 Precise Segment Editing Modify only the section you want — define start and end times to edit exactly the range you need.

  • 🎵 Seamless Audio Blending New content merges naturally with surrounding audio for smooth, undetectable transitions.

  • 🕒 Flexible Timing Control Choose whether your start or end times are relative to the beginning or end of the track.

  • 🎶 Style & Lyric Adaptability Add new instrumentation, effects, or lyrics while preserving the overall flow and tone.

  • ⚙️ Controlled Variation Adjust how much the regenerated section diverges from the original using seed and creative parameters.


⚙️ Parameters

ParameterDescription
audio*Upload or link to an existing audio file (MP3 / WAV).
tags*Define the target style or mood (e.g., lofi, hiphop, trap, chill).
start_time / end_timeSelect the time range (in seconds) to edit.
start_time_relative_to / end_time_relative_toChoose whether the range is relative to the start or end of the audio.
lyrics(Optional) Add or replace lyrics for the edited section.
seedFix for reproducible results; -1 for randomized variation.

💡 Use Cases

  • 🎧 Repair or refine — Fix errors or off-beat moments in specific sections.
  • 📝 Rewrite lyrics — Try new vocal phrasing or emotional tone.
  • 🎚 Remix segments — Replace or restyle a part of a song without altering the rest.
  • 🎬 Audio storytelling — Modify voiceovers or sound effects within a fixed-length clip.

💰 Pricing

MetricPrice
Per second of generated audio$0.0002 / s

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-inpaint" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "start_time_relative_to": "start",
    "start_time": 0,
    "end_time_relative_to": "start",
    "end_time": 30,
    "lyrics": "",
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
audiostringYes--Audio file to transcribe. Provide an HTTPS URL or upload a file (MP3, WAV, FLAC up to 60 minutes).
tagsstringYes--Comma-separated list of genre tags to control the style.
start_time_relative_tostringNostartstart, endReference point for start time.
start_timenumberNo-0 ~ 240Start time in seconds.
end_time_relative_tostringNostartstart, endReference point for end time.
end_timenumberNo300 ~ 240End time in seconds.
lyricsstringNo--Lyrics to be sung in the audio. Use [inst] or [instrumental] for no vocals.
seedintegerNo-1-1 ~ 2147483647The random seed for reproducibility.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.