ACE-Step Audio-to-Audio turns existing tracks into remixes or vocal edits using remix and lyrics modes while preserving audio character. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
ว่าง
$0.0002ต่อครั้ง·~5000 / $1
ACE-Step Audio-to-Audio is a creative music transformation model that generates new versions of existing tracks. It allows you to remix, rewrite, or restyle a song directly from an uploaded audio file — perfect for producers, remixers, and creators looking to evolve their sound.
🎛 Remix Mode Change the musical style while preserving rhythm, tempo, and melodic structure. (e.g., turn a pop track into a lo-fi or EDM remix)
🎤 Lyrics Mode Edit or replace the song’s vocal content while keeping the instrumental layers intact.
🎚 Style Control via Tags Guide generation using genre or mood tags like “jazz,” “cinematic,” “trap,” “ambient chill.”
🎵 High Fidelity Preservation Keeps fine-grained acoustic and timbral details from the original audio — ensuring professional-grade sound quality.
🪄 Reproducible Outputs Use the seed parameter to reproduce or slightly vary your remix results.
| Parameter | Description |
|---|---|
| audio* | Upload or link to an existing track (mp3/wav) |
| original_tags* | Tags that describe the current genre/style |
| tags* | Target tags for the remix (e.g., “jazz”, “rock”, “electronic”) |
| edit_mode | Choose between remix or lyrics editing modes |
| original_lyrics | (Optional) Input existing lyrics for contextual editing |
| lyrics | (Optional) New or modified lyrics to be generated |
| seed | Randomization control — use -1 for auto or set a fixed value for reproducibility |
1. Create a remix:
Upload a pop song → set edit_mode: remix → add tags like “synthwave, retro” → generate a new version.
2. Rewrite lyrics:
Upload a vocal track → choose edit_mode: lyrics → enter new lyrics → generate a rewritten version keeping rhythm and tone.
| Metric | Price |
|---|---|
| Per second of generated audio | $0.0002 / s |
ACE-Step Audio-to-Audio transforms existing music into new creative expressions. Whether you’re remixing genres, rewriting lyrics, or refining mood and tone — it’s your all-in-one AI assistant for dynamic music evolution.
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Ace Step Audio To Audio below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"audio": "https://example.com/your-audio.mp3",
"edit_mode": "remix",
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("wavespeed-ai/ace-step/audio-to-audio", {
"audio": "https://example.com/your-audio.mp3",
"edit_mode": "remix",
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"wavespeed-ai/ace-step/audio-to-audio",
{
"audio": "https://example.com/your-audio.mp3",
"edit_mode": "remix",
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputAce Step Audio To Audio is a WaveSpeedAI model for AI inference, exposed as a REST API on WaveSpeedAI. ACE-Step Audio-to-Audio turns existing tracks into remixes or vocal edits using remix and lyrics modes while preserving audio character. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/ace-step-audio-to-audio.
Ace Step Audio To Audio starts at $0.000 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `audio`, `seed`, `edit_mode`, `lyrics`, `original_lyrics`, `original_tags`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/ace-step-audio-to-audio.
Average end-to-end generation time on WaveSpeedAI is around 214 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.