ACE Step Audio to Audio | AI Voice Conversion API

ACE-Step — Audio to Audio 🎧

ACE-Step Audio-to-Audio is a creative music transformation model that generates new versions of existing tracks. It allows you to remix, rewrite, or restyle a song directly from an uploaded audio file — perfect for producers, remixers, and creators looking to evolve their sound.

✨ Key Features

🎛 Remix Mode Change the musical style while preserving rhythm, tempo, and melodic structure. (e.g., turn a pop track into a lo-fi or EDM remix)
🎤 Lyrics Mode Edit or replace the song’s vocal content while keeping the instrumental layers intact.
🎚 Style Control via Tags Guide generation using genre or mood tags like “jazz,” “cinematic,” “trap,” “ambient chill.”
🎵 High Fidelity Preservation Keeps fine-grained acoustic and timbral details from the original audio — ensuring professional-grade sound quality.
🪄 Reproducible Outputs Use the seed parameter to reproduce or slightly vary your remix results.

🧩 Parameters

Parameter	Description
audio*	Upload or link to an existing track (mp3/wav)
original_tags*	Tags that describe the current genre/style
tags*	Target tags for the remix (e.g., “jazz”, “rock”, “electronic”)
edit_mode	Choose between remix or lyrics editing modes
original_lyrics	(Optional) Input existing lyrics for contextual editing
lyrics	(Optional) New or modified lyrics to be generated
seed	Randomization control — use `-1` for auto or set a fixed value for reproducibility

🎶 Use Cases

Remixing existing tracks into new genres or moods
Rewriting lyrics while preserving the backing music
Adapting songs for different campaigns, platforms, or cultural contexts
Creating A/B variations for music production or content testing
Expanding music datasets with stylistic diversity

💡 Example Workflows

1. Create a remix: Upload a pop song → set edit_mode: remix → add tags like “synthwave, retro” → generate a new version.

2. Rewrite lyrics: Upload a vocal track → choose edit_mode: lyrics → enter new lyrics → generate a rewritten version keeping rhythm and tone.

💰 Pricing

Metric	Price
Per second of generated audio	$0.0002 / s

🎵 Summary

ACE-Step Audio-to-Audio transforms existing music into new creative expressions. Whether you’re remixing genres, rewriting lyrics, or refining mood and tone — it’s your all-in-one AI assistant for dynamic music evolution.

Ace Step Audio To Audio API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio with your input as JSON. The endpoint returns a prediction id. Start polling the result endpoint around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. On completed, read output values from data.outputs. Examples for Ace Step Audio To Audio below.

HTTP example

set -euo pipefail

: "${WAVESPEED_API_KEY:?Set WAVESPEED_API_KEY}"

REQUEST_BODY=$(cat <<'JSON'
{
    "audio": "https://interactive-examples.mdn.mozilla.net/media/cc0-audio/t-rex-roar.mp3",
    "original_tags": "example",
    "tags": "example",
    "edit_mode": "remix",
    "seed": -1
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d "$REQUEST_BODY")

TASK=$(printf '%s' "$SUBMIT_RESPONSE" | jq 'if has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "$TASK" | jq -r '.id')
if [ -z "$PREDICTION_ID" ] || [ "$PREDICTION_ID" = "null" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "$TASK" | jq -r '.urls.get // empty')
if [ -z "$RESULT_URL" ]; then
  RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/$PREDICTION_ID/result"
fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body "$RESULT_URL" \
    -H "Authorization: Bearer $WAVESPEED_API_KEY")
  RESULT=$(printf '%s' "$RESPONSE" | jq 'if has("data") then .data else . end')
  STATUS=$(printf '%s' "$RESULT" | jq -r '.status')
  case "$STATUS" in
    completed) printf '%s\n' "$RESULT" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "$RESULT" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "$STATUS" >&2; exit 1 ;;
  esac
done

Node.js example

const submitUrl = "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio";
const apiKey = process.env.WAVESPEED_API_KEY;
if (!apiKey) throw new Error('Set WAVESPEED_API_KEY');

async function requestJson(url, options = {}) {
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(await response.text());
  return response.json();
}

// 1. Submit the prediction.
const body = await requestJson(submitUrl, {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
        "audio": "https://interactive-examples.mdn.mozilla.net/media/cc0-audio/t-rex-roar.mp3",
        "original_tags": "example",
        "tags": "example",
        "edit_mode": "remix",
        "seed": -1
}),
});
const task = body.data ?? body;
if (!task.id) throw new Error("Submission response did not contain a prediction id");
const resultUrl = task.urls?.get ||
  `https://api.wavespeed.ai/api/v3/predictions/${task.id}/result`;

// 2. Poll until the prediction finishes.
while (true) {
  const resultBody = await requestJson(resultUrl, {
    headers: { "Authorization": `Bearer ${apiKey}` },
  });
  const result = resultBody.data ?? resultBody;
  if (result.status === "completed") {
    console.log(result.outputs);
    break;
  }
  if (["failed", "cancelled", "timeout"].includes(result.status)) throw new Error(JSON.stringify(result));
  if (!["created", "processing"].includes(result.status)) throw new Error("Unexpected status: " + result.status);
  await new Promise(resolve => setTimeout(resolve, 2000));
}

Python example

import json
import os
import time
from urllib.request import Request, urlopen

api_key = os.environ["WAVESPEED_API_KEY"]
headers = {"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"}
payload = {
    "audio": "https://interactive-examples.mdn.mozilla.net/media/cc0-audio/t-rex-roar.mp3",
    "original_tags": "example",
    "tags": "example",
    "edit_mode": "remix",
    "seed": -1
}

def request_json(url, data=None):
    request = Request(url, data=data, headers=headers, method="POST" if data else "GET")
    with urlopen(request) as response:
        return json.load(response)

# 1. Submit the prediction.
body = request_json("https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-to-audio", json.dumps(payload).encode())
task = body.get("data", body)
if not task.get("id"):
    raise RuntimeError("Submission response did not contain a prediction id")
result_url = task.get("urls", {}).get("get") or f"https://api.wavespeed.ai/api/v3/predictions/{task['id']}/result"

# 2. Poll until the prediction finishes.
while True:
    result_body = request_json(result_url)
    result = result_body.get("data", result_body)
    status = result.get("status")
    if status == "completed":
        print(result.get("outputs", []))
        break
    if status in {"failed", "cancelled", "timeout"}:
        raise RuntimeError(result)
    if status not in {"created", "processing"}:
        raise RuntimeError(f"Unexpected status: {status}")
    time.sleep(2)

Ace Step Audio To Audio API — Frequently asked questions

What is the Ace Step Audio To Audio API?

Ace Step Audio To Audio is a WaveSpeedAI model for AI inference, exposed as a REST API on WaveSpeedAI. ACE-Step Audio-to-Audio turns existing tracks into remixes or vocal edits using remix and lyrics modes while preserving audio character. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Ace Step Audio To Audio API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID. Poll the result endpoint starting around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. The playground generates production-oriented Python, JavaScript, and cURL examples with timeouts, transient-error handling, and safe GET retries. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/ace-step-audio-to-audio.

How much does Ace Step Audio To Audio cost per run?

Ace Step Audio To Audio starts at $0.000 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Ace Step Audio To Audio accept?

Key inputs: `audio`, `seed`, `edit_mode`, `lyrics`, `original_lyrics`, `original_tags`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/ace-step-audio-to-audio.

How long does Ace Step Audio To Audio take to generate?

Median end-to-end generation time on WaveSpeedAI is around 162 seconds per request, based on recent successful runs. Queue time varies with global demand; live status is visible in the prediction record.

Can I use Ace Step Audio To Audio outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

ตัวอย่างดูทั้งหมด

โมเดลที่เกี่ยวข้อง

README

ACE-Step — Audio to Audio 🎧

✨ Key Features

🧩 Parameters

🎶 Use Cases

💡 Example Workflows

💰 Pricing

🎵 Summary

Ace Step Audio To Audio API — Quick start

Ace Step Audio To Audio API — Frequently asked questions

เรียนรู้เพิ่มเติม

กฎหมาย

แหล่งข้อมูล

โมเดล

เครื่องมือ