Ace Step Prompt To Audio

Playground

ACE-Step Prompt-to-Audio creates music from simple prompts, auto-generating genre tags and lyrics for quick song creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

ACE-Step Prompt-to-Audio is an intelligent music generation model that composes full-length audio tracks directly from text prompts. Simply describe the sound you want — from chill jazz to cinematic orchestral — and ACE-Step creates a polished piece in seconds.

✨ Key Features

Prompt-to-Music Creation Just write your idea in plain language. For example: “A jazzy chillout track with a cozy vibe about rainy evenings in a quiet café.”
Instrumental Mode Toggle the instrumental option to generate music without vocals — perfect for background use, podcasts, or film scoring.
Duration Control Use the slider to set your track length — anywhere from a few seconds to full-minute compositions (e.g., 60s).
Seed for Reproducibility Set the seed value to recreate the same song later, or randomize it for unique variations.
Genre & Emotion Understanding The model interprets keywords like “jazzy,” “dark,” “energetic,” “melancholic,” and blends rhythm, instruments, and mood accordingly.

🎧 Use Cases

Music Production & Songwriting — Draft melodies, instrumentals, or complete compositions in seconds.
Film, Game & Animation Scoring — Create background themes, ambient layers, and emotional moments effortlessly.
Social Media & Marketing — Produce custom soundtracks for reels, ads, or brand videos.
Education & Experimentation — Teach musical structure or explore AI-based composition.
Creative Exploration — Turn story ideas, emotions, or visual scenes into sound.

🧠 Example Prompts

“A cheerful pop song about summer memories.”
“Dark electronic beat with deep bass and atmospheric pads.”
“Calm piano and violin piece inspired by sunrise.”
“Lo-fi hip-hop track for late-night studying.”
“Epic orchestral theme with rising intensity.”

⚙️ How to Use

Enter a prompt describing mood, genre, or theme.
(Optional) Enable Instrumental for vocal-free music.
Adjust duration with the slider (e.g., 30s, 45s, 60s).
Set seed for reproducibility (or keep random for new results).
Click Generate — and listen to your AI-composed track.

💰 Pricing

Metric	Price
Per second of generated audio	$0.0002 / s

🎵 Summary

ACE-Step Prompt-to-Audio transforms words into music — from short jingles to full-length compositions. It’s your AI-powered music studio, ready to help musicians, creators, and filmmakers bring sound to life with just a sentence.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result

set -euo pipefail

export WAVESPEED_API_KEY="your-api-key"

REQUEST_BODY=$(cat <<'JSON'
{
  "prompt": "A cinematic ocean wave at sunrise, highly detailed",
  "instrumental": false,
  "duration": 60,
  "seed": -1
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/prompt-to-audio" \
  -H "Authorization: Bearer ${WAVESPEED_API_KEY}" \
  -H "Content-Type: application/json" \
  -d "${REQUEST_BODY}")

TASK=$(printf '%s' "${SUBMIT_RESPONSE}" | jq 'if type == "object" and has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "${TASK}" | jq -r '.id // empty')
if [ -z "${PREDICTION_ID}" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "${TASK}" | jq -r '.urls.get // empty')
if [ -z "${RESULT_URL}" ]; then RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/${PREDICTION_ID}/result"; fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body \
    "${RESULT_URL}" \
    -H "Authorization: Bearer ${WAVESPEED_API_KEY}")
  RESULT=$(printf '%s' "${RESPONSE}" | jq 'if type == "object" and has("data") then .data else . end')
  STATUS=$(printf '%s' "${RESULT}" | jq -r '.status // empty')

  case "${STATUS}" in
    completed) printf '%s\n' "${RESULT}" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "${RESULT}" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "${STATUS}" >&2; exit 1 ;;
  esac
done

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	Prompt to control the style of the generated audio. This will be used to generate tags and lyrics.
instrumental	boolean	No	false	-	Whether to generate an instrumental version.
duration	number	No	60	5 ~ 240	Audio length in seconds.
seed	integer	No	-1	-	The random seed for reproducibility.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Output values, usually URL strings; some models return text strings or structured result objects (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction
data.model	string	Model ID used for the prediction
data.outputs	array<string \| object>	Array of generated outputs (empty when status is not completed). Items are usually URL strings, but may be text strings or structured result objects, depending on the model.
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to poll for the prediction result
data.status	string	Status: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Overview