Ace Step Audio Outpaint
Playground
Try it on WavespeedAI!ACE-Step Audio Outpaint generates seamless start or end extensions that match the original, ideal for intros, outros and longer tracks. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Features
ACE-Step Audio Outpaint
Extend your audio tracks seamlessly with AI-powered music generation. Upload any audio clip and let the model generate new content before or after it — perfectly matching the style, rhythm, and mood of your original track.
Why It Sounds Great
- Style-aware generation: Uses tag-based guidance to match genres like lofi, hiphop, trap, drum and bass, and more.
- Seamless transitions: Generated audio blends naturally with your original track without abrupt cuts or mismatched beats.
- Bidirectional extension: Extend audio at the beginning, end, or both directions simultaneously.
- Lyrics support: Optionally provide lyrics to guide vocal generation in extended sections.
- Reproducible results: Use the seed parameter to recreate exact outputs or explore variations.
Parameters
| Parameter | Required | Description |
|---|---|---|
| audio | Yes | Source audio file (upload or public URL). |
| tags | Yes | Comma-separated style tags to guide generation (e.g., “lofi, hiphop, chill, trap”). |
| extend_before_duration | No | Seconds to generate before the original audio. Default: 0. |
| extend_after_duration | No | Seconds to generate after the original audio. Default: 30. |
| lyrics | No | Optional lyrics to guide vocal generation in extended sections. |
| seed | No | Random seed for reproducibility. Use -1 for random. |
How to Use
- Upload your audio — drag and drop or paste a public URL.
- Add style tags — describe the genre and mood (e.g., “lofi, hiphop, chill”).
- Set extension duration:
- Use extend_after_duration to add time at the end.
- Use extend_before_duration to add time at the beginning.
- Add lyrics (optional) — provide text if you want vocals in the extended section.
- Set seed (optional) — use -1 for random, or a specific number for reproducible results.
- Run — click the button and wait for generation.
- Download — preview and save your extended audio.
Pricing
Per-second billing based on total output duration.
| Metric | Cost |
|---|---|
| Per second | $0.0002 |
Billing Formula
Total cost = (original audio duration + extend_before_duration + extend_after_duration) × $0.0002
Examples
| Original | Extend Before | Extend After | Total Duration | Total Cost |
|---|---|---|---|---|
| 60s | 0s | 30s | 90s | $0.018 |
| 90s | 10s | 30s | 130s | $0.026 |
| 120s | 0s | 60s | 180s | $0.036 |
| 180s | 30s | 30s | 240s | $0.048 |
Best Use Cases
- Music Production — Extend loops, intros, or outros for full-length tracks.
- Content Creation — Generate longer background music for videos and podcasts.
- DJ & Remix Work — Create extended mixes or seamless transitions between tracks.
- Game & Media Audio — Produce adaptive music that can loop or extend dynamically.
- Songwriting — Explore new directions by extending existing ideas with AI assistance.
Pro Tips for Best Results
- Use descriptive, specific tags — “lofi, hiphop, jazzy, chill, piano” works better than just “music”.
- For consistent style, keep the original audio and extension durations balanced.
- Experiment with different seeds to find the perfect variation.
- When using lyrics, match the syllable count and rhythm to the expected musical phrasing.
- Start with shorter extensions (15-30s) to test the style match before generating longer segments.
Notes
- If using a URL, ensure it is publicly accessible. A preview player in the interface confirms successful loading.
- Processing time scales with total output duration.
- For best results, use clean source audio with consistent tempo and style.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/ace-step/audio-outpaint" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"extend_before_duration": 0,
"extend_after_duration": 30,
"lyrics": "",
"seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| audio | string | Yes | - | - | Audio file to transcribe. Provide an HTTPS URL or upload a file (MP3, WAV, FLAC up to 60 minutes). |
| tags | string | Yes | - | - | Comma-separated list of genre tags to control the style. |
| extend_before_duration | number | No | - | 0 ~ 240 | Duration to extend from the start in seconds. |
| extend_after_duration | number | No | 30 | 0 ~ 240 | Duration to extend from the end in seconds. |
| lyrics | string | No | - | - | Vocal content for generation. Use [inst] or [instrumental] for no vocals. |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed for reproducibility. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |