Stability Ai Stable Audio 3 Music
Playground
Try it on WavespeedAI!Stable Audio 3 Music is a fast AI music generation model that creates music from text prompts with controllable duration and output format. Ready-to-use REST inference API for AI music generation, background music, creator content, video soundtracks, advertising audio, game music, and professional text-to-music workflows with simple integration, no coldstarts, and affordable pricing.
Features
Stability AI Stable Audio 3 Music
Stability AI Stable Audio 3 Music generates music from a natural-language prompt, with controls for duration, negative prompting, inference steps, guidance strength, and output format. It is suitable for background music, soundtrack ideation, content scoring, trailer cues, and other prompt-driven music generation workflows.
Why Choose This?
-
Prompt-based music generation
Generate original music from a text description of mood, genre, instrumentation, and arrangement. -
Flexible duration control
Choose the target music length from short clips to longer pieces up to120seconds. -
Negative prompt support
Usenegative_promptto steer the model away from unwanted instruments, moods, or qualities. -
Generation controls
Adjustnum_inference_stepsandguidance_scaleto balance prompt adherence and output behavior. -
Multiple output formats
Export results inmp3,wav,flac,ogg,opus,m4a, oraac. -
Production-ready API
Suitable for videos, podcasts, trailers, games, social content, and music prototyping workflows.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text prompt describing the music style, mood, instruments, and arrangement. |
| duration | No | Target audio duration in seconds. Range: 1–120. Default: 30. |
| negative_prompt | No | Optional terms to avoid in the generated music. |
| num_inference_steps | No | Number of inference steps. Range: 1–100. Default: 8. |
| guidance_scale | No | Prompt guidance strength. Range: 0–25. Default: 1. |
| output_format | No | Output audio format. Supported values: mp3, wav, flac, ogg, opus, m4a, aac. Default: mp3. |
How to Use
- Write your prompt — describe the genre, mood, instrumentation, arrangement, and production feel you want.
- Set duration (optional) — choose how many seconds of music to generate.
- Add a negative prompt (optional) — describe sounds or qualities you want to avoid.
- Adjust generation controls (optional) — tune
num_inference_stepsandguidance_scaleif needed. - Choose output format — select the audio format that best fits your workflow.
- Submit — run the model and download the generated music.
Example Prompt
Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production
Pricing
Just $0.0217 per request.
Billing Rules
- Each music generation request costs $0.0217
- Pricing is fixed per request
duration,negative_prompt,num_inference_steps,guidance_scale, andoutput_formatdo not affect pricing
Best Use Cases
- Background music generation — Create music beds for videos, podcasts, and social content.
- Trailer and ad concepts — Generate cinematic music ideas for promos and campaigns.
- Game and app audio — Produce original music for interactive or ambient playback.
- Music prototyping — Explore multiple soundtrack directions quickly from prompts.
- Content production — Generate original music for creator and brand workflows.
Pro Tips
- Be specific in your prompt about genre, tempo, instrumentation, and emotional tone.
- Use
negative_promptwhen you want to avoid vocals, heavy drums, distortion, or certain styles. - Increase
num_inference_stepsif you want potentially more refined output and can tolerate more runtime. - Adjust
guidance_scalewhen you want tighter prompt adherence. - Start with a short, clear prompt before adding more arrangement detail.
Notes
promptis required.durationsupports1–120seconds.output_formatdefaults tomp3.- Pricing is fixed at $0.0217 per request.
- This workflow is intended for music generation rather than general sound-effect generation.
Related Models
- Stability AI Stable Audio 3 Text-to-Audio — Generate general audio and sound scenes from text prompts.
- Stability AI Stable Audio 3 Audio-Outpainting — Extend an existing audio clip before and/or after the source.
- Stability AI Stable Audio 3 Audio-Inpainting — Replace a selected region inside an existing audio clip.
<ApiPage model={model}>
## Authentication
For authentication details, please refer to the [Authentication Guide](/docs-authentication).
## API Endpoints
### Submit Task & Query Result
## Parameters
### Task Submission Parameters
#### Request Parameters
#### Response Parameters
<SubmitResponse />
#### Result Request Parameters
| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| id | string | Yes | - | Task ID |
#### Result Response Parameters
| Parameter | Type | Description |
|-----------|------|-------------|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., "success") |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of generated audio URLs. |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: `created`, `processing`, `completed`, or `failed` |
| data.created_at | string | ISO timestamp of when the request was created (e.g., "2023-04-01T12:34:56.789Z") |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
</ApiPage>