Stability Ai Stable Audio 3 Music

Playground

Stable Audio 3 Music is a fast AI music generation model that creates music from text prompts with controllable duration and output format. Ready-to-use REST inference API for AI music generation, background music, creator content, video soundtracks, advertising audio, game music, and professional text-to-music workflows with simple integration, no coldstarts, and affordable pricing.

Features

Stability AI Stable Audio 3 Music

Stability AI Stable Audio 3 Music generates music from a natural-language prompt, with controls for duration, negative prompting, inference steps, guidance strength, and output format. It is suitable for background music, soundtrack ideation, content scoring, trailer cues, and other prompt-driven music generation workflows.

Why Choose This?

Prompt-based music generation
Generate original music from a text description of mood, genre, instrumentation, and arrangement.
Flexible duration control
Choose the target music length from short clips to longer pieces up to 120 seconds.
Negative prompt support
Use negative_prompt to steer the model away from unwanted instruments, moods, or qualities.
Generation controls
Adjust num_inference_steps and guidance_scale to balance prompt adherence and output behavior.
Multiple output formats
Export results in mp3, wav, flac, ogg, opus, m4a, or aac.
Production-ready API
Suitable for videos, podcasts, trailers, games, social content, and music prototyping workflows.

Parameters

Parameter	Required	Description
prompt	Yes	Text prompt describing the music style, mood, instruments, and arrangement.
duration	No	Target audio duration in seconds. Range: `1–120`. Default: `30`.
negative_prompt	No	Optional terms to avoid in the generated music.
num_inference_steps	No	Number of inference steps. Range: `1–100`. Default: `8`.
guidance_scale	No	Prompt guidance strength. Range: `0–25`. Default: `1`.
output_format	No	Output audio format. Supported values: `mp3`, `wav`, `flac`, `ogg`, `opus`, `m4a`, `aac`. Default: `mp3`.

How to Use

Write your prompt — describe the genre, mood, instrumentation, arrangement, and production feel you want.
Set duration (optional) — choose how many seconds of music to generate.
Add a negative prompt (optional) — describe sounds or qualities you want to avoid.
Adjust generation controls (optional) — tune num_inference_steps and guidance_scale if needed.
Choose output format — select the audio format that best fits your workflow.
Submit — run the model and download the generated music.

Example Prompt

Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production

Pricing

Just $0.0217 per request.

Billing Rules

Each music generation request costs $0.0217
Pricing is fixed per request
duration, negative_prompt, num_inference_steps, guidance_scale, and output_format do not affect pricing

Best Use Cases

Background music generation — Create music beds for videos, podcasts, and social content.
Trailer and ad concepts — Generate cinematic music ideas for promos and campaigns.
Game and app audio — Produce original music for interactive or ambient playback.
Music prototyping — Explore multiple soundtrack directions quickly from prompts.
Content production — Generate original music for creator and brand workflows.

Pro Tips

Be specific in your prompt about genre, tempo, instrumentation, and emotional tone.
Use negative_prompt when you want to avoid vocals, heavy drums, distortion, or certain styles.
Increase num_inference_steps if you want potentially more refined output and can tolerate more runtime.
Adjust guidance_scale when you want tighter prompt adherence.
Start with a short, clear prompt before adding more arrangement detail.

Notes

prompt is required.
duration supports 1–120 seconds.
output_format defaults to mp3.
Pricing is fixed at $0.0217 per request.
This workflow is intended for music generation rather than general sound-effect generation.

Stability AI Stable Audio 3 Text-to-Audio — Generate general audio and sound scenes from text prompts.
Stability AI Stable Audio 3 Audio-Outpainting — Extend an existing audio clip before and/or after the source.
Stability AI Stable Audio 3 Audio-Inpainting — Replace a selected region inside an existing audio clip.



<ApiPage model={model}>
  ## Authentication

  For authentication details, please refer to the [Authentication Guide](/docs-authentication).

  ## API Endpoints

  ### Submit Task & Query Result

  ## Parameters

  ### Task Submission Parameters

  #### Request Parameters

  #### Response Parameters

  <SubmitResponse />

  #### Result Request Parameters

  | Parameter | Type | Required | Default | Description |
  |-----------|------|----------|---------|-------------|
  | id | string | Yes | - | Task ID |

  #### Result Response Parameters

  | Parameter | Type | Description |
  |-----------|------|-------------|
  | code | integer | HTTP status code (e.g., 200 for success) |
  | message | string | Status message (e.g., "success") |
  | data | object | The prediction data object containing all details |
  | data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
  | data.model | string | Model ID used for the prediction |
  | data.outputs | string | Array of generated audio URLs. |
  | data.urls | object | Object containing related API endpoints |
  | data.urls.get | string | URL to retrieve the prediction result |
  | data.status | string | Status of the task: `created`, `processing`, `completed`, or `failed` |
  | data.created_at | string | ISO timestamp of when the request was created (e.g., "2023-04-01T12:34:56.789Z") |
  | data.error | string | Error message (empty if no error occurred) |
  | data.timings | object | Object containing timing details |
  | data.timings.inference | integer | Inference time in milliseconds |

</ApiPage>

Stability AI Stable Audio 3 Audio To Audio Stability AI Stable Audio 3 Text To Audio