
Stability AI Stable Audio 3 Music

Stable Audio 3 Music is a fast AI music generation model that creates music from text prompts with controllable duration and output format. It is available through a ready-to-use REST inference API for AI music generation, background music, creator content, video soundtracks, advertising audio, game music, and professional text-to-music workflows, with simple integration, no cold starts, and affordable pricing.

Features

Stability AI Stable Audio 3 Music

Stability AI Stable Audio 3 Music generates music from a natural-language prompt, with controls for duration, negative prompting, inference steps, guidance strength, and output format. It is suitable for background music, soundtrack ideation, content scoring, trailer cues, and other prompt-driven music generation workflows.


Why Choose This?

  • Prompt-based music generation
    Generate original music from a text description of mood, genre, instrumentation, and arrangement.

  • Flexible duration control
    Choose the target music length from short clips to longer pieces up to 120 seconds.

  • Negative prompt support
    Use negative_prompt to steer the model away from unwanted instruments, moods, or qualities.

  • Generation controls
    Adjust num_inference_steps and guidance_scale to trade off runtime, output refinement, and prompt adherence.

  • Multiple output formats
    Export results in mp3, wav, flac, ogg, opus, m4a, or aac.

  • Production-ready API
    Suitable for videos, podcasts, trailers, games, social content, and music prototyping workflows.


Parameters

| Parameter | Required | Description |
|-----------|----------|-------------|
| prompt | Yes | Text prompt describing the music style, mood, instruments, and arrangement. |
| duration | No | Target audio duration in seconds. Range: 1–120. Default: 30. |
| negative_prompt | No | Optional terms to avoid in the generated music. |
| num_inference_steps | No | Number of inference steps. Range: 1–100. Default: 8. |
| guidance_scale | No | Prompt guidance strength. Range: 0–25. Default: 1. |
| output_format | No | Output audio format. Supported values: mp3, wav, flac, ogg, opus, m4a, aac. Default: mp3. |
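
The ranges and defaults in the table above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: `build_payload` and `ALLOWED_FORMATS` are hypothetical names, and the assumption that the request body is a flat JSON object keyed by the parameter names is ours.

```python
# Illustrative helper; the limits below are copied from the parameter table.
ALLOWED_FORMATS = {"mp3", "wav", "flac", "ogg", "opus", "m4a", "aac"}


def build_payload(prompt, duration=30, negative_prompt=None,
                  num_inference_steps=8, guidance_scale=1,
                  output_format="mp3"):
    """Return a dict of request parameters, raising ValueError on bad input."""
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required")
    if not 1 <= duration <= 120:
        raise ValueError("duration must be between 1 and 120 seconds")
    if not 1 <= num_inference_steps <= 100:
        raise ValueError("num_inference_steps must be between 1 and 100")
    if not 0 <= guidance_scale <= 25:
        raise ValueError("guidance_scale must be between 0 and 25")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported output_format: {output_format!r}")

    payload = {
        "prompt": prompt,
        "duration": duration,
        "num_inference_steps": num_inference_steps,
        "guidance_scale": guidance_scale,
        "output_format": output_format,
    }
    # negative_prompt is optional, so it is only included when provided.
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload
```

Calling `build_payload("lofi beat")` returns the documented defaults, while an out-of-range value such as `duration=121` raises `ValueError` before any network call is made.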

How to Use

  1. Write your prompt — describe the genre, mood, instrumentation, arrangement, and production feel you want.
  2. Set duration (optional) — choose how many seconds of music to generate.
  3. Add a negative prompt (optional) — describe sounds or qualities you want to avoid.
  4. Adjust generation controls (optional) — tune num_inference_steps and guidance_scale if needed.
  5. Choose output format — select the audio format that best fits your workflow.
  6. Submit — run the model and download the generated music.
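
For API users, the steps above roughly correspond to building one JSON POST request. The sketch below only constructs the request and does not send it. `SUBMIT_URL` is a placeholder (this page does not list the endpoint), the bearer-token header is an assumption to be checked against the Authentication Guide, and `make_submit_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Hypothetical placeholder: the real endpoint is not given on this page.
SUBMIT_URL = "https://api.example.com/your-submit-endpoint"


def make_submit_request(api_key, prompt, **options):
    """Build (but do not send) a JSON POST request for one generation task."""
    body = {"prompt": prompt, **options}
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumed bearer-token scheme; check the Authentication Guide.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


request = make_submit_request(
    "YOUR_API_KEY",
    "Warm acoustic folk tune with fingerpicked guitar and light percussion",
    duration=45,                           # step 2
    negative_prompt="vocals, heavy drums", # step 3
    output_format="wav",                   # step 5
)
# Step 6 is left out on purpose: urllib.request.urlopen(request) would submit it.
```

Keeping request construction separate from sending makes it easy to inspect or log the body before spending a billed request.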

Example Prompt

Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production


Pricing

Just $0.0217 per request.

Billing Rules

  • Each music generation request costs $0.0217
  • Pricing is fixed per request
  • duration, negative_prompt, num_inference_steps, guidance_scale, and output_format do not affect pricing
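
Because pricing is flat per request, a batch estimate is a single multiplication. A small sketch (`estimate_cost` is a hypothetical helper; `Decimal` is used to avoid floating-point rounding in currency arithmetic):

```python
from decimal import Decimal

PRICE_PER_REQUEST = Decimal("0.0217")  # USD, from the pricing section


def estimate_cost(num_requests):
    """Total cost in USD; generation settings do not change the price."""
    if num_requests < 0:
        raise ValueError("num_requests must be non-negative")
    return PRICE_PER_REQUEST * num_requests


print(estimate_cost(100))  # 2.1700
```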

Best Use Cases

  • Background music generation — Create music beds for videos, podcasts, and social content.
  • Trailer and ad concepts — Generate cinematic music ideas for promos and campaigns.
  • Game and app audio — Produce original music for interactive or ambient playback.
  • Music prototyping — Explore multiple soundtrack directions quickly from prompts.
  • Content production — Generate original music for creator and brand workflows.

Pro Tips

  • Be specific in your prompt about genre, tempo, instrumentation, and emotional tone.
  • Use negative_prompt when you want to avoid vocals, heavy drums, distortion, or certain styles.
  • Increase num_inference_steps if you want potentially more refined output and can tolerate more runtime.
  • Adjust guidance_scale when you want tighter prompt adherence.
  • Start with a short, clear prompt before adding more arrangement detail.
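
One way to apply the tips on num_inference_steps and guidance_scale is to generate a small grid of settings for the same prompt and compare the results by ear. The sketch below only builds the parameter sets. The specific values are arbitrary picks inside the documented ranges, not recommendations.

```python
from itertools import product

# Shared settings for every variant; the prompt text is just an example.
base = {
    "prompt": "Upbeat synthwave track, 110 BPM, analog bass, bright lead synth",
    "duration": 30,
    "negative_prompt": "vocals, distortion",
}

# Arbitrary sample values within the documented ranges (1-100 and 0-25).
steps_options = [8, 16, 32]
guidance_options = [1, 3, 7]

variants = [
    {**base, "num_inference_steps": steps, "guidance_scale": scale}
    for steps, scale in product(steps_options, guidance_options)
]

print(len(variants))  # 9
```

Each variant is a separate request, so the grid above would be billed as nine requests.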

Notes

  • prompt is required.
  • duration supports 1–120 seconds.
  • output_format defaults to mp3.
  • Pricing is fixed at $0.0217 per request.
  • This workflow is intended for music generation rather than general sound-effect generation.

Related Models

  • Stability AI Stable Audio 3 Text-to-Audio — Generate general audio and sound scenes from text prompts.
  • Stability AI Stable Audio 3 Audio-Outpainting — Extend an existing audio clip before and/or after the source.
  • Stability AI Stable Audio 3 Audio-Inpainting — Replace a selected region inside an existing audio clip.


<ApiPage model={model}>
  ## Authentication

  For authentication details, please refer to the [Authentication Guide](/docs-authentication).

  ## API Endpoints

  ### Submit Task & Query Result

  ## Parameters

  ### Task Submission Parameters

  #### Request Parameters

  #### Response Parameters

  <SubmitResponse />

  #### Result Request Parameters

  | Parameter | Type | Required | Default | Description |
  |-----------|------|----------|---------|-------------|
  | id | string | Yes | - | Task ID |

  #### Result Response Parameters

  | Parameter | Type | Description |
  |-----------|------|-------------|
  | code | integer | HTTP status code (e.g., 200 for success) |
  | message | string | Status message (e.g., "success") |
  | data | object | The prediction data object containing all details |
  | data.id | string | Unique identifier for the prediction; use it as the task ID when retrieving the result |
  | data.model | string | Model ID used for the prediction |
  | data.outputs | array | Array of generated audio URLs |
  | data.urls | object | Object containing related API endpoints |
  | data.urls.get | string | URL to retrieve the prediction result |
  | data.status | string | Status of the task: `created`, `processing`, `completed`, or `failed` |
  | data.created_at | string | ISO timestamp of when the request was created (e.g., "2023-04-01T12:34:56.789Z") |
  | data.error | string | Error message (empty if no error occurred) |
  | data.timings | object | Object containing timing details |
  | data.timings.inference | integer | Inference time in milliseconds |

</ApiPage>
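
A minimal polling sketch based on the result response fields documented above (`data.status`, `data.outputs`, `data.error`). The `fetch_result` callable is a hypothetical stand-in for whatever HTTP call retrieves the task result. The endpoint is not shown on this page, so none is assumed here, and the usage example feeds in canned responses instead of calling the network.

```python
import time


def wait_for_result(fetch_result, task_id, interval=1.0, max_attempts=60):
    """Poll until the task completes and return its output URLs.

    fetch_result is a caller-supplied function (hypothetical) that takes a
    task ID and returns the decoded JSON response as a dict.
    """
    for _ in range(max_attempts):
        data = fetch_result(task_id)["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        # "created" and "processing" mean the task is still running.
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish in time")


# Stand-in for a real HTTP call, returning canned responses in order.
_responses = iter([
    {"code": 200, "message": "success",
     "data": {"id": "abc", "status": "processing", "outputs": []}},
    {"code": 200, "message": "success",
     "data": {"id": "abc", "status": "completed",
              "outputs": ["https://example.com/track.mp3"]}},
])
urls = wait_for_result(lambda task_id: next(_responses), "abc", interval=0)
```

Passing the fetch function in as an argument keeps the polling logic testable and independent of any particular HTTP client.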

  
© 2025 WaveSpeedAI. All rights reserved.