Stable Audio 3 Music API

Stable Audio 3 Music is a fast AI music generation model that creates music from text prompts with controllable duration and output format. It is available as a ready-to-use REST inference API for background music, creator content, video soundtracks, advertising audio, game music, and professional text-to-music workflows, with simple integration, no cold starts, and affordable pricing.

Examples

30-second instrumental cue: smoky trip-hop noir with tremolo guitar, dusty Rhodes chords, and heavy slow drums. Clean stereo mix, memorable motif, no vocals, no lyrics.

README

Stability AI Stable Audio 3 Music

Stability AI Stable Audio 3 Music generates music from a natural-language prompt, with controls for duration, negative prompting, inference steps, guidance strength, and output format. It is suitable for background music, soundtrack ideation, content scoring, trailer cues, and other prompt-driven music generation workflows.

Why Choose This?

  • Prompt-based music generation
    Generate original music from a text description of mood, genre, instrumentation, and arrangement.

  • Flexible duration control
    Choose the target music length from short clips to longer pieces up to 120 seconds.

  • Negative prompt support
    Use negative_prompt to steer the model away from unwanted instruments, moods, or qualities.

  • Generation controls
    Adjust num_inference_steps and guidance_scale to balance prompt adherence and output behavior.

  • Multiple output formats
    Export results in mp3, wav, flac, ogg, opus, m4a, or aac.

  • Production-ready API
    Suitable for videos, podcasts, trailers, games, social content, and music prototyping workflows.

Parameters

Parameter            Required  Description
prompt               Yes       Text prompt describing the music style, mood, instruments, and arrangement.
duration             No        Target audio duration in seconds. Range: 1–120. Default: 30.
negative_prompt      No        Optional terms to avoid in the generated music.
num_inference_steps  No        Number of inference steps. Range: 1–100. Default: 8.
guidance_scale       No        Prompt guidance strength. Range: 0–25. Default: 1.
output_format        No        Output audio format. Supported values: mp3, wav, flac, ogg, opus, m4a, aac. Default: mp3.
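
The ranges and defaults in the table can be checked client-side before a request is sent. The following is a minimal sketch, not part of any official SDK: the helper name build_music_input is hypothetical, and it assumes duration and num_inference_steps are integers, which the table does not state explicitly.

```python
from typing import Any, Dict, Optional

# Supported output formats, as listed in the parameter table.
ALLOWED_FORMATS = {"mp3", "wav", "flac", "ogg", "opus", "m4a", "aac"}


def build_music_input(
    prompt: str,
    duration: int = 30,
    negative_prompt: Optional[str] = None,
    num_inference_steps: int = 8,
    guidance_scale: float = 1,
    output_format: str = "mp3",
) -> Dict[str, Any]:
    """Validate inputs against the documented ranges and return a JSON-ready dict.

    Hypothetical helper for illustration only.
    """
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if not 1 <= duration <= 120:
        raise ValueError("duration must be between 1 and 120 seconds")
    if not 1 <= num_inference_steps <= 100:
        raise ValueError("num_inference_steps must be between 1 and 100")
    if not 0 <= guidance_scale <= 25:
        raise ValueError("guidance_scale must be between 0 and 25")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"output_format must be one of {sorted(ALLOWED_FORMATS)}")

    payload: Dict[str, Any] = {
        "prompt": prompt.strip(),
        "duration": duration,
        "num_inference_steps": num_inference_steps,
        "guidance_scale": guidance_scale,
        "output_format": output_format,
    }
    # negative_prompt is optional, so it is only included when provided.
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload
```

Calling build_music_input("smoky trip-hop noir, no vocals", duration=45) returns a dict that can be sent as the JSON request body; an out-of-range value raises ValueError before any request is made.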

How to Use

  1. Write your prompt — describe the genre, mood, instrumentation, arrangement, and production feel you want.
  2. Set duration (optional) — choose how many seconds of music to generate.
  3. Add a negative prompt (optional) — describe sounds or qualities you want to avoid.
  4. Adjust generation controls (optional) — tune num_inference_steps and guidance_scale if needed.
  5. Choose output format — select the audio format that best fits your workflow.
  6. Submit — run the model and download the generated music.

Example Prompt

Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production

Pricing

Just $0.0217 per request.

Billing Rules

  • Each music generation request costs $0.0217
  • Pricing is fixed per request
  • duration, negative_prompt, num_inference_steps, guidance_scale, and output_format do not affect pricing
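
Because the price is fixed per request, budgeting reduces to simple arithmetic. A small sketch using the $0.0217 figure from this section; the function names are illustrative, and Decimal is used to avoid floating-point rounding on currency values.

```python
from decimal import Decimal

# Fixed price per request in USD, taken from the pricing section above.
PRICE_PER_REQUEST = Decimal("0.0217")


def total_cost(num_requests: int) -> Decimal:
    """Return the total cost in USD for a number of generation requests."""
    if num_requests < 0:
        raise ValueError("num_requests must be non-negative")
    return PRICE_PER_REQUEST * num_requests


def requests_for_budget(budget_usd: str) -> int:
    """Return how many whole requests fit in a budget given as a decimal string."""
    return int(Decimal(budget_usd) // PRICE_PER_REQUEST)
```

For example, requests_for_budget("1") evaluates to 46, and total_cost(100) comes to $2.17.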

Best Use Cases

  • Background music generation — Create music beds for videos, podcasts, and social content.
  • Trailer and ad concepts — Generate cinematic music ideas for promos and campaigns.
  • Game and app audio — Produce original music for interactive or ambient playback.
  • Music prototyping — Explore multiple soundtrack directions quickly from prompts.
  • Content production — Generate original music for creator and brand workflows.

Pro Tips

  • Be specific in your prompt about genre, tempo, instrumentation, and emotional tone.
  • Use negative_prompt when you want to avoid vocals, heavy drums, distortion, or certain styles.
  • Increase num_inference_steps if you want potentially more refined output and can tolerate more runtime.
  • Adjust guidance_scale when you want tighter prompt adherence.
  • Start with a short, clear prompt before adding more arrangement detail.
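
One way to keep prompts specific and consistent is to assemble them from named parts. This is only an illustrative sketch: compose_prompt is a hypothetical helper, and the comma-separated layout is a convention chosen here, not a format the model requires.

```python
from typing import Iterable, Optional


def compose_prompt(
    genre: str,
    mood: str,
    instruments: Iterable[str],
    tempo: Optional[str] = None,
    extras: Iterable[str] = (),
) -> str:
    """Join genre, mood, instrumentation, tempo, and extra notes into one prompt."""
    # Start with a short, clear core description.
    parts = [f"{mood} {genre} track".strip()]
    # Then add arrangement detail, skipping anything left empty.
    parts.extend(instruments)
    if tempo:
        parts.append(tempo)
    parts.extend(extras)
    return ", ".join(p.strip() for p in parts if p and p.strip())
```

Starting from compose_prompt("orchestral", "uplifting", ["soft piano"]) and adding instruments or extras one at a time makes it easier to hear which detail changed the result.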

Notes

  • prompt is required.
  • duration supports 1–120 seconds.
  • output_format defaults to mp3.
  • Pricing is fixed at $0.0217 per request.
  • This workflow is intended for music generation rather than general sound-effect generation.

Related Models

  • Stability AI Stable Audio 3 Text-to-Audio — Generate general audio and sound scenes from text prompts.
  • Stability AI Stable Audio 3 Audio-Outpainting — Extend an existing audio clip before and/or after the source.
  • Stability AI Stable Audio 3 Audio-Inpainting — Replace a selected region inside an existing audio clip.
Availability: This page uses AI models provided by third parties.

Stable Audio 3 Music API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/stability-ai/stable-audio-3/music with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Stable Audio 3 Music below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/stability-ai/stable-audio-3/music" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production",
    "duration": 30,
    "negative_prompt": "vocals, heavy drums, distortion",
    "num_inference_steps": 8,
    "guidance_scale": 1,
    "output_format": "mp3"
}'

# The response includes a prediction id. Substitute it for {request_id} and poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// Top-level await is not available in a CommonJS module, so wrap the call.
async function main() {
  const result = await client.run("stability-ai/stable-audio-3/music", {
    prompt: "Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production",
    duration: 30,
    negative_prompt: "vocals, heavy drums, distortion",
    num_inference_steps: 8,
    guidance_scale: 1,
    output_format: "mp3"
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "stability-ai/stable-audio-3/music",
    {
        "prompt": "Cinematic emotional orchestral track with soft piano, warm strings, subtle percussion, slow build, uplifting trailer mood, polished modern production",
        "duration": 30,
        "negative_prompt": "vocals, heavy drums, distortion",
        "num_inference_steps": 8,
        "guidance_scale": 1,
        "output_format": "mp3",
    },
)

print(output["outputs"][0])  # → URL of the generated output
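
For readers who prefer not to use the SDK, the submit-and-poll flow from the quick start can be sketched with the Python standard library alone. The endpoint URLs, headers, "completed" status, and data.outputs[0] path come from this page; the "failed" status value, the timeout defaults, and all function names are assumptions made for illustration.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL_PATH = "stability-ai/stable-audio-3/music"


def build_submit_request(api_key: str, payload: Dict[str, Any]) -> urllib.request.Request:
    """Build (but do not send) the POST request that submits a prediction."""
    return urllib.request.Request(
        f"{API_BASE}/{MODEL_PATH}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def fetch_result(api_key: str, request_id: str) -> Dict[str, Any]:
    """Fetch the current prediction result as parsed JSON (performs a network call)."""
    req = urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_output(
    fetch: Callable[[], Dict[str, Any]],
    interval: float = 1.0,
    timeout: float = 120.0,
) -> str:
    """Call fetch() repeatedly until status is "completed", then return data.outputs[0]."""
    deadline = time.monotonic() + timeout
    while True:
        data = fetch().get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed failure status, not confirmed by this page
            raise RuntimeError(f"prediction failed: {data}")
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not complete in time")
        time.sleep(interval)
```

In use, the request from build_submit_request would be sent with urllib.request.urlopen, the prediction id read from the response (the exact field name is not shown on this page, so check the API reference), and then wait_for_output(lambda: fetch_result(api_key, request_id)) would return the output URL. Passing the fetch step as a callable keeps the polling loop easy to test without network access.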

Stable Audio 3 Music API — Frequently asked questions

What is the Stable Audio 3 Music API?

Stable Audio 3 Music is a Stability AI music generation model, exposed as a REST API on WaveSpeedAI. It creates music from text prompts with controllable duration and output format, and suits background music, creator content, video soundtracks, advertising audio, game music, and professional text-to-music workflows. You can call it programmatically or try it from the playground above.

How do I call the Stable Audio 3 Music API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/stability-ai/stability-ai-stable-audio-3-music.

How much does Stable Audio 3 Music cost per run?

Stable Audio 3 Music costs $0.0217 per run (about $0.022). Per the billing rules in the README above, pricing is fixed per request: duration, negative_prompt, num_inference_steps, guidance_scale, and output_format do not change the charge. The cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Stable Audio 3 Music accept?

Key inputs: `prompt`, `duration`, `guidance_scale`, `num_inference_steps`, `negative_prompt`, `output_format`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/stability-ai/stability-ai-stable-audio-3-music.

How do I get started with the Stable Audio 3 Music API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Stable Audio 3 Music outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Stability AI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.