Sonilo Text-to-Music API

Sonilo Text-to-Music generates music directly from a natural-language prompt, with controllable output duration up to 360 seconds. It is designed for background music creation, soundtrack ideation, content scoring, and other prompt-driven music generation workflows.

Why Choose This?

Prompt-based music generation: Create music from a text description of style, mood, instrumentation, and arrangement.
Flexible duration control: Choose the target music length, from short clips to longer pieces up to 360 seconds.
Simple workflow: Provide a prompt, choose a duration, and generate music with minimal setup.
Useful for many creative workflows: Suitable for videos, ads, games, social content, trailers, and prototype soundtracks.
Production-ready API: Easy to integrate into music generation tools, creator workflows, and media pipelines.

Parameters

Parameter	Required	Description
prompt	Yes	Text prompt describing the music style, mood, instruments, and arrangement.
duration	Yes	Target music duration in seconds. Range: `1–360`. Default: `30`.
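
The constraints in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of any SDK: `build_payload` is a hypothetical helper, and treating `duration` as a whole number of seconds is an assumption (the table gives only the range).

```python
MIN_DURATION_S = 1
MAX_DURATION_S = 360


def build_payload(prompt: str, duration: int) -> dict:
    """Return a JSON-ready request body after checking the documented limits."""
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be non-empty text")
    # Assumption: duration is an integer. bool is a subclass of int, so reject it.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be a whole number of seconds")
    if not MIN_DURATION_S <= duration <= MAX_DURATION_S:
        raise ValueError(
            f"duration must be between {MIN_DURATION_S} and {MAX_DURATION_S} seconds"
        )
    # Surrounding whitespace is trimmed; the prompt text itself is left unchanged.
    return {"prompt": prompt.strip(), "duration": duration}
```

Rejecting out-of-range values locally avoids spending a request on input the table already rules out.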

How to Use

Write your prompt — describe the genre, mood, instrumentation, pacing, and production feel you want.
Set duration — choose how many seconds of music to generate.
Submit — run the model and download the generated music.

Example Prompt

Cinematic emotional orchestral music with soft piano, warm strings, slow build, inspiring trailer mood, modern polished production

Pricing

Pricing is based on the selected duration.

Duration	Cost
1s	$0.0025
10s	$0.025
30s	$0.075
60s	$0.15
120s	$0.30
300s	$0.75
360s	$0.90

Billing Rules

Pricing is $0.0025 per second
Billing is based on the selected duration
Maximum billed duration is 360 seconds
prompt does not affect pricing
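
The billing rules reduce to one multiplication, which can be useful for budgeting a batch of generations. This is a small sketch using the per-second rate stated above; `estimate_cost` is a hypothetical helper name, and `Decimal` is used only to avoid floating-point rounding in currency arithmetic.

```python
from decimal import Decimal

PRICE_PER_SECOND = Decimal("0.0025")  # USD per second, from the billing rules
MAX_BILLED_SECONDS = 360


def estimate_cost(duration_s: int) -> Decimal:
    """Estimate the charge in USD for one run of the selected duration."""
    if not 1 <= duration_s <= MAX_BILLED_SECONDS:
        raise ValueError("duration must be between 1 and 360 seconds")
    return PRICE_PER_SECOND * duration_s


if __name__ == "__main__":
    for seconds in (1, 30, 120, 360):
        print(f"{seconds}s costs ${estimate_cost(seconds)}")
```

The results agree with the pricing table, since every row there is the duration multiplied by $0.0025.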

Best Use Cases

Background music generation — Create music beds for videos, podcasts, and social content.
Trailer and ad concepts — Generate soundtrack ideas for marketing or promo edits.
Game and app audio — Produce music for interactive or ambient playback.
Music prototyping — Explore multiple soundtrack directions quickly from prompts.
Content production — Create original music for creator and brand workflows.

Pro Tips

Be specific in your prompt about genre, mood, instrumentation, and pacing.
Mention arrangement cues like intro, build, drop, chorus, or ambient bed when needed.
Start with shorter durations to validate the musical direction before generating longer pieces.
Use concise prompts when you want tighter control, and broader prompts when you want more interpretive results.
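
One way to apply these tips consistently is to assemble the prompt from named ingredients. The sketch below is purely illustrative: the API accepts free text, and `compose_prompt` and its parameters are hypothetical names, not something the model requires.

```python
from typing import Sequence


def compose_prompt(
    genre: str,
    mood: str,
    instrumentation: Sequence[str],
    pacing: str,
    arrangement_cues: Sequence[str] = (),
) -> str:
    """Join the prompt ingredients into one comma-separated description."""
    instruments = ", ".join(instrumentation)
    parts = [
        f"{mood} {genre}",
        f"with {instruments}" if instruments else "",
        pacing,
        *arrangement_cues,
    ]
    # Skip any ingredient that was left empty.
    return ", ".join(part.strip() for part in parts if part.strip())


if __name__ == "__main__":
    print(
        compose_prompt(
            genre="orchestral music",
            mood="cinematic emotional",
            instrumentation=["soft piano", "warm strings"],
            pacing="slow build",
            arrangement_cues=["inspiring trailer mood"],
        )
    )
```

Keeping the ingredients separate makes it easy to vary one of them, such as the mood, while holding the rest fixed across short test generations.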

Notes

Both prompt and duration are required.
duration supports values from 1 to 360 seconds.
Pricing depends only on the selected duration.
Longer generated music may be more suitable for trailers, background scoring, and extended content workflows.

Related Models

Sonilo Video-to-Music — Generate music that matches an uploaded video.
Other prompt-based music generation workflows — Useful when you want alternate music-generation styles or model behavior.
Video sound design workflows — Useful when you need synchronized effects instead of generated music.

Text To Music API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/sonilo/text-to-music with your input as JSON. The endpoint returns a prediction id. Poll the result endpoint with that id, starting at roughly one request every 2 seconds and lengthening the interval for long-running tasks, and stop on any terminal status (completed, failed, cancelled, or timeout). When the status is completed, read the output URLs from data.outputs. Examples follow.

HTTP example

# Submit the prediction
curl --fail-with-body --connect-timeout 10 --max-time 60 \
  -X POST "https://api.wavespeed.ai/api/v3/sonilo/text-to-music" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "Cinematic emotional orchestral music with soft piano, warm strings, slow build, inspiring trailer mood, modern polished production",
    "duration": 30
  }'

# Wait at least 2 seconds, then poll. Safe GET requests may be retried.
# REQUEST_ID holds the prediction id returned by the submit call.
curl --fail-with-body --connect-timeout 10 --max-time 30 \
  --retry 4 --retry-all-errors --retry-delay 1 \
  -X GET "https://api.wavespeed.ai/api/v3/predictions/$REQUEST_ID/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# Start at 2 seconds and increase the interval for long-running tasks.
# Stop on completed, failed, cancelled, or timeout.
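
The polling guidance above (start at 2 seconds, lengthen the interval, stop on a terminal status) can be sketched as a small loop. This is an illustration under assumptions, not the SDK's implementation: `poll_until_terminal` and `fetch_result` are hypothetical names, the status is assumed to sit at `data.status` next to `data.outputs`, and the backoff factor and interval cap are arbitrary choices.

```python
import time
from typing import Callable

# Terminal statuses as listed in the HTTP example comments.
TERMINAL_STATUSES = {"completed", "failed", "cancelled", "timeout"}


def poll_until_terminal(
    fetch_result: Callable[[], dict],
    initial_interval: float = 2.0,
    max_interval: float = 30.0,
    backoff: float = 1.5,
    deadline_s: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_result with a growing delay until the status is terminal.

    fetch_result is a caller-supplied function that performs the GET on the
    result endpoint and returns the parsed JSON body.
    """
    interval = initial_interval
    started = time.monotonic()
    while True:
        sleep(interval)  # wait before every poll, including the first one
        data = fetch_result()["data"]  # assumed shape: {"data": {"status": ...}}
        if data.get("status") in TERMINAL_STATUSES:
            return data
        if time.monotonic() - started >= deadline_s:
            raise TimeoutError("gave up waiting for a terminal status")
        # Lengthen the interval for long-running tasks, up to a cap.
        interval = min(interval * backoff, max_interval)
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays. On a `completed` result, the caller would then read the URLs from the returned `outputs` field.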

Node.js example

// npm install wavespeed
const { Client } = require('wavespeed');

const apiKey = process.env.WAVESPEED_API_KEY;
if (!apiKey) throw new Error('Set WAVESPEED_API_KEY');
const client = new Client(apiKey);

// CommonJS modules do not support top-level await, so run inside an async function.
async function main() {
  try {
    const result = await client.run('sonilo/text-to-music', {
      prompt: 'Cinematic emotional orchestral music with soft piano, warm strings, slow build, inspiring trailer mood, modern polished production',
      duration: 30,
    }, {
      timeout: 3600,
      pollInterval: 2.0,
    });
    console.log(result.outputs);
  } catch (error) {
    console.error('Generation failed:', error);
    process.exitCode = 1;
  }
}

main();

Python example

# pip install wavespeed
import os
from wavespeed import Client

client = Client(api_key=os.environ["WAVESPEED_API_KEY"])

try:
    output = client.run(
        "sonilo/text-to-music",
        {
            "prompt": "Cinematic emotional orchestral music with soft piano, warm strings, slow build, inspiring trailer mood, modern polished production",
            "duration": 30,
        },
        timeout=3600.0,
        poll_interval=2.0,
    )
    print(output["outputs"])
except Exception as error:
    raise SystemExit(f"Generation failed: {error}") from error
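
The example above prints the output URLs; saving the generated music locally is a separate step. The sketch below uses only the standard library and assumes that `outputs` is a list of directly downloadable URLs. `download_outputs` is a hypothetical helper, and the file name is taken from the URL path because the audio format is not stated here.

```python
import os
import shutil
import urllib.request
from urllib.parse import urlparse


def download_outputs(urls, dest_dir="."):
    """Download each URL into dest_dir and return the saved file paths."""
    os.makedirs(dest_dir, exist_ok=True)
    saved = []
    for index, url in enumerate(urls):
        # Derive the file name from the URL path; fall back to a numbered name.
        name = os.path.basename(urlparse(url).path) or f"output_{index}"
        path = os.path.join(dest_dir, name)
        with urllib.request.urlopen(url, timeout=60) as response, open(path, "wb") as handle:
            shutil.copyfileobj(response, handle)  # stream to disk in chunks
        saved.append(path)
    return saved
```

With the Python example above, this would be called as `download_outputs(output["outputs"], "music")` after a successful run.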

Text To Music API — Frequently asked questions

What is the Text To Music API?

Text To Music is a Sonilo model for audio generation, exposed as a REST API on WaveSpeedAI. It is a fast AI music generation model that creates full music tracks from text prompts with manual duration control. Typical uses include background music, creator content, video soundtracks, advertising audio, and social media content. The API is ready to use, with simple integration, no cold starts, and affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Text To Music API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID. Poll the result endpoint starting around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. The playground generates production-oriented Python, JavaScript, and cURL examples with timeouts, transient-error handling, and safe GET retries. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-text-to-music.

How much does Text To Music cost per run?

Text To Music starts at about $0.003 per run. Billing is $0.0025 per second of selected duration, so a 1-second run is the minimum and a 360-second run costs $0.90. Duration is the only input that affects the price; the prompt does not. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Text To Music accept?

Key inputs: `prompt`, `duration`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-text-to-music.

How do I get started with the Text To Music API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Text To Music outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Sonilo). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.
