Sonilo Text-to-Music is a fast AI music generation model that creates full music tracks from text prompts with manual duration control. Ready-to-use REST inference API for AI music generation, background music, creator content, video soundtracks, advertising audio, social media content, and professional text-to-music workflows with simple integration, no coldstarts, and affordable pricing.
就緒
$0.0025每次運行·~400 / $1
Premium futuristic electronic instrumental music for a technology brand film, clean synth arpeggios, subtle bass pulse, elegant minimal rhythm, inspiring and innovative mood, polished commercial sound design, no vocals, no lyrics
Sonilo Text-to-Music generates music directly from a natural-language prompt, with controllable output duration up to 360 seconds. It is designed for background music creation, soundtrack ideation, content scoring, and other prompt-driven music generation workflows.
Prompt-based music generation Create music from a text description of style, mood, instrumentation, and arrangement.
Flexible duration control
Choose the target music length from short clips to longer pieces up to 360 seconds.
Simple workflow Provide a prompt, choose a duration, and generate music with minimal setup.
Useful for many creative workflows Suitable for videos, ads, games, social content, trailers, and prototype soundtracks.
Production-ready API Easy to integrate into music generation tools, creator workflows, and media pipelines.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text prompt describing the music style, mood, instruments, and arrangement. |
| duration | Yes | Target music duration in seconds. Range: 1–360. Default: 30. |
Cinematic emotional orchestral music with soft piano, warm strings, slow build, inspiring trailer mood, modern polished production
Pricing is based on the selected duration.
| Duration | Cost |
|---|---|
| 1s | $0.0025 |
| 10s | $0.025 |
| 30s | $0.075 |
| 60s | $0.15 |
| 120s | $0.30 |
| 300s | $0.75 |
| 360s | $0.90 |
durationprompt does not affect pricingprompt and duration are required.duration supports values from 1 to 360 seconds.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/sonilo/text-to-music with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Text To Music below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/sonilo/text-to-music" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 30
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("sonilo/text-to-music", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 30
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"sonilo/text-to-music",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 30
}
)
print(output["outputs"][0]) # → URL of the generated outputText To Music is a Sonilo model for audio generation, exposed as a REST API on WaveSpeedAI. Sonilo Text-to-Music is a fast AI music generation model that creates full music tracks from text prompts with manual duration control. Ready-to-use REST inference API for AI music generation, background music, creator content, video soundtracks, advertising audio, social media content, and professional text-to-music workflows with simple integration, no coldstarts, and affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-text-to-music.
Text To Music starts at $0.003 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `duration`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-text-to-music.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (Sonilo). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.