Seedance 2.0 20% OFF | Create in Video Generator →

Sonilo Video-to-Music API

sonilo /

Sonilo Video-to-Music is a fast AI music generation model that creates background music synced to an input video’s mood, pacing, and scene transitions. Ready-to-use REST inference API for video soundtracks, social media content, advertising creatives, cinematic clips, product videos, creator workflows, and professional video-to-music generation with simple integration, no coldstarts, and affordable pricing.

video-dubbing
Input

Drag & drop or click to upload

Idle

$0.009per run·~111 / $1

ExamplesView all

Related Models

README

Sonilo Video-to-Music

Sonilo Video-to-Music generates music from a video input, with an optional style prompt to guide the soundtrack. It is designed for turning visual content into matching background music for short films, ads, social content, trailers, highlight clips, and other video-driven audio workflows.

Why Choose This?

  • Video-driven music generation
    Generate music that fits the pacing and feel of an uploaded video.

  • Optional style guidance
    Add a prompt to steer the mood, genre, instrumentation, or production style of the generated music.

  • Simple workflow
    Upload one video, optionally add a style prompt, and generate a matching music track.

  • Supports longer inputs
    Works with videos up to 360 seconds.

  • Production-ready API
    Suitable for trailers, branded content, social videos, cinematic edits, and background scoring workflows.

Parameters

ParameterRequiredDescription
videoYesInput video URL. Maximum supported video length is 360 seconds.
promptNoOptional style prompt for the generated music.

How to Use

  1. Upload your video — provide the source video you want to score with music.
  2. Add a style prompt (optional) — describe the mood, genre, instrumentation, or production feel you want.
  3. Submit — run the model and download the generated music.

Example Prompt

Cinematic emotional orchestral score with soft piano, warm strings, slow build, inspiring and modern trailer mood

Pricing

Pricing is based on the uploaded video duration.

Billing Rules

  • Pricing is $0.009 per billed second
  • Billing is based on the uploaded video duration
  • Billed duration is rounded up to the next whole second
  • Minimum billed duration is 1 second
  • Maximum billed duration is 360 seconds
  • prompt does not affect pricing

Example Costs

Video DurationCost
1s$0.009
5s$0.045
10s$0.090
30s$0.270
60s$0.540
120s$1.080
360s$3.240

Best Use Cases

  • Social media videos — Generate music beds for short-form clips.
  • Ads and promos — Create matching soundtrack material for branded content.
  • Trailers and highlights — Add cinematic or energetic music to visual edits.
  • Creator workflows — Quickly generate background music for uploaded video content.
  • Prototype scoring — Explore soundtrack directions before final post-production.

Pro Tips

  • Use a style prompt when you want stronger control over genre, mood, or instrumentation.
  • Keep the prompt focused and specific for more predictable results.
  • Shorter videos are useful for quickly testing soundtrack direction before scoring longer content.
  • Upload the cleanest final or near-final edit possible so the music better matches pacing and structure.

Notes

  • video is required.
  • Maximum supported video length is 360 seconds.
  • Pricing depends only on billed video duration.
  • Video duration is rounded up to the next whole second for billing.

Related Models

  • Sonilo audio generation workflows — Useful when you need prompt-first music generation instead of video-driven scoring.
  • Background music generation workflows — Useful when you need standalone music without a video input.
  • Video sound design workflows — Useful when you want synchronized effects instead of generated music.
Accessibility:This website uses AI models provided by third parties.

Video To Music API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/sonilo/video-to-music with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Video To Music below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/sonilo/video-to-music" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "video": "https://example.com/your-input.mp4",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("sonilo/video-to-music", {
        "video": "https://example.com/your-input.mp4",
        "prompt": "A cinematic shot of a city at sunset, soft golden light"
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "sonilo/video-to-music",
    {
    "video": "https://example.com/your-input.mp4",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}
)

print(output["outputs"][0])  # → URL of the generated output

Video To Music API — Frequently asked questions

What is the Video To Music API?

Video To Music is a Sonilo model for AI inference, exposed as a REST API on WaveSpeedAI. Sonilo Video-to-Music is a fast AI music generation model that creates background music synced to an input video’s mood, pacing, and scene transitions. Ready-to-use REST inference API for video soundtracks, social media content, advertising creatives, cinematic clips, product videos, creator workflows, and professional video-to-music generation with simple integration, no coldstarts, and affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Video To Music API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-video-to-music.

How much does Video To Music cost per run?

Video To Music starts at $0.009 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Video To Music accept?

Key inputs: `prompt`, `video`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/sonilo/sonilo-video-to-music.

How do I get started with the Video To Music API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Video To Music outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Sonilo). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.