Mirelo AI SFX 1.6 Video-to-Video
Mirelo AI SFX 1.6 Video-to-Video is a fast AI audio generation model that creates synchronized sound effects for a video and returns the video with a new audio track. It supports clips up to 60 seconds. It is available as a ready-to-use REST inference API for video sound design, synced SFX generation, game trailers, social media clips, cinematic videos, product demos, and professional audio-for-video workflows, with simple integration, no cold starts, and affordable pricing.
Features
Mirelo AI SFX 1.6 Video-to-Video
Mirelo AI SFX 1.6 Video-to-Video generates synchronized sound effects for an uploaded video, with optional prompt guidance, multiple variations, and seed control for reproducibility. It is designed for adding or redesigning audio for short videos, trailers, demos, gameplay clips, and other visual content workflows.
Why Choose This?
- Video-to-sound workflow: Generate synchronized sound effects directly from video input.
- Prompt-guided audio generation: Add an optional text prompt to steer the type, mood, or intensity of the generated sound effects.
- Multiple variations: Generate up to 4 variations in one request with `num_samples`.
- Flexible audio duration: Choose how many seconds of SFX audio to generate, up to 60 seconds.
- Seed support: Use `seed` for more reproducible results, or `-1` for random generation.
- Production-ready API: Useful for sound design, trailer audio, short-form video, social content, and creative audio workflows.
Parameters
| Parameter | Required | Description |
|---|---|---|
| video | Yes | Video URL or uploaded video to add synchronized sound effects to. |
| prompt | No | Optional text prompt to guide the generated sound effects. |
| duration | No | Duration of the generated SFX audio in seconds. Range: 1–60. Default: 10. This does not extend the input video. |
| num_samples | No | Number of variations to generate. Range: 1–4. Default: 1. |
| seed | No | Seed for reproducibility. Use -1 for a random seed. Default: -1. |
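The parameter table above can be turned into a small client-side check before a request is sent. The following is a minimal sketch, not an official client: the helper name `build_payload` is hypothetical, and the upper seed bound (2147483647) is taken from the API request table later in this document.

```python
from typing import Any, Dict, Optional


def build_payload(
    video: str,
    prompt: Optional[str] = None,
    duration: float = 10,
    num_samples: int = 1,
    seed: int = -1,
) -> Dict[str, Any]:
    """Validate the documented ranges and return a JSON-ready request body.

    Hypothetical helper: the API itself performs its own validation.
    """
    if not video:
        raise ValueError("video is required")
    if not 1 <= duration <= 60:
        raise ValueError("duration must be between 1 and 60 seconds")
    if not 1 <= num_samples <= 4:
        raise ValueError("num_samples must be between 1 and 4")
    if not -1 <= seed <= 2147483647:
        raise ValueError("seed must be -1 (random) or between 0 and 2147483647")

    payload: Dict[str, Any] = {
        "video": video,
        "duration": duration,
        "num_samples": num_samples,
        "seed": seed,
    }
    # prompt is optional, so it is only included when one was given
    if prompt:
        payload["prompt"] = prompt
    return payload
```

Checking ranges locally avoids spending a round trip on a request that the documented limits would reject anyway.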
How to Use
- Upload your video: provide the source video you want to add sound effects to.
- Add a prompt (optional): describe the sound design, mood, or effect direction you want.
- Set duration: choose how many seconds of synchronized SFX audio to generate.
- Set number of samples: choose how many variations you want, from 1 to 4.
- Set a seed (optional): use `-1` for random output, or a fixed value for more reproducible results.
- Submit: run the model and download the generated audio outputs.
Example Prompt
Cinematic trailer sound design with deep impacts, airy risers, subtle whooshes, and tense low-end atmosphere
Pricing
Pricing is based on generated SFX duration and number of samples.
| Duration | 1 Sample | 2 Samples | 3 Samples | 4 Samples |
|---|---|---|---|---|
| 1s | $0.01 | $0.02 | $0.03 | $0.04 |
| 5s | $0.05 | $0.10 | $0.15 | $0.20 |
| 10s | $0.10 | $0.20 | $0.30 | $0.40 |
| 20s | $0.20 | $0.40 | $0.60 | $0.80 |
| 30s | $0.30 | $0.60 | $0.90 | $1.20 |
| 60s | $0.60 | $1.20 | $1.80 | $2.40 |
Billing Rules
- Pricing is $0.01 per generated second, per sample
- Pricing increases with both the selected `duration` and `num_samples`
- `prompt` and `seed` do not affect pricing
- Billing is based on the generated SFX audio duration, not the source video duration
- The selected `duration` does not extend the input video
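The pricing table and billing rules reduce to a single multiplication: $0.01 × duration × number of samples. A minimal sketch of that arithmetic follows. The function name `estimate_cost` is hypothetical, and whole-second durations are an assumption, since the page does not say how fractional seconds are billed.

```python
from decimal import Decimal

# USD per generated second, per sample, as listed in the pricing table
PRICE_PER_SECOND = Decimal("0.01")


def estimate_cost(duration: int, num_samples: int = 1) -> Decimal:
    """Estimate the price in USD of one request (hypothetical helper).

    Assumes whole-second durations; prompt and seed do not affect the price.
    """
    if not 1 <= duration <= 60:
        raise ValueError("duration must be between 1 and 60 seconds")
    if not 1 <= num_samples <= 4:
        raise ValueError("num_samples must be between 1 and 4")
    return PRICE_PER_SECOND * duration * num_samples


print(estimate_cost(10, 2))  # 0.20, matching the 10s / 2 Samples cell
print(estimate_cost(60, 4))  # 2.40, the most expensive single request
```

`Decimal` is used instead of floats so the results match the table exactly, without binary rounding noise.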
Best Use Cases
- Trailer and teaser sound design — Add cinematic effects to short video clips.
- Social content enhancement — Generate sound layers for reels, shorts, and promotional videos.
- Game and app previews — Add designed effects to gameplay or demo footage.
- Prototype audio direction — Quickly explore multiple sound directions from the same video.
- Creative post-production — Generate synced SFX for short-form editing workflows.
Pro Tips
- Use a prompt only when you want stronger control over the sound style or mood.
- Increase `num_samples` when you want multiple design options from the same clip.
- Start with shorter durations for testing, then extend if the direction feels right.
- Use a fixed `seed` when comparing prompt changes on the same source video.
- Clear visual motion in the video usually helps produce more meaningful synchronized sound effects.
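The fixed-seed tip can be sketched as a set of request bodies that differ only in `prompt`, so that any change in the output is more likely to come from the prompt than from random variation. This is an illustrative sketch: the helper name `prompt_variants`, the seed value 1234, and the example URL are all assumptions.

```python
from typing import Any, Dict, List


def prompt_variants(
    video: str,
    prompts: List[str],
    seed: int = 1234,   # arbitrary fixed seed, chosen only for illustration
    duration: int = 5,  # short duration keeps test runs cheap
) -> List[Dict[str, Any]]:
    """Build one request body per prompt, holding every other field constant."""
    return [
        {
            "video": video,
            "prompt": prompt,
            "duration": duration,
            "num_samples": 1,
            "seed": seed,
        }
        for prompt in prompts
    ]


bodies = prompt_variants(
    "https://example.com/clip.mp4",  # placeholder URL
    ["deep cinematic impacts", "light, airy whooshes"],
)
print(len(bodies))  # 2
```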
Notes
- `video` is required.
- `duration` supports 1–60 seconds.
- `num_samples` supports 1–4.
- `seed = -1` means random generation.
- The generated `duration` refers to the SFX audio length only and does not extend the uploaded video.
- Pricing depends on generated duration and sample count.
Related Models
- Mirelo AI SFX 1.6 Text-to-Audio — Generate sound effects or ambient audio directly from text prompts.
- Mirelo AI SFX 1.6 Extend Audio — Extend an existing audio clip with newly generated continuation.
- Mirelo AI SFX 1.6 Inpaint Audio — Regenerate a selected segment inside an existing audio clip.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task (the video URL below is a placeholder; replace it with your own)
curl --location --request POST "https://api.wavespeed.ai/api/v3/mirelo-ai/sfx-1.6/video-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
  "video": "https://example.com/input.mp4",
  "duration": 10,
  "num_samples": 1,
  "seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
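The two curl calls above can be mirrored in Python using only the standard library. The sketch below is a rough equivalent rather than an official SDK: the function names `submit_request`, `result_request`, and `send` are hypothetical, and the 30-second timeout is an arbitrary choice. The URLs and headers are copied from the curl commands.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL_PATH = "mirelo-ai/sfx-1.6/video-to-video"


def submit_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that submits a task."""
    return urllib.request.Request(
        f"{API_BASE}/{MODEL_PATH}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build (but do not send) the GET request that fetches a task result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

A typical flow would call `send(submit_request(api_key, payload))`, read `data.id` from the response, and then call `send(result_request(api_key, task_id))` until the task finishes. Building requests separately from sending them keeps the URL and header logic easy to inspect without network access.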
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| video | string | Yes | - | - | Video URL or uploaded video to add synchronized sound effects to. The generated duration can be up to 60 seconds. |
| prompt | string | No | - | - | Optional text prompt to guide the generated sound effects. |
| duration | number | No | 10 | 1 ~ 60 | Duration of the generated SFX audio in seconds. This does not extend the input video. |
| num_samples | integer | No | 1 | 1 ~ 4 | Number of variations to generate. |
| seed | integer | No | -1 | -1 ~ 2147483647 | Seed for reproducibility. Use -1 for a random seed. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction (the task ID that was requested) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
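Because `data.status` moves through created, processing, and then completed or failed, a client usually polls the result endpoint until a terminal status appears. The sketch below shows one way to interpret the documented fields. The names `outputs_if_done` and `wait_for_outputs`, the one-second interval, and the attempt limit are assumptions, and `fetch` stands in for whatever function performs the actual GET request.

```python
import time
from typing import Any, Callable, Dict, List, Optional


def outputs_if_done(response: Dict[str, Any]) -> Optional[List[str]]:
    """Return output URLs for a completed task, or None while it is pending."""
    data = response.get("data", {})
    status = data.get("status")
    if status == "completed":
        return list(data.get("outputs", []))
    if status == "failed":
        raise RuntimeError(data.get("error") or "task failed")
    # "created" or "processing": the result is not ready yet
    return None


def wait_for_outputs(
    fetch: Callable[[], Dict[str, Any]],
    interval: float = 1.0,
    max_attempts: int = 60,
) -> List[str]:
    """Call fetch() repeatedly until the task completes, fails, or times out."""
    for _ in range(max_attempts):
        outputs = outputs_if_done(fetch())
        if outputs is not None:
            return outputs
        time.sleep(interval)
    raise TimeoutError("task did not finish within the allowed attempts")
```

Passing `fetch` in as a callable keeps the polling logic independent of any particular HTTP library and makes it easy to exercise with canned responses.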