Audio for Video

Add music, voiceovers, and sound effects to your videos with WaveSpeedAI’s audio-for-video tools.

Our selection

wavespeed-ai/mmaudio-v2
video-dubbing

MMAudio v2 produces synchronized audio from video or text inputs, ideal for adding soundtracks to videos when paired with video models. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

All models

8 models
wavespeed-ai/mmaudio-v2
video-dubbing

MMAudio v2 produces synchronized audio from video or text inputs, ideal for adding soundtracks to videos when paired with video models. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

kwaivgi/kling-video-to-audio
video-dubbing

Kling Video-to-Audio auto-generates or extracts matching sound effects and audio tracks from video using KlingAI's audio generation model. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

kwaivgi/kling-text-to-audio
text-to-audio

Kling Text-to-Audio turns text prompts into custom sound effects for videos, games, and multimedia using KlingAI's audio model. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/hunyuan-video-foley
video-dubbing

HunyuanVideo-Foley generates realistic Foley and ambient audio for an uploaded video, guided by a text prompt that describes the desired sounds. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/ace-step/prompt-to-audio
text-to-audio

ACE-Step Prompt-to-Audio creates music from simple prompts, auto-generating genre tags and lyrics for quick song creation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

mirelo-ai/sfx-v1.5/video-to-video
video-dubbing

Mirelo SFX V1.5 generates sound effects and audio synchronized to any input video. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

elevenlabs/dubbing
video-dubbing

ElevenLabs Dubbing automatically translates and dubs video or audio content into other languages while preserving the original speakers' voices. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

mirelo-ai/sfx-v1/video-to-audio
video-to-audio

Mirelo SFX V1 Video-to-Audio generates synchronized sound effects from video input, guided by a text prompt. Supports generating multiple samples and a customizable duration. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Audio for Video API — pricing & performance

Run any model in the Audio for Video collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.
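As a rough illustration of the single-endpoint pattern, the sketch below builds (but does not send) a POST request for one model in the collection. The base URL, the `Authorization: Bearer` header, and the `video` and `prompt` payload fields are assumptions made for illustration, not details stated on this page; each model page is the place to confirm the actual request schema.

```python
import json
import urllib.request

# Assumed base URL; confirm against the official API documentation.
API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(model_id: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build a POST request for one model; the caller decides when to send it."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{model_id}",
        data=body,
        method="POST",
        headers={
            # Assumed auth scheme: bearer token in the Authorization header.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


# Hypothetical payload fields; each model defines its own inputs.
request = build_request(
    "wavespeed-ai/mmaudio-v2",
    {"video": "https://example.com/clip.mp4", "prompt": "rain on a tin roof"},
    api_key="YOUR_API_KEY",
)
# Send with urllib.request.urlopen(request) once the key and payload are real.
```

Because only the model identifier in the path changes, the same helper would cover any model listed above, assuming they share one URL scheme.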

Why run Audio for Video on WaveSpeedAI

Transparent pricing

Per-call pricing for every Audio for Video model. The price is listed on each model page — no platform fees on top.

Optimized for low latency

Most Audio for Video image models complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Audio for Video API cost?

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.
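To make per-successful-generation billing concrete, here is a minimal sketch of how a bill could be estimated on the client side. The price of $0.05 per call is a made-up figure used only for the arithmetic; real prices are listed on each model page.

```python
from decimal import Decimal


def billed_total(price_per_call: Decimal, outcomes: list[bool]) -> Decimal:
    """Sum the cost of successful generations only; failed calls cost nothing."""
    successful_calls = sum(outcomes)  # True counts as 1, False as 0
    return price_per_call * successful_calls


# Hypothetical price and outcomes: three calls, one of which failed.
total = billed_total(Decimal("0.05"), [True, False, True])
print(total)  # 0.10
```

`Decimal` is used instead of `float` so that currency amounts stay exact.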

How fast are Audio for Video models on WaveSpeedAI?

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account receives $1 in free credit at sign-up, enough to try most Audio for Video models without a credit card.

Are there rate limits?

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.
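One way a client might stay under a concurrent-job limit is to cap the number of in-flight requests with a semaphore. The sketch below is only an illustration: the limit of 2 is an arbitrary placeholder (the page does not state the actual limits), and `fake_generation` is a hypothetical stand-in for a real API call.

```python
import asyncio


async def run_bounded(jobs, max_concurrent: int):
    """Run coroutine factories with at most max_concurrent in flight."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job):
        async with semaphore:
            return await job()

    # gather returns results in the same order as the submitted jobs.
    return await asyncio.gather(*(guarded(job) for job in jobs))


# Counters used only to observe how many jobs overlap.
in_flight = 0
peak = 0


async def fake_generation(index: int) -> int:
    """Hypothetical stand-in for one generation request."""
    global in_flight, peak
    in_flight += 1
    peak = max(peak, in_flight)
    await asyncio.sleep(0.01)  # simulates waiting on the network
    in_flight -= 1
    return index


results = asyncio.run(
    run_bounded([lambda i=i: fake_generation(i) for i in range(6)], max_concurrent=2)
)
```

Setting `max_concurrent` to the limit of the account keeps bursts of submissions from exceeding it, at the cost of queuing the extra jobs locally.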