
Add music, voiceovers, and sound effects to your videos with WaveSpeedAI’s audio-for-video tools.

MMaudio v2 produces synchronized audio from video or text inputs, ideal for adding soundtracks to videos when paired with video models. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Kling Video-to-Audio auto-generates or extracts matching sound effects and audio tracks from video using KlingAI's audio generation model. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Kling Text-to-Audio turns text prompts into custom sound effects for videos, games, and multimedia using KlingAI's audio model. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

HunyuanVideo-Foley generates realistic Foley and ambient audio from an uploaded video using a text prompt to describe desired sounds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

ACE-Step Prompt-to-Audio creates music from simple prompts, auto-generating genre tags and lyrics for quick song creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Mirelo SFX V1.5 generates synchronized sound effects and audio for any video, producing synced SFX to enhance visuals. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

ElevenLabs Dubbing automatically translates and dubs video/audio content into different languages while preserving the original speakers' voices. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Mirelo SFX V1 Video-to-Audio generates synchronized sound effects from video input with text prompt guidance. Supports multiple sample generation and customizable duration. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
透過單一 REST API 執行 Audio for Video 系列中的任何模型。按生成計費 — 無訂閱、無最低消費 — 在可用率 99.9% 的基礎架構上提供業界領先的延遲。
每個 Audio for Video 模型都採按呼叫計費。價格列在每個模型的頁面上 — 不會額外加收平台費。
大多數 Audio for Video 影像模型在 2 秒內完成。影片與 3D 模型比自架方案快數倍。
多區域故障轉移與自動重試可在供應商故障期間 — 仍將您的生產流量保持線上。
每個模型在其模型頁面上都列有自己的按呼叫價格。我們按每次成功生成計費,沒有訂閱費或最低消費。
本系列中的影像模型通常在 2 秒內完成。影片與 3D 模型取決於長度與解析度,但通常比自架執行快數倍。
可以 — 每個帳戶註冊時即可獲得 $1 的免費額度,足以在不使用信用卡的情況下試用大多數 Audio for Video 模型。
標準帳戶具有充足的並行任務限制。Enterprise 方案提供自訂 RPM、更高並行性和專屬容量 — 詳情請聯繫業務。