Kling Text-To-Audio | Generate Sound Effects From Text For Games And Video | WaveSpeedAI

Home/Explore/Kling Models/kwaivgi/kling-text-to-audio

text-to-audio

text-to-audio

kwaivgi/kling-text-to-audio

Kling Text-to-Audio turns text prompts into custom sound effects for videos, games, and multimedia using KlingAI's audio model. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Enable Safety Checker

Idle

Your request will cost $0.035 per run.

For $1 you can run this model approximately 28 times.

ExamplesView all

README

Kuaivgi — Kling Text-to-SFX

Generate cinematic sound effects directly from text. Describe the scene or action, and Kling creates matching foley, ambience, risers, booms, whooshes, and textures—perfect for trailers, shorts, games, podcasts, and multimedia projects.

Key Features

Text-to-audio SFX with scene-aware textures and timing
Wide palette: weather, impacts, machinery, footsteps, creatures, atmospheres
Clean renders ready for layering and post-mix
Fast iteration for cue sheets and temp tracks

Parameters

prompt

Describe what you want to hear. Example: Cold winter night with howling wind across barren fields; deep gusts; distant creaks; approaching snowstorm tension.
duration

Length of the generated SFX bed in seconds.

How to Use

Write a concise, concrete prompt naming sources, space, and mood.
Set the duration to match your shot or loop length.
Run and download the audio. Trim or loop in your DAW as needed.

Output

Single SFX track aligned to your requested duration.
Format follows platform defaults with a downloadable URL.

Pricing

Just $0.035 per run!!!

Prompting Tips

Call out materials and distance: metal gate clang close, wood door thud mid, crowd murmur far.
Add pacing: slow build, big hit at 0:08, decay to silence.
For loops, keep the ending sparse or symmetrical for seamless repeats.
Generate stems by running separate prompts for ambience, impacts, and ear-candy, then mix.