Home/Explore/Kling Video Models/kwaivgi/kling-text-to-audio

text-to-audio

kwaivgi/kling-text-to-audio

Generate sound effects from text descriptions using KlingAI's advanced audio generation model. Perfect for creating custom sound effects for videos, games, and multimedia projects.

Idle

Your request will cost $0.035 per run.

For $1 you can run this model approximately 28 times.

ExamplesView all

README

Kuaivgi — Kling Text-to-SFX

Generate cinematic sound effects directly from text. Describe the scene or action, and Kling creates matching foley, ambience, risers, booms, whooshes, and textures—perfect for trailers, shorts, games, podcasts, and multimedia projects.

Key Features

  • Text-to-audio SFX with scene-aware textures and timing
  • Wide palette: weather, impacts, machinery, footsteps, creatures, atmospheres
  • Clean renders ready for layering and post-mix
  • Fast iteration for cue sheets and temp tracks

Parameters

  • prompt

    Describe what you want to hear. Example: Cold winter night with howling wind across barren fields; deep gusts; distant creaks; approaching snowstorm tension.

  • duration

    Length of the generated SFX bed in seconds.

How to Use

  1. Write a concise, concrete prompt naming sources, space, and mood.
  2. Set the duration to match your shot or loop length.
  3. Run and download the audio. Trim or loop in your DAW as needed.

Output

  • Single SFX track aligned to your requested duration.
  • Format follows platform defaults with a downloadable URL.

Pricing

  • Just $0.035 per run!!!

Prompting Tips

  • Call out materials and distance: metal gate clang close, wood door thud mid, crowd murmur far.
  • Add pacing: slow build, big hit at 0:08, decay to silence.
  • For loops, keep the ending sparse or symmetrical for seamless repeats.
  • Generate stems by running separate prompts for ambience, impacts, and ear-candy, then mix.