Kling Text-To-Audio | Generate Sound Effects From Text For Games And Video | WaveSpeedAI

kwaivgi/kling-text-to-audio

Kling Text-to-Audio turns text prompts into custom sound effects for videos, games, and multimedia using KlingAI's audio model. It is served through a ready-to-use REST inference API: best performance, no cold starts, affordable pricing.

Example prompt: Cold winter night with howling northern wind sweeping across barren fields and forests, deep and chilling gusts creating a lonely, tense atmosphere as if a snowstorm is approaching.

Your request will cost $0.035 per run.

For $1 you can run this model approximately 28 times.

README

Kwaivgi: Kling Text-to-SFX

Generate cinematic sound effects directly from text. Describe the scene or action, and Kling creates matching foley, ambience, risers, booms, whooshes, and textures. The results suit trailers, shorts, games, podcasts, and multimedia projects.

Key Features

  • Text-to-audio SFX with scene-aware textures and timing
  • Wide palette: weather, impacts, machinery, footsteps, creatures, atmospheres
  • Clean renders ready for layering and post-mix
  • Fast iteration for cue sheets and temp tracks

Parameters

  • prompt

    Describe what you want to hear. Example: Cold winter night with howling wind across barren fields; deep gusts; distant creaks; approaching snowstorm tension.

  • duration

    Length of the generated SFX bed in seconds.
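
As a rough illustration, the two parameters could be assembled into a request body like this. The field names `prompt` and `duration` come from the list above; the helper `build_payload`, the JSON shape, and the validation rules (non-empty prompt, positive duration) are assumptions made for this sketch, not documented API behavior.

```python
import json


def build_payload(prompt: str, duration: float) -> dict:
    """Assemble the two documented parameters into a dict (hypothetical helper)."""
    prompt = prompt.strip()
    if not prompt:
        raise ValueError("prompt must not be empty")
    # Assumption: duration is a positive number of seconds. The allowed
    # range is not stated in this README, so only the sign is checked.
    if duration <= 0:
        raise ValueError("duration must be positive")
    return {"prompt": prompt, "duration": duration}


payload = build_payload(
    "Cold winter night with howling wind across barren fields; deep gusts; "
    "distant creaks; approaching snowstorm tension.",
    duration=10,
)
print(json.dumps(payload, indent=2))
```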

How to Use

  1. Write a concise, concrete prompt naming sources, space, and mood.
  2. Set the duration to match your shot or loop length.
  3. Run and download the audio. Trim or loop in your DAW as needed.
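
The steps above might translate into a prepared HTTP request along these lines. This is a sketch only: the URL is a placeholder, and the bearer-token header, the environment variable name, and the JSON body shape are assumptions rather than the documented API. Check the platform's API reference for the real endpoint and authentication scheme. The request is built but not sent.

```python
import json
import os
import urllib.request

# Placeholder, NOT the real endpoint: replace with the URL from the API docs.
API_URL = "https://api.example.com/kwaivgi/kling-text-to-audio"


def prepare_request(prompt: str, duration: int, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying prompt and duration."""
    body = json.dumps({"prompt": prompt, "duration": duration}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


request = prepare_request(
    "Metal gate clang close in a stone courtyard, tense mood",
    duration=5,
    # EXAMPLE_API_KEY is a hypothetical variable name used for illustration.
    api_key=os.environ.get("EXAMPLE_API_KEY", "YOUR_API_KEY"),
)
print(request.get_method(), request.full_url)
```

Once the real endpoint and response format are known, sending the request with `urllib.request.urlopen(request)` and downloading the returned audio URL would complete step 3.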

Output

  • Single SFX track aligned to your requested duration.
  • The audio format follows platform defaults, and the result is delivered as a downloadable URL.

Pricing

  • $0.035 per run.
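
For budgeting, the per-run price can be turned into a quick estimate. The sketch below assumes the flat $0.035 per run stated on this page, with no volume discounts or duration-based pricing; the function names are illustrative. `Decimal` is used so the arithmetic stays exact.

```python
from decimal import Decimal

PRICE_PER_RUN = Decimal("0.035")  # USD per run, as stated on this page


def runs_for_budget(budget_usd: Decimal) -> int:
    """Number of whole runs that fit in a budget (hypothetical helper)."""
    return int(budget_usd // PRICE_PER_RUN)


def cost_of_runs(runs: int) -> Decimal:
    """Total cost in USD for a given number of runs (hypothetical helper)."""
    return PRICE_PER_RUN * runs


print(runs_for_budget(Decimal("1")))  # 28
print(cost_of_runs(3))                # 0.105
```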

Prompting Tips

  • Call out materials and distance: metal gate clang close, wood door thud mid, crowd murmur far.
  • Add pacing: slow build, big hit at 0:08, decay to silence.
  • For loops, keep the ending sparse or symmetrical for seamless repeats.
  • Generate stems by running separate prompts for ambience, impacts, and ear-candy, then mix.
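
The stem workflow from the last tip could be organized as one job per layer, for example as below. The prompts, the `build_stem_jobs` helper, and the `stem` key are illustrative assumptions; `stem` is a local label for bookkeeping, not an API field. Only `prompt` and `duration` correspond to the parameters documented above.

```python
# Illustrative prompts, one per layer, following the tips above
# (materials, distance, pacing).
STEM_PROMPTS = {
    "ambience": "Wide night forest ambience, light wind in pines, crickets far",
    "impacts": "Wood door thud mid distance, slow build, big hit, decay to silence",
    "ear_candy": "Short metallic shimmer whooshes close, sparse, airy",
}


def build_stem_jobs(stem_prompts: dict, duration: int) -> list:
    """Return one job dict per stem, all sharing the same duration."""
    return [
        {"stem": name, "prompt": prompt, "duration": duration}
        for name, prompt in stem_prompts.items()
    ]


jobs = build_stem_jobs(STEM_PROMPTS, duration=10)
for job in jobs:
    print(job["stem"], "->", job["prompt"])
```

Keeping every stem at the same duration makes the rendered files line up when they are layered in a DAW.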