Kwaivgi Kling Text To Audio

Generate sound effects from text descriptions using KlingAI’s advanced audio generation model. Perfect for creating custom sound effects for videos, games, and multimedia projects.

Features

Kwaivgi Kling Text-to-SFX

Generate cinematic sound effects directly from text. Describe the scene or action, and Kling creates matching foley, ambience, risers, booms, whooshes, and textures for trailers, shorts, games, podcasts, and multimedia projects.


Key Features

  • Text-to-audio SFX with scene-aware textures and timing
  • Wide palette: weather, impacts, machinery, footsteps, creatures, atmospheres
  • Clean renders ready for layering and post-mix
  • Fast iteration for cue sheets and temp tracks

Parameters

  • prompt

    Describe what you want to hear, in at most 200 characters. Example: Cold winter night with howling wind across barren fields; deep gusts; distant creaks; approaching snowstorm tension.

  • duration

    Length of the generated audio in seconds, from 3 to 10 (one decimal place supported).


How to Use

  1. Write a concise, concrete prompt naming sources, space, and mood.
  2. Set the duration to match your shot or loop length.
  3. Run and download the audio. Trim or loop in your DAW as needed.

Output

  • Single SFX track aligned to your requested duration.
  • Delivered as a downloadable URL; the file format follows platform defaults.

Pricing

  • $0.035 per run.
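
To budget a session, the per-run price can be multiplied by the number of runs you expect. The sketch below assumes the price is flat per run regardless of duration, which is how the line above reads but is not confirmed elsewhere on this page. `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# USD per run, taken from the Pricing section.
# Assumption: the price does not vary with duration.
PRICE_PER_RUN = Decimal("0.035")


def estimate_cost(runs: int) -> Decimal:
    """Return the total cost in USD for the given number of runs."""
    if runs < 0:
        raise ValueError("runs must be non-negative")
    return PRICE_PER_RUN * runs


# Example: 3 stems with 4 prompt iterations each is 12 runs.
print(estimate_cost(3 * 4))  # 0.420
```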

Prompting Tips

  • Call out materials and distance: metal gate clang close, wood door thud mid, crowd murmur far.
  • Add pacing: slow build, big hit at 0:08, decay to silence.
  • For loops, keep the ending sparse or symmetrical for seamless repeats.
  • Generate stems by running separate prompts for ambience, impacts, and ear-candy, then mix.
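
The stem workflow in the last tip could be prepared along these lines: one request payload per stem, all sharing the same scene and duration. This is only a sketch. `build_stem_payloads` and the stem hint texts are hypothetical, and the 200-character check assumes the API counts characters the way Python's `len` does.

```python
MAX_PROMPT_CHARS = 200  # prompt limit stated in the request parameter table

# Illustrative stem descriptions (assumptions); adjust them to your scene.
STEM_HINTS = {
    "ambience": "steady background bed, wide and distant, no sudden hits",
    "impacts": "isolated close hits with short decay, no background bed",
    "ear_candy": "sparse subtle details, light whooshes and textures",
}


def build_stem_payloads(scene: str, duration: float) -> dict[str, dict]:
    """Return one request payload per stem for the same scene."""
    payloads = {}
    for stem, hint in STEM_HINTS.items():
        prompt = f"{scene}; {hint}"
        if len(prompt) > MAX_PROMPT_CHARS:
            raise ValueError(
                f"{stem} prompt is {len(prompt)} characters, "
                f"limit is {MAX_PROMPT_CHARS}"
            )
        payloads[stem] = {"prompt": prompt, "duration": duration}
    return payloads
```

Each payload is then submitted as its own run, and the resulting files are mixed in your DAW.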

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-text-to-audio" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "prompt": "Cold winter night with howling northern wind sweeping across barren fields and forests, deep and chilling gusts creating a lonely, tense atmosphere as if a snowstorm is approaching.",
    "duration": 10
}'

# Get the result (set requestId to the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
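
For callers who prefer Python, the two curl calls above might translate to the standard-library sketch below. The URLs and headers are copied from the curl commands; the function names, the 30-second timeout, and the assumption that both endpoints return a JSON body are mine, not part of the documented API.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def build_submit_request(api_key: str, prompt: str, duration: float) -> urllib.request.Request:
    """Prepare the POST request that submits a text-to-audio task."""
    body = json.dumps({"prompt": prompt, "duration": duration}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/kwaivgi/kling-text-to-audio",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Prepare the GET request that fetches the result for a task ID."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# Usage (performs network calls, so it is left commented out):
#   key = os.environ["WAVESPEED_API_KEY"]
#   submitted = send(build_submit_request(key, "Howling winter wind", 10))
#   result = send(build_result_request(key, submitted["data"]["id"]))
```

Building the request separately from sending it keeps the URL and header logic easy to inspect without touching the network.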

Parameters

Task Submission Parameters

Request Parameters

| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | | Text prompt for audio generation, maximum 200 characters |
| duration | number | Yes | 10 | 3 ~ 10 | Duration of the generated audio in seconds, range: 3.0 to 10.0 seconds, supports one decimal place |
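
The limits in the table can be checked client-side before spending a run. The following is a minimal sketch, with `validate_request` as a hypothetical helper. It assumes the 200-character limit counts characters as Python's `len` does, and it rejects durations with more than one decimal place; how the server actually treats such values is not stated here.

```python
def validate_request(prompt: str, duration: float) -> dict:
    """Check prompt and duration against the documented limits.

    Returns the request payload if both values are acceptable.
    """
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > 200:
        raise ValueError(f"prompt is {len(prompt)} characters, maximum is 200")
    if not 3.0 <= duration <= 10.0:
        raise ValueError("duration must be between 3.0 and 10.0 seconds")
    # Rounding to one decimal must leave the value unchanged.
    if round(duration, 1) != duration:
        raise ValueError("duration supports at most one decimal place")
    return {"prompt": prompt, "duration": duration}
```

For example, `validate_request("Metal gate clang, close", 7.5)` returns the payload, while a duration of `3.25` or `12` raises `ValueError`.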

Response Parameters

| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
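
Because `data.outputs` stays empty until `data.status` is `completed`, a client typically polls until the task reaches `completed` or `failed`. The sketch below shows one way to do that. It assumes the result endpoint returns the same `data` envelope as listed above, which this page does not confirm. `wait_for_outputs`, the one-second interval, and the attempt cap are arbitrary illustrative choices, and `fetch_result` stands for any caller-supplied function that returns the decoded JSON response.

```python
import time
from typing import Callable


def wait_for_outputs(
    fetch_result: Callable[[], dict],
    interval: float = 1.0,
    max_attempts: int = 60,
) -> list:
    """Poll until the task completes, then return the list of output URLs."""
    for _ in range(max_attempts):
        data = fetch_result()["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            # data.error carries the failure message when one is available.
            raise RuntimeError(data.get("error") or "task failed")
        # Status is still created or processing: wait before the next attempt.
        time.sleep(interval)
    raise TimeoutError(f"task not finished after {max_attempts} attempts")
```

Passing the fetch step in as a function keeps the loop independent of any particular HTTP client.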

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.