Kwaivgi Kling Text To Audio
Generate sound effects from text descriptions using KlingAI’s advanced audio generation model. Perfect for creating custom sound effects for videos, games, and multimedia projects.
Features
Kwaivgi Kling Text-to-SFX
Generate cinematic sound effects directly from text. Describe the scene or action, and Kling creates matching foley, ambience, risers, booms, whooshes, and textures for trailers, shorts, games, podcasts, and multimedia projects.
Key Features
- Text-to-audio SFX with scene-aware textures and timing
- Wide palette: weather, impacts, machinery, footsteps, creatures, atmospheres
- Clean renders ready for layering and post-mix
- Fast iteration for cue sheets and temp tracks
Parameters
- prompt: Describe what you want to hear. Example: Cold winter night with howling wind across barren fields; deep gusts; distant creaks; approaching snowstorm tension.
- duration: Length of the generated SFX bed in seconds.
How to Use
- Write a concise, concrete prompt naming sources, space, and mood.
- Set the duration to match your shot or loop length.
- Run and download the audio. Trim or loop in your DAW as needed.
Output
- Single SFX track aligned to your requested duration.
- The file format follows the platform default, and the result is delivered as a downloadable URL.
Pricing
- $0.035 per run.
Prompting Tips
- Call out materials and distance: metal gate clang close, wood door thud mid, crowd murmur far.
- Add pacing: slow build, big hit at 0:08, decay to silence.
- For loops, keep the ending sparse or symmetrical for seamless repeats.
- Generate stems by running separate prompts for ambience, impacts, and ear-candy, then mix.
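The stem workflow in the last tip can be planned before any audio is generated. The following is a minimal Python sketch, not part of the WaveSpeedAI API: `STEM_HINTS`, `plan_stems`, and `estimate_cost` are hypothetical names, and the hint wording only illustrates the materials-and-distance advice above. The per-run price is the one listed under Pricing.

```python
from decimal import Decimal

# Price per run as listed in the Pricing section.
PRICE_PER_RUN = Decimal("0.035")

# Hypothetical per-stem hints; adjust the wording to your own scene.
STEM_HINTS = {
    "ambience": "continuous background ambience, far",
    "impacts": "isolated impacts and hits, close",
    "ear-candy": "sparse detail sounds and textures, mid",
}


def plan_stems(scene: str, duration: float = 10) -> dict:
    """Return one request body per stem, keyed by a local stem name.

    The stem name is only a label for your own bookkeeping; each value
    holds just the two documented fields, prompt and duration.
    """
    return {
        name: {"prompt": f"{scene}; {hint}", "duration": duration}
        for name, hint in STEM_HINTS.items()
    }


def estimate_cost(runs: int) -> Decimal:
    """Estimate the total price in dollars for a number of runs."""
    return PRICE_PER_RUN * runs
```

With the three stems above, `estimate_cost(3)` evaluates to `Decimal("0.105")`, that is $0.105 at the listed price. Each combined prompt still has to fit within the 200-character prompt limit given in the request parameters table.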
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-text-to-audio" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"prompt": "Cold winter night with howling northern wind sweeping across barren fields and forests, deep and chilling gusts creating a lonely, tense atmosphere as if a snowstorm is approaching.",
"duration": 10
}'
# Get the result (requestId is the data.id returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
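The same two calls can be made from Python. This is a sketch using only the standard library, under a few assumptions that the text does not spell out: that `requestId` in the result URL is the `data.id` returned by the submit call, that the result endpoint returns the same `data` object described in the response parameters table, and that polling once per second for up to a minute is acceptable. The function names, `poll_seconds`, and `max_polls` are illustrative and not part of the API.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL_PATH = "kwaivgi/kling-text-to-audio"


def build_submit_request(api_key: str, prompt: str, duration: float) -> urllib.request.Request:
    """Build the POST request from the curl example without sending it."""
    body = json.dumps({"prompt": prompt, "duration": duration}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{MODEL_PATH}",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build the GET request that queries a task's result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


def generate(api_key: str, prompt: str, duration: float = 10,
             poll_seconds: float = 1.0, max_polls: int = 60) -> list:
    """Submit a task, then poll until it completes or fails."""
    submitted = send(build_submit_request(api_key, prompt, duration))
    request_id = submitted["data"]["id"]  # assumption: data.id is the requestId
    for _ in range(max_polls):
        data = send(build_result_request(api_key, request_id))["data"]
        if data["status"] == "completed":
            return data["outputs"]
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(poll_seconds)
    raise TimeoutError(f"task {request_id} did not finish after {max_polls} polls")
```

Calling `generate(os.environ["WAVESPEED_API_KEY"], "Rain on a tin roof", 5)` (after `import os`) would submit the task and return the output URLs once the status is `completed`. The request builders are kept separate from the network call so they can be inspected offline.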
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | - | Text prompt for audio generation, maximum 200 characters |
| duration | number | Yes | 10 | 3 ~ 10 | Duration of the generated audio in seconds, range: 3.0 to 10.0 seconds, supports one decimal place |
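The limits in the table can be checked on the client so that an out-of-range value is caught locally. This is a minimal sketch, assuming the 200-character limit is counted the way Python's `len` counts characters (the documentation does not say how the server counts) and that rounding to one decimal place is acceptable. `build_payload` is a hypothetical helper, not part of the API.

```python
def build_payload(prompt: str, duration: float = 10) -> dict:
    """Validate inputs against the documented limits and return the request body."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if len(prompt) > 200:
        raise ValueError(
            f"prompt is {len(prompt)} characters; the documented maximum is 200"
        )
    if not 3.0 <= duration <= 10.0:
        raise ValueError("duration must be between 3.0 and 10.0 seconds")
    # The table says one decimal place is supported, so round rather than reject.
    return {"prompt": prompt, "duration": round(float(duration), 1)}
```

For example, `build_payload("Rain on a tin roof", 5)` returns a body with `duration` set to `5.0`, while a 2-second duration or a 201-character prompt raises `ValueError` before any request is made.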
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
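To show how these fields fit together, here is a small Python sketch that reduces a response to the values a caller usually needs. `summarize_result` is a hypothetical helper, and `SAMPLE_RESPONSE` is made-up data shaped like the table, not real API output.

```python
# Made-up example shaped like the response table; every value is illustrative.
SAMPLE_RESPONSE = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "example-task-id",
        "model": "kwaivgi/kling-text-to-audio",
        "outputs": ["https://example.com/generated-audio"],
        "has_nsfw_contents": [False],
        "status": "completed",
        "error": "",
        "timings": {"inference": 1234},
    },
}


def summarize_result(response: dict) -> dict:
    """Pick out the task ID, completion state, output URLs, and inference time."""
    data = response.get("data", {})
    status = data.get("status")
    if status == "failed":
        raise RuntimeError(data.get("error") or "task failed without an error message")
    done = status == "completed"
    return {
        "id": data.get("id"),
        "done": done,
        # The table says outputs is empty until the status is completed.
        "outputs": data.get("outputs", []) if done else [],
        "inference_ms": data.get("timings", {}).get("inference"),
    }
```

A caller can keep polling while `done` is false and download the URLs in `outputs` once it turns true.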