Kling V1 TTS | Realistic Voice & TTS API

Kling V1 Text-to-Speech

Kling V1 TTS is a high-quality text-to-speech model that converts written text into natural, expressive audio. With multiple voice options and adjustable speed control, it produces lifelike speech perfect for voiceovers, content creation, and accessibility applications.

Why It Stands Out

Natural-sounding voices: Generate realistic speech with human-like intonation and expression.
Multiple voice options: Choose from a variety of voice profiles to match your content needs.
Speed control: Adjust speech rate to fit your pacing requirements.
Simple workflow: Just input text, select a voice, and generate audio instantly.
Cost-effective: Flat rate for short texts, scalable pricing for longer content.

Parameters

Parameter	Required	Description
text	Yes	The text you want to convert to speech.
voice_id	Yes	Voice profile to use (e.g., chat1_female_new-3).
speed	No	Speech rate multiplier (default: 1).

How to Use

Enter your text — type or paste the content you want to convert to speech.
Select a voice — choose from available voice profiles.
Adjust speed (optional) — set speech rate (lower = slower, higher = faster).
Click Run and wait for audio generation.
Preview and download the result.

Best Use Cases

Voiceovers — Create narration for videos, presentations, and tutorials.
Content Creation — Generate audio versions of articles, blogs, and scripts.
Advertising — Produce voice content for ads, promos, and announcements.
Accessibility — Convert written content to audio for visually impaired users.
E-learning — Create spoken content for courses and educational materials.
Podcasts & Audiobooks — Generate draft narration or supplementary audio.

Pricing

Text Length	Price
Up to 1000 characters	$0.10
Beyond 1000 characters	$0.10 per 1000 characters

Examples

500 characters → $0.10 (minimum)
1000 characters → $0.10
2500 characters → 2.5 × $0.10 = $0.25
5000 characters → 5 × $0.10 = $0.50

Pro Tips for Best Quality

Use proper punctuation — commas and periods help create natural pauses.
Break long content into shorter segments for better pacing.
Test different voice profiles to find the best match for your content.
Adjust speed based on content type — slower for educational, faster for energetic ads.
Avoid excessive abbreviations or special characters that may affect pronunciation.

Notes

Processing time varies based on text length and current queue load.
Please ensure your content complies with usage guidelines.

Kling v1 Tts API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-tts with your input as JSON. The endpoint returns a prediction id. Start polling the result endpoint around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. On completed, read output values from data.outputs. Examples for Kling v1 Tts below.

HTTP example

set -euo pipefail

: "${WAVESPEED_API_KEY:?Set WAVESPEED_API_KEY}"

REQUEST_BODY=$(cat <<'JSON'
{
    "text": "A clear example input",
    "voice_id": "genshin_vindi2",
    "speed": 1
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-tts" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d "$REQUEST_BODY")

TASK=$(printf '%s' "$SUBMIT_RESPONSE" | jq 'if has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "$TASK" | jq -r '.id')
if [ -z "$PREDICTION_ID" ] || [ "$PREDICTION_ID" = "null" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "$TASK" | jq -r '.urls.get // empty')
if [ -z "$RESULT_URL" ]; then
  RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/$PREDICTION_ID/result"
fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body "$RESULT_URL" \
    -H "Authorization: Bearer $WAVESPEED_API_KEY")
  RESULT=$(printf '%s' "$RESPONSE" | jq 'if has("data") then .data else . end')
  STATUS=$(printf '%s' "$RESULT" | jq -r '.status')
  case "$STATUS" in
    completed) printf '%s\n' "$RESULT" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "$RESULT" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "$STATUS" >&2; exit 1 ;;
  esac
done

Node.js example

const submitUrl = "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-tts";
const apiKey = process.env.WAVESPEED_API_KEY;
if (!apiKey) throw new Error('Set WAVESPEED_API_KEY');

async function requestJson(url, options = {}) {
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(await response.text());
  return response.json();
}

// 1. Submit the prediction.
const body = await requestJson(submitUrl, {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
        "text": "A clear example input",
        "voice_id": "genshin_vindi2",
        "speed": 1
}),
});
const task = body.data ?? body;
if (!task.id) throw new Error("Submission response did not contain a prediction id");
const resultUrl = task.urls?.get ||
  `https://api.wavespeed.ai/api/v3/predictions/${task.id}/result`;

// 2. Poll until the prediction finishes.
while (true) {
  const resultBody = await requestJson(resultUrl, {
    headers: { "Authorization": `Bearer ${apiKey}` },
  });
  const result = resultBody.data ?? resultBody;
  if (result.status === "completed") {
    console.log(result.outputs);
    break;
  }
  if (["failed", "cancelled", "timeout"].includes(result.status)) throw new Error(JSON.stringify(result));
  if (!["created", "processing"].includes(result.status)) throw new Error("Unexpected status: " + result.status);
  await new Promise(resolve => setTimeout(resolve, 2000));
}

Python example

import json
import os
import time
from urllib.request import Request, urlopen

api_key = os.environ["WAVESPEED_API_KEY"]
headers = {"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"}
payload = {
    "text": "A clear example input",
    "voice_id": "genshin_vindi2",
    "speed": 1
}

def request_json(url, data=None):
    request = Request(url, data=data, headers=headers, method="POST" if data else "GET")
    with urlopen(request) as response:
        return json.load(response)

# 1. Submit the prediction.
body = request_json("https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-tts", json.dumps(payload).encode())
task = body.get("data", body)
if not task.get("id"):
    raise RuntimeError("Submission response did not contain a prediction id")
result_url = task.get("urls", {}).get("get") or f"https://api.wavespeed.ai/api/v3/predictions/{task['id']}/result"

# 2. Poll until the prediction finishes.
while True:
    result_body = request_json(result_url)
    result = result_body.get("data", result_body)
    status = result.get("status")
    if status == "completed":
        print(result.get("outputs", []))
        break
    if status in {"failed", "cancelled", "timeout"}:
        raise RuntimeError(result)
    if status not in {"created", "processing"}:
        raise RuntimeError(f"Unexpected status: {status}")
    time.sleep(2)

Kling v1 Tts API — Frequently asked questions

What is the Kling v1 Tts API?

Kling v1 Tts is a Kuaishou model for audio generation, exposed as a REST API on WaveSpeedAI. Kling V1 TTS creates natural-sounding audio and supports KlingAI image, video, sound effect, virtual model, and custom AI workflows. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Kling v1 Tts API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID. Poll the result endpoint starting around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. The playground generates production-oriented Python, JavaScript, and cURL examples with timeouts, transient-error handling, and safe GET retries. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v1-tts.

How much does Kling v1 Tts cost per run?

Kling v1 Tts starts at $0.10 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Kling v1 Tts accept?

Key inputs: `speed`, `text`, `voice_id`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v1-tts.

How long does Kling v1 Tts take to generate?

Median end-to-end generation time on WaveSpeedAI is around 5 seconds per request, based on recent successful runs. Queue time varies with global demand; live status is visible in the prediction record.

Can I use Kling v1 Tts outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

Ví dụXem tất cả

Mô hình liên quan

README

Kling V1 Text-to-Speech

Why It Stands Out

Parameters

How to Use

Best Use Cases

Pricing

Examples

Pro Tips for Best Quality

Notes

Kling v1 Tts API — Quick start

Kling v1 Tts API — Frequently asked questions

Tìm hiểu thêm

Pháp lý

Tài nguyên

Mô hình

Công cụ