Kling Omni Video O3 Standard Text-To-Video

kwaivgi/kling-video-o3-std/text-to-video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model, built on MVL (Multi-modal Visual Language) technology. Its Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding, and it supports audio generation. The model is available through a ready-to-use REST API with no cold starts and affordable pricing.

Your request will cost $0.90 per run.

With $10 you can run this model approximately 11 times.

README

Kling Video O3 Standard Text-to-Video

Kling Video O3 Standard is Kuaishou's advanced text-to-video model in the O3 family, delivering high-quality cinematic video from text descriptions. With optional synchronized sound generation, multiple aspect ratios, and flexible duration from 3 to 15 seconds, it offers a strong balance of quality and cost.

Why Choose This?

  • O3-level quality: Advanced visual fidelity and motion realism beyond V3.0 models.

  • Sound generation: Optional synchronized sound effects generated alongside the video.

  • Flexible duration: Generate videos of any length from 3 to 15 seconds.

  • Multiple aspect ratios: Support for 16:9, 9:16, and 1:1 to fit any platform.

  • Prompt Enhancer: Built-in tool to automatically improve your video descriptions.

Parameters

Parameter | Required | Description
prompt | Yes | Text description of the video scene and motion
aspect_ratio | No | Output ratio: 16:9 (default), 9:16, 1:1
duration | No | Video length: 3-15 seconds (default: 5)
sound | No | Generate synchronized sound (default: disabled)
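
As a rough illustration of the parameter table, the sketch below validates the four documented fields and fills in their defaults before a request is sent. The helper name `build_payload` is hypothetical, and treating `duration` as a whole number of seconds and `sound` as a boolean are assumptions; the table only gives the range and the defaults.

```python
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16", "1:1")
MIN_DURATION, MAX_DURATION = 3, 15


def build_payload(prompt, aspect_ratio="16:9", duration=5, sound=False):
    """Check the documented parameters and return a request body as a dict.

    Hypothetical helper: names and defaults follow the parameter table,
    but this is not part of any official client library.
    """
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be non-empty text")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")
    # Assumption: duration is an integer number of seconds.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be an integer number of seconds")
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError(f"duration must be between {MIN_DURATION} and {MAX_DURATION}")
    return {
        "prompt": prompt.strip(),
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "sound": bool(sound),  # assumption: sent as a boolean
    }
```

Calling `build_payload("A fox runs through fresh snow at dawn")` returns a body with `aspect_ratio` "16:9", `duration` 5, and `sound` disabled, matching the defaults listed above.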

How to Use

  1. Write your prompt — describe the scene, characters, motion, and style in detail.
  2. Select aspect ratio — match your target platform.
  3. Set duration — choose any length from 3 to 15 seconds.
  4. Enable sound (optional) — generate synchronized audio with the video.
  5. Run — submit and download your video.
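
The same steps can be sketched against the REST API. This page does not state the endpoint or the authentication scheme, so the base URL, the Bearer-token header, and the `submit` helper below are all assumptions to be checked against the provider's API reference; only the model identifier and the parameter names come from this page.

```python
import json
import urllib.request

MODEL_ID = "kwaivgi/kling-video-o3-std/text-to-video"
# Assumption: base URL and auth header are not given on this page.
API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(payload, api_key):
    """Build (but do not send) a POST request carrying the JSON payload."""
    return urllib.request.Request(
        url=f"{API_BASE}/{MODEL_ID}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


def submit(payload, api_key, timeout=60):
    """Send the request and return the decoded JSON response.

    The response format is not described on this page, so the result
    is returned as-is instead of being unpacked.
    """
    request = build_request(payload, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `submit({"prompt": "A paper boat drifting down a rainy street", "aspect_ratio": "9:16", "duration": 5, "sound": True}, api_key)` would cover steps 1 through 5. How the finished video is retrieved depends on the response format, which is not covered here.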

Pricing

Duration | Sound Off | Sound On
3s | $0.54 | $0.72
5s | $0.90 | $1.20
10s | $1.80 | $2.40
15s | $2.70 | $3.60

Billing Rules

  • Base rate: $0.90 per 5 seconds
  • Sound multiplier: disabled = 1×, enabled = 4/3×
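
The billing rules reduce to a single formula: cost = $0.90 × (duration / 5) × sound multiplier. The sketch below reproduces the pricing table from that formula. The function names are hypothetical, and linear per-second proration for durations not listed in the table (for example 7s) is an assumption, although it is consistent with every listed price.

```python
from fractions import Fraction

BASE_RATE_PER_5S = Fraction(90, 100)  # $0.90 per 5 seconds
SOUND_MULTIPLIER = {False: Fraction(1), True: Fraction(4, 3)}


def exact_cost(duration, sound=False):
    """Return the cost in dollars as an exact fraction."""
    if not 3 <= duration <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    # Assumption: cost scales linearly with duration between table rows.
    return BASE_RATE_PER_5S * Fraction(duration) / 5 * SOUND_MULTIPLIER[bool(sound)]


def estimate_cost(duration, sound=False):
    """Return the cost in dollars, rounded to cents."""
    return round(float(exact_cost(duration, sound)), 2)


def runs_for_budget(budget, duration=5, sound=False):
    """Return how many whole runs fit within a dollar budget."""
    return Fraction(budget) // exact_cost(duration, sound)
```

For example, `estimate_cost(10, sound=True)` gives 2.4, matching the 10s Sound On price, and `runs_for_budget(10)` gives 11 runs at the default 5 seconds without sound.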

Best Use Cases

  • Professional Content — High-quality videos at a more accessible price than O3 Pro.
  • Social Media — Create engaging videos for TikTok, Reels, and Stories.
  • Marketing Videos — Produce promotional content with optional sound.
  • Concept Visualization — Bring creative ideas to life from text.
  • Long-Form Scenes — Up to 15 seconds for extended scene development.

Pro Tips

  • Use the Prompt Enhancer to refine your descriptions automatically.
  • Match aspect ratio to your platform: 16:9 for YouTube, 9:16 for TikTok/Reels, 1:1 for Instagram.
  • Enable sound for a complete video experience with synchronized audio.
  • Be specific about camera movements, lighting, and atmosphere for best results.
  • Use shorter durations (3-5s) for testing, longer (10-15s) for final production.
  • Use O3 Standard for regular production; upgrade to O3 Pro for maximum quality.
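
The aspect-ratio and duration tips can be captured in a small lookup. This is a sketch only: the helper `settings_for` is hypothetical, and the specific draft and final durations (3s and 10s) are arbitrary picks from the suggested 3-5s and 10-15s ranges.

```python
# Platform-to-ratio mapping taken from the tips above.
PLATFORM_ASPECT_RATIO = {
    "youtube": "16:9",
    "tiktok": "9:16",
    "reels": "9:16",
    "instagram": "1:1",
}

# Hypothetical picks within the suggested testing and production ranges.
STAGE_DURATION = {"draft": 3, "final": 10}


def settings_for(platform, stage="draft"):
    """Return aspect_ratio and duration for a platform and workflow stage."""
    return {
        "aspect_ratio": PLATFORM_ASPECT_RATIO[platform.lower()],
        "duration": STAGE_DURATION[stage],
    }
```

Iterating on a prompt with `settings_for("tiktok")` keeps each test run at the cheapest duration, and switching to `settings_for("tiktok", "final")` once the prompt is settled produces the longer clip.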

Notes

  • Only prompt is required; other parameters have defaults.
  • Duration supports any value from 3 to 15 seconds.
  • Sound generation increases cost by approximately 33%.
