Kimi Contest
Explore/Kling Models

Kling Models

kwaivgi/kling-v3.0-pro/image-to-video

kwaivgi

kling-v3.0-pro/image-to-video

kwaivgi/kling-v3.0-std/text-to-video

kwaivgi

kling-v3.0-std/text-to-video

kwaivgi/kling-v3.0-std/image-to-video

kwaivgi

kling-v3.0-std/image-to-video

kwaivgi/kling-v3.0-pro/text-to-video

kwaivgi

kling-v3.0-pro/text-to-video

kwaivgi/kling-image-v3/edit
kwaivgi/kling-image-v3/edit

kwaivgi

kling-image-v3/edit

kwaivgi/kling-image-v3/text-to-image
kwaivgi/kling-image-v3/text-to-image

kwaivgi

kling-image-v3/text-to-image

kwaivgi/kling-v2.6-std/text-to-video

kwaivgi

kling-v2.6-std/text-to-video

kwaivgi/kling-v2.6-std/image-to-video

kwaivgi

kling-v2.6-std/image-to-video

kwaivgi/kling-v2.6-std/motion-control

kwaivgi

kling-v2.6-std/motion-control

kwaivgi/kling-v2.5-turbo-std/image-to-video

kwaivgi

kling-v2.5-turbo-std/image-to-video

kwaivgi/kling-video-o1-std/video-edit

kwaivgi

kling-video-o1-std/video-edit

kwaivgi/kling-video-o1-std/reference-to-video

kwaivgi

kling-video-o1-std/reference-to-video

kwaivgi/kling-video-o1-std/image-to-video

kwaivgi

kling-video-o1-std/image-to-video

kwaivgi/kling-video-o1/video-edit

kwaivgi

kling-video-o1/video-edit

kwaivgi/kling-image-o1
kwaivgi/kling-image-o1

kwaivgi

kling-image-o1

kwaivgi/kling-video-o1/text-to-video

kwaivgi

kling-video-o1/text-to-video

kwaivgi/kling-video-o1/reference-to-video

kwaivgi

kling-video-o1/reference-to-video

kwaivgi/kling-video-o1/video-edit-fast

kwaivgi

kling-video-o1/video-edit-fast

kwaivgi/kling-video-o1/image-to-video

kwaivgi

kling-video-o1/image-to-video

kwaivgi/kling-v2.5-turbo-pro/image-to-video

kwaivgi

kling-v2.5-turbo-pro/image-to-video

kwaivgi/kling-v2.5-turbo-pro/text-to-video

kwaivgi

kling-v2.5-turbo-pro/text-to-video

kwaivgi/kling-video-to-audio

kwaivgi

kling-video-to-audio

kwaivgi/kling-v1-ai-avatar-standard

kwaivgi

kling-v1-ai-avatar-standard

kwaivgi/kling-effects

kwaivgi

kling-effects

kwaivgi/kling-v2.1-i2v-master

kwaivgi

kling-v2.1-i2v-master

kwaivgi/kling-v2.1-t2v-master

kwaivgi

kling-v2.1-t2v-master

kwaivgi/kling-v2.0-i2v-master

kwaivgi

kling-v2.0-i2v-master

kwaivgi/kling-v2.0-t2v-master

kwaivgi

kling-v2.0-t2v-master

kwaivgi/kling-v2.6-pro/motion-control

kwaivgi

kling-v2.6-pro/motion-control

kwaivgi/kling-v2.6-pro/text-to-video

kwaivgi

kling-v2.6-pro/text-to-video

kwaivgi/kling-v2.6-pro/image-to-video

kwaivgi

kling-v2.6-pro/image-to-video

kwaivgi/kling-v2.1-i2v-pro

kwaivgi

kling-v2.1-i2v-pro

kwaivgi/kling-v1/ai-multi-shot
kwaivgi/kling-v1/ai-multi-shot

kwaivgi

kling-v1/ai-multi-shot

kwaivgi/kling-text-to-audio
kwaivgi/kling-text-to-audio

kwaivgi

kling-text-to-audio

kwaivgi/kling-lipsync/audio-to-video

kwaivgi

kling-lipsync/audio-to-video

kwaivgi/kling-lipsync/text-to-video

kwaivgi

kling-lipsync/text-to-video

kwaivgi/kling-v2.6/create-voice
kwaivgi/kling-v2.6/create-voice

kwaivgi

kling-v2.6/create-voice

kwaivgi/kling-v1.6-multi-i2v-pro

kwaivgi

kling-v1.6-multi-i2v-pro

kwaivgi/kling-v1.6-multi-i2v-standard

kwaivgi

kling-v1.6-multi-i2v-standard

kwaivgi/kling-video-o1-std/text-to-video

kwaivgi

kling-video-o1-std/text-to-video

kwaivgi/kling-v2.1-i2v-pro/start-end-frame

kwaivgi

kling-v2.1-i2v-pro/start-end-frame

kwaivgi/kling-v2-ai-avatar-pro

kwaivgi

kling-v2-ai-avatar-pro

kwaivgi/kling-v1-ai-avatar-pro

kwaivgi

kling-v1-ai-avatar-pro

kwaivgi/kling-v2-ai-avatar-standard

kwaivgi

kling-v2-ai-avatar-standard

kwaivgi/kling-v2.1-i2v-standard

kwaivgi

kling-v2.1-i2v-standard

kwaivgi/kling-v1.6-t2v-standard

kwaivgi

kling-v1.6-t2v-standard

kwaivgi/kling-v1-tts
kwaivgi/kling-v1-tts

kwaivgi

kling-v1-tts

kwaivgi/kling-v1.6-i2v-pro

kwaivgi

kling-v1.6-i2v-pro

kwaivgi/kling-v1.6-i2v-standard

kwaivgi

kling-v1.6-i2v-standard

Cinematic Video Quality

  1. Advanced rendering for natural motion, lighting, and atmospheric realism
  2. High-fidelity color reproduction and detailed textures

Semantic Understanding & Prompt Control

  1. Precise interpretation of visual intent
  2. Strong alignment between text prompts and generated video output

Unified Multimodal Architecture (v3.0)

  1. All-in-one model that merges text-to-video, image-to-video, reference-based generation, and audio synthesis under a single native training framework
  2. Eliminates the need for separate tools and post-production patching — the entire creative lifecycle from generation to refinement is handled in one stream

AI Director — Multi-Shot Storytelling (v3.0)

  1. Automatically interprets scene coverage and shot patterns from a single prompt
  2. Generates structured, rhythmic sequences with professional camera transitions in one generation cycle
  3. Supports up to 6 distinct camera cuts per video

Native Audio-Visual Co-Generation (v3.0)

  1. Character dialogue, environmental sound, and music produced simultaneously with video
  2. Character-specific voice referencing for multi-speaker scenes with accurate spatial attribution
  3. Supports bilingual/multilingual dialogue within the same scene

Continuous Version Evolution

  1. From v1.6 to v3.0
  2. Each update improves realism, generation speed, and creative flexibility — with v3.0 introducing unified multimodal architecture and native audio-visual co-generation

Outstanding Value

  1. Higher visual quality at significantly lower cost
  2. 25% cheaper than the Kling 2.1 Standard model

Professional-Grade Performance

  1. Matches the response quality of the 2.5 Turbo Pro text model
  2. Although output is 720p, the visual detail remains rich and suitable for most creative and commercial use cases

Kling Model Lineup

  1. Kling v3.0 Series

The most advanced generation in the Kling family. Built on a unified multimodal architecture that consolidates video generation, image creation, and audio synthesis into a single "all-in-one" model — a paradigm shift from task-specific clip generation to full narrative-level video production.

Key Capabilities:

  1. 15-second generation with custom duration control (3–15s) — the longest native generation in Kling's history
  2. Multi-Shot AI Director with up to 6 camera cuts per video, including automatic shot transitions driven by prompt-based scene direction
  3. Native audio-visual sync — dialogue, music, and sound effects co-generated alongside video in a single pass
  4. Elements 3.0 subject consistency — locks character identity across shots, camera angles, and scene transitions with multi-image and video-based references
  5. Multi-language lip-sync in Chinese, English, Japanese, Korean, and Spanish with dialect and accent support
  6. Physics-aware generation with improved handling of complex physical interactions and reduced artifacts
  7. Native text rendering for precise lettering in signage, captions, and advertising layouts

Pro Models

  1. kling-v3.0-pro/text-to-video — Flagship T2V model with maximum visual fidelity, cinematic motion control, native audio generation, and multi-shot AI Director capabilities. Ideal for high-end creative and commercial production
  2. kling-v3.0-pro/image-to-video — Premium I2V synthesis with superior subject consistency, detailed texture preservation, and audio-visual co-generation. Best for projects requiring precise character identity maintenance across sequences

Standard Models

  1. kling-v3.0-std/text-to-video — Cost-efficient T2V generation with smooth motion, strong prompt adherence, and reliable audio-visual output. Optimized for high-volume creative workflows
  2. kling-v3.0-std/image-to-video — Fast, affordable I2V conversion with consistent detail retention and natural dynamics. Ideal for everyday content creation and rapid prototyping

Omni Models (Coming Soon)

  1. kling-v3.0-omni — Reference-heavy variant featuring enhanced Elements 3.0 with video-character reference (visual + audio capture), multi-shot storyboard workflows, and the strongest subject consistency in the lineup. Designed for serialized storytelling, brand-consistent content, and enterprise production pipelines
  2. Kling v2.6 Pro Series

The Latest Pro Models

  1. kling-v2.6-pro/motion-control — Fine-grained motion guidance for controllable, stable video generation
  2. kling-v2.6-pro/image-to-video — High-fidelity, fast I2V rendering with strong detail preservation
  3. kling-v2.6-pro/text-to-video — Professional-grade T2V outputs with coherent motion and temporal consistency
  4. Kling v2.6 Std Series

The Latest Standard Models

  1. kling-v2.6-std/text-to-video — Efficient T2V generation with smooth motion and reliable visual quality at lower cost
  2. kling-v2.6-std/image-to-video — Fast, cost-effective I2V synthesis with consistent detail retention and natural dynamics
  3. kling-v2.6-std/motion-control — Cost-effective motion transfer from reference videos, enabling controlled character animation with stable identity preservation at lower cost
  4. Kling v2.5 Turbo Series

Professional Pro Models

  1. kling-v2.5-turbo-pro/image-to-video — High-fidelity, fast I2V rendering
  2. kling-v2.5-turbo-pro/text-to-video — Professional-grade T2V outputs with strong frame coherence

Standard Turbo Models

  1. kling-v2.5-turbo-std/image-to-video — Lightweight, fast, and visually refined
  2. Kling v2.1 Series

Image-to-Video Models

  1. kling-v2.1-i2v-standard — Balanced performance for everyday content creation
  2. kling-v2.1-i2v-pro — Stronger scene continuity and semantic modeling
  3. kling-v2.1-i2v-pro/start-end — Start–end guided synthesis for narrative video creation
  4. kling-v2.1-i2v-master — Flagship-level realism and cinematic tone

Text-to-Video Master Model

  1. kling-v2.1-t2v-master — Unmatched motion control for expressive text-driven video
  2. Kling v2.0 Series

Image-to-Video Master Model

  1. kling-v2.0-i2v-master — Superior detail control and enhanced realism

Text-to-Video Master Model

  1. kling-v2.0-t2v-master — Optimized lighting, depth perception, and semantic accuracy
  2. Kling v1.6 Series

Image-to-Video (I2V) Models

  1. kling-v1.6-i2v-standard — Efficient baseline model with stable, realistic motion
  2. kling-v1.6-i2v-pro — Enhanced motion realism, texture detail, and dynamic fidelity

Text-to-Video (T2V) Model

  1. kling-v1.6-t2v-standard — Strong text-to-video consistency and expressive visual output

Multi-Frame I2V Models

  1. kling-v1.6-multi-i2v-standard — Improved transition smoothness and temporal coherence
  2. kling-v1.6-multi-i2v-pro — Cinematic multi-frame synthesis for advanced storytelling

Specialized Kling Tools

Kling Effects & Enhancement Tools

  1. kling-effects — Natural motion effects, creative transitions, and style blending

Kling Lipsync Models

  1. kling-lipsync/audio-to-video — Voice-driven, perfectly aligned talking-face videos
  2. kling-lipsync/text-to-video — Script-to-lipsync generation for digital humans
  3. kwaivgi/kling-v2-ai-avatar-standard — Affordable, single-image talking avatars for everyday explainers, training clips, and social content
  4. kwaivgi/kling-v2-ai-avatar-pro — High-fidelity, studio-quality digital humans with richer motion, expressions, and lip-sync for premium productions

Audio and Speech Tools

  1. kling-v1-tts — Clear and natural text-to-speech for video narration
  2. kwaivgi/kling-video-to-audio — Auto-generated or extracted sound effects and music