Скидка 50% на модели Vidu Q3 и Q3 Pro · только на WaveSpeedAI | 20 мая – 2 июня

Kling V2.5 Turbo Std Image to Video

kwaivgi /

Kling 2.5 Turbo Std delivers image-to-video with fluid motion, cinematic visuals, and precise prompts at 25% lower pricing vs 2.1 Std. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video
Ввод

Перетащите или нажмите для загрузки

preview

Ожидание

$0.21за запуск·~47 / $10

Далее:

ПримерыСмотреть всё

A cinematic and humorous scene. The old wizard is trying to concentrate, studying his spellbook and the glowing orb with a serious expression. The scene is interrupted. The mischievous imp on the left flutters forward and tries to snatch a glowing crystal floating off the book. Simultaneously, the pixie zips close to the wizard's ear, buzzing annoyingly. The wizard, deeply annoyed, looks up from his book, his concentration broken. He raises his free hand and swats at the imp, scolding it without a word, while shaking his head at the pixie. The imp zips away, cackling. The ghost drifts by, seemingly amused.

A cinematic, ultra-high-speed action sequence, bringing the input image to life. The cyberpunk ninja is in the middle of a fluid, devastating dash-and-slash attack. The camera follows him as he moves like a blur through the neon-drenched city street. He completes his impossibly fast sword swing, and the katana leaves a brilliant red and white light trail in the air. We see the arc of the blade in dramatic slow-motion for one split second. After the strike, he doesn't stop. He continues his forward dash, his body low, and gracefully sheathes his sword with a sharp, robotic *'click'* motion, all without breaking his sprint. He then vanishes into the rain and neon-lit shadows of the city. Style: Photorealistic, masterpiece, high detail, extreme motion blur on background, dynamic camera tracking, cyberpunk aesthetic, dark, gritty.

A cinematic, epic Wuxia (Chinese martial arts) 5-second video, bringing the input image to life. The scene starts with the swordsman in his static, contemplative pose on the rooftop, his robes gently fluttering in the night breeze, exactly as seen in the image. He slowly raises his head, his eyes now sharp and focused, looking out over the misty city towards a distant target (off-camera). With a sudden, explosive burst of "Qi" (energy), he pushes off the roof and leaps into the air. He becomes a graceful, shadowy blur, performing an impossibly fast "Qinggong" (light-foot) run. He sprints weightlessly across the curved, tiled rooftops, his dark robes and long hair streaming behind him. He takes a gravity-defying leap over a wide alley, silhouetted for a moment against the massive, bright full moon before vanishing into the mist of the city below. Camera: A dynamic, sweeping drone shot that follows his run and leap. Style: Photorealistic fantasy, masterpiece, high detail, smooth motion, slow-motion effect on the leap.

A cinematic, magical transition starting from the input image. The woman's blue-and-white top and white skirt dissolve into shimmering, magical particles of light. As the light particles fade away, they seamlessly reveal that she is now wearing a vibrant [Color, e.g., red or blue] bikini underneath. Simultaneously, the green grass background transforms and cross-fades into a stunning, photorealistic tropical beach with a sparkling turquoise ocean and clear blue sky. Once the transformation is complete, she gives a bright, joyful smile directly to the camera, then gracefully stands up, turns her body, and begins to run playfully towards the ocean waves. The camera follows her as she runs away.

A cinematic, epic, 5-second video, bringing the input image to life in a grand, comical fantasy battle. The scene starts with the brave Corgi Knight (or small animal king) in full armor, posing heroically on the cliff edge under the dark, stormy sky, just like the image. He holds his small sword high, looks out at the unseen enemy, and lets out a surprisingly high-pitched, adorable, yet fierce battle cry (a 'squeak' or 'bark'). His pony rears up dramatically for a moment, then gallops heroically down the steep cliff path towards the beach. As he charges, the loyal army of armored squirrels and rodents on the beach below raises their tiny spears in unison, chittering and squeaking as they surge forward to follow their king into battle. Camera: A dynamic, low-angle tracking shot, following the Corgi King as he leads the charge, capturing the chaotic, adorable energy of the small animal army. Style: Epic fantasy, cute, cinematic, high detail, masterpiece, dynamic motion, slow-motion effect.

A cinematic, photorealistic video, starting as a static shot of the man in the input image. The camera begins a slow, dramatic zoom-in on his intense, analytical face. As the camera pushes closer, he slowly and deliberately pulls his hands out of his coat pockets. He raises them into the frame, palms facing forward. He then performs a series of complex, intricate, mystical hand gestures (like Doctor Strange), tracing patterns in the air. His expression remains serious and focused. Finally, he opens his palms wide. Two large, intricate, glowing orange magical sigils (Eldritch magic circles) burst into existence and spin rapidly in front of each hand, illuminating his face with a warm, magical light. Style: Epic, supernatural, slow-motion, high detail, masterpiece, dramatic lighting, magical particle effects.

A cinematic, epic horror, 5-second video, bringing the input image to life. The scene starts with the dark, gothic castle being battered by the violent thunderstorm, just as seen in the image. Massive waves crash against the cliffs, rain pours, and a bright flash of lightning illuminates the sky. Suddenly, the turbulent sea in the foreground explodes upwards. A colossal, terrifying sea monster (a Lovecraftian kraken or leviathan) bursts from the water, its massive form silhouetted against the storm. It rears its head back, opens its massive, blood-curdling maw filled with rows of teeth, and unleashes a deafening, earth-shaking roar directly at the castle tower. In direct response to the roar, the castle's dark windows, which were all black, suddenly ignite one by one with the small, flickering, warm lights of torches, as the inhabitants are violently awakened in panic. Camera: A quick, dramatic push-in towards the castle windows as the torches are lit. Style: Photorealistic, masterpiece, high detail, high contrast, slow-motion effect on the monster's roar, Lovecraftian horror.

A cinematic, epic, 5-second video, bringing the mech from the input image to life. The scene starts with the winged mech floating silently in a dark, cosmic void, just as seen in the image, with the purple halos slowly rotating around it. Suddenly, its eyes flash with a brilliant green light. The multiple white and yellow energy wings burst open to their full, majestic span, scattering a storm of golden, feather-like particles into space. The purple halos intensify and spin rapidly. The mech then brings its two powerful rifles together, connecting them for a final attack. A massive, overwhelmingly powerful beam of pure energy erupts from the combined weapon, incinerating an unseen enemy fleet off-camera. The sheer force of the blast creates a blinding shockwave that whites out the screen.

A cinematic, luxurious advertisement for a cocktail, starting from the product in [Image 1]. (Scene 1: The Creation) An extreme slow-motion, macro close-up. A single, perfect maraschino cherry falls gracefully into a crystal-clear martini glass filled with deep amber liquid. The moment it hits, it creates a beautiful, elegant splash, and a rich, dark red syrup (like the one at the bottom of [Image 1]) blossoms and swirls upwards from the cherry, mixing with the golden liquid. Capture the liquid dynamics and micro-droplets frozen in time. (Scene 2: The Showcase) The shot seamlessly transitions to a 360-degree rotating product shot of the finished cocktail. The glass is now perfectly still, glistening under dramatic studio lighting, showcasing its vibrant two-tone color and the elegant lemon twist garnish from [Image 1]. The background is a dark, sophisticated, blurred high-end bar. (Scene 3: The Enjoyment) The final scene cuts to a close-up of a charismatic, sophisticated man in a sharp, tailored suit (closely resembling the man in [Image 2]). He is sitting in a luxurious lounge. He confidently picks up the finished cocktail, swirls it gently, and takes a slow, appreciative sip. He then looks directly at the camera with a subtle, knowing smile. Style: Ultra-realistic, high resolution, professional product photography, shallow depth of field, dramatic lighting, smooth motion, high-speed camera effect.

A cinematic, high-detail food videography shot, bringing the chef from the input image to life. The scene starts exactly as the image: the chef, with intense focus, finishes the final, precise cut on the raw steak. He then looks up, satisfied. He generously seasons the steak with fresh-ground black pepper and sea salt. The camera follows as he lifts the perfectly seasoned steak and gently places it onto the smoking-hot grill pan in the foreground. This is followed by an extreme macro close-up, in dramatic slow-motion, as the steak hits the hot iron. A loud, satisfying *SIZZLE* erupts. Fragrant smoke and steam billow up, momentarily obscuring the chef as they fill the frame. The camera lingers on the steak as it develops perfect, dark sear marks. Style: Ultra-realistic, professional food commercial, high detail, macro shots, slow-motion effect, warm lighting, high-speed camera effect.

A cinematic, photorealistic video, bringing the input image to life in a high-speed action sequence. The scene transforms to a dark, misty midnight, illuminated by a full moon. The ninja is no longer crouching. He explodes into motion, sprinting silently at a supernatural speed across the wet, moonlit rooftops of a traditional Japanese castle complex. He moves like a blur, leaping effortlessly over wide gaps between buildings, running vertically up walls for a few steps, and flowing over obstacles like smoke. The camera is a dynamic, low-angle tracking shot, struggling to keep pace with his fluid, parkour-like movements. Style: Fast-paced, smooth motion, high detail, stealthy, atmospheric, dark, cinematic.

A cinematic, slice-of-life anime clip, bringing the input image to life. The scene starts with the girl in the school uniform, standing in the entryway (genkan) with a piece of toast in her mouth, exactly as seen in the image. She briefly pauses, perhaps checking her bag. Suddenly, her eyes widen in a classic anime "Oh no! I'm late!" expression. She gives a muffled, worried sound from around the toast. She immediately pivots, her twintails and pleated skirt swirling with the motion, and makes a frantic, high-energy dash out the open door. The camera follows her as she runs into the bright, sun-drenched street outside. Style: Japanese slice-of-life anime, warm colors, high detail, smooth animation, lens flare from the sun.

Похожие модели

README

Kling V2.5 Turbo Standard (Image-to-Video)

Kling V2.5 Turbo Standard is a high-performance image-to-video generation model optimized for speed, quality, and affordability. It transforms a single image and a short prompt into smooth, cinematic video clips that preserve the original style, lighting, and emotion — all at a lower cost than previous versions.

🌟 Model Highlights

  • 💰 Ultra Cost-Effective Delivers higher visual quality at a 25% lower price compared with Kling V2.1 Standard. The model provides stunning realism while significantly reducing generation costs.

  • 🏢 B2B Early Access Exclusively launched for enterprise clients through an invite-only program. The model will be publicly available on Kling Web/App around November, giving B-end users an early advantage and extended testing window.

  • 🎬 Pro-Level Visual Quality Although output resolution is 720p, the model’s refined dynamics and motion synthesis ensure rich details, clean motion, and stable lighting that meet the needs of most video generation scenarios.

  • ⚡ Fast Inference Built with optimized pipelines for rapid generation — ideal for high-volume creative workflows.

  • 🧠 Strong Text Comprehension Matches the Kling 2.5 Turbo Pro version in prompt understanding and narrative coherence, producing well-timed, semantically accurate motion.

⚙️ Capabilities

  • Input: Image + text prompt
  • Output: 720p video
  • Supported Durations: 5s / 10s
  • Use Cases: Marketing videos, storyboards, explainers, short-form content

💰 Pricing

DurationPrice (USD)
5s$0.21
10s$0.42

🧩 How to Use

  1. Write your prompt — describe the subject, camera movement, and atmosphere.
  2. Upload a reference image — defines composition and color tone.
  3. Set duration — choose 5s or 10s depending on your creative need.
  4. Adjust guidance_scale — higher values increase prompt adherence.
  5. Run generation — fast inference delivers results within seconds.
  6. Review & iterate — tweak prompts or seeds for variations.

💡 Ideal For

  • Marketing & Brand Teams — Generate short ads or motion design clips quickly.
  • Content Creators & YouTubers — Produce narrative motion from static art.
  • Studios & Production Houses — Use as a previsualization tool for scene planning.
  • Education & Explainers — Turn diagrams or slides into dynamic videos.
Доступность:Этот сайт использует модели ИИ, предоставляемые третьими лицами.

Kling v2.5 Turbo Std Image To Video API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/kwaivgi/kling-v2.5-turbo-std/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Kling v2.5 Turbo Std Image To Video below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v2.5-turbo-std/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "negative_prompt": "blurry, low quality, distorted",
    "guidance_scale": 0.5,
    "duration": 5
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("kwaivgi/kling-v2.5-turbo-std/image-to-video", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "negative_prompt": "blurry, low quality, distorted",
        "guidance_scale": 0.5,
        "duration": 5
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "kwaivgi/kling-v2.5-turbo-std/image-to-video",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "negative_prompt": "blurry, low quality, distorted",
    "guidance_scale": 0.5,
    "duration": 5
}
)

print(output["outputs"][0])  # → URL of the generated output

Kling v2.5 Turbo Std Image To Video API — Frequently asked questions

What is the Kling v2.5 Turbo Std Image To Video API?

Kling v2.5 Turbo Std Image To Video is a Kuaishou model for video generation from images, exposed as a REST API on WaveSpeedAI. Kling 2.5 Turbo Std delivers image-to-video with fluid motion, cinematic visuals, and precise prompts at 25% lower pricing vs 2.1 Std. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Kling v2.5 Turbo Std Image To Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v2.5-turbo-std-image-to-video.

How much does Kling v2.5 Turbo Std Image To Video cost per run?

Kling v2.5 Turbo Std Image To Video starts at $0.21 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Kling v2.5 Turbo Std Image To Video accept?

Key inputs: `prompt`, `image`, `duration`, `guidance_scale`, `negative_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v2.5-turbo-std-image-to-video.

How long does Kling v2.5 Turbo Std Image To Video take to generate?

Average end-to-end generation time on WaveSpeedAI is around 49 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Kling v2.5 Turbo Std Image To Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.