
Vidu Image to Video Q2 Turbo


Vidu Q2 Turbo Image-to-Video turns a single image into smooth, cinematic motion with fast, high-quality output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.


Examples

A cinematic fantasy sequence. The boy gently reaches out to touch the colorful dragon’s nose, and the dragon slowly lowers its head, blinking warmly. Soft dust particles float in the air, light beams shimmer through mist, and the ground subtly breathes with glowing mushrooms. The camera slowly circles around them, capturing the magical bond between child and creature in a gentle, emotional moment. Highly detailed, volumetric lighting, soft depth of field, 4K quality.

A serene snowy night at an elevated Japanese train station, bathed in deep blue twilight. A lone child in a blue coat and red backpack stands on the empty platform, looking down the tracks. Above, a massive cherry blossom tree reflects perfectly on a mirror-like snowy surface, petals gently drifting in the cold wind. Soft overhead station lights cast warm glows on concrete pillars. Subtle steam rises from the child's breath. Camera slowly dollies forward from wide establishing shot to medium on the child, with delicate snow particles floating in air. Hyper-detailed Makoto Shinkai anime style, cinematic color grading, 6-second poetic loop, 1080p, 16:9.

Epic fantasy procession in a windswept flower field. A blonde samurai girl in ornate teal armor stands atop a massive yellow ox demon with curved horns and red ropes. Wind whips her hair and cape. Behind, a blindfolded monk with staff and a rat-eared monk with tail walk in sync. Red spider lilies sway violently. Camera slow dolly left, low angle, ox’s heavy steps shake the ground. Hand-painted ukiyo-e anime style, saturated colors, dynamic ink textures

Cyber-dystopian alley at dusk. A lone girl with ultra-long orange ponytail in flowing white tech-coat walks away from camera. Her coat billows in wind, cables overhead sway like vines. Sunlight cuts through skyscraper gaps, casting golden beams on cracked concrete. Camera slow tracking shot behind her, slight parallax on buildings. Photorealistic CGI, teal-orange color grade, volumetric god rays

Epic anime mountain standoff. A blonde girl in red hooded cloak grips a glowing katana, eyes fierce. Behind her, a young archer notches an arrow. A colossal white wolf with scarred mask towers, fur rippling in wind. Clouds swirl around snow-capped peaks. Camera slow push-in from wide tableau to tight on girl and wolf’s eyes, scarf and mane dancing. Studio Ghibli-style hand-drawn animation, soft cel-shading, golden rim light


README

Vidu Q2 Turbo Image-to-Video

Vidu Q2 Turbo Image-to-Video turns a single reference image into a smooth, cinematic video. Turbo accelerates generation for faster turnaround while keeping motion clean and stable — great for transitions, product demos, and quick storytelling.

Why Choose This?

  • Turbo-optimized pipeline: Faster generation than Pro at the same settings, perfect for quick iteration.

  • Temporal smoothing: Cuts flicker and popping while keeping faces, hands, hair, and thin details intact.

  • Depth-aware motion: Respects occlusion and parallax for natural foreground/background separation.

  • Cinematic camera paths: Subtle pans, push-ins, and dollies without rubbery warps.

  • Optional background music: Auto-generate BGM for social-ready clips.

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| image | Yes | Reference image to animate (upload or URL) |
| prompt | Yes | Describe desired motion, mood, and camera movement |
| duration | No | Video length in seconds (1–10, default: 4) |
| resolution | No | Output resolution: 540p, 720p, or 1080p |
| movement_amplitude | No | Motion intensity: auto, small, medium, or large |
| bgm | No | Enable background music generation |
| seed | No | Random seed for reproducibility (-1 for random) |
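
The parameter table can be mirrored in a small client-side check before anything is sent to the API. The following is a minimal sketch, not part of any WaveSpeedAI SDK: the function name `build_payload` and the choice to omit unset optional fields (so the server applies its own defaults) are assumptions made for illustration.

```python
from typing import Any, Dict, Optional

# Allowed values copied from the parameter table.
ALLOWED_RESOLUTIONS = {"540p", "720p", "1080p"}
ALLOWED_AMPLITUDES = {"auto", "small", "medium", "large"}


def build_payload(
    image: str,
    prompt: str,
    duration: Optional[int] = None,
    resolution: Optional[str] = None,
    movement_amplitude: Optional[str] = None,
    bgm: Optional[bool] = None,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate inputs and return a JSON-ready dict."""
    if not image or not prompt:
        raise ValueError("image and prompt are required")
    payload: Dict[str, Any] = {"image": image, "prompt": prompt}
    if duration is not None:
        if not 1 <= duration <= 10:
            raise ValueError("duration must be between 1 and 10 seconds")
        payload["duration"] = duration
    if resolution is not None:
        if resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError("resolution must be 540p, 720p, or 1080p")
        payload["resolution"] = resolution
    if movement_amplitude is not None:
        if movement_amplitude not in ALLOWED_AMPLITUDES:
            raise ValueError("movement_amplitude must be auto, small, medium, or large")
        payload["movement_amplitude"] = movement_amplitude
    if bgm is not None:
        payload["bgm"] = bool(bgm)
    if seed is not None:
        payload["seed"] = seed
    return payload


payload = build_payload(
    "https://example.com/your-input.jpg",
    "Slow push-in, soft golden light",
    duration=5,
    resolution="720p",
)
```

Catching an out-of-range duration or a misspelled resolution locally avoids submitting a request that the API would presumably reject.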

How to Use

  1. Upload your image — ensure the URL is accessible for preview.
  2. Write a prompt — describe desired motion, mood, and camera movement.
  3. Set duration — choose video length from 1 to 10 seconds.
  4. Select resolution — 540p, 720p, or 1080p based on quality needs.
  5. Adjust movement amplitude (optional) — control motion intensity.
  6. Enable BGM (optional) — add background music automatically.
  7. Set seed (optional) — use for reproducible results.
  8. Run — submit and download your video.

Pricing

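| Resolution | Duration | Price |
| --- | --- | --- |
| 540p | 1s | $0.03 |
| 540p | 2s | $0.04 |
| 540p | 3s | $0.05 |
| 540p | 4s | $0.06 |
| 540p | 5s | $0.07 |
| 540p | 6s | $0.08 |
| 540p | 7s | $0.09 |
| 540p | 8s | $0.10 |
| 540p | 9s | $0.20 |
| 540p | 10s | $0.30 |
| 720p | 1s | $0.04 |
| 720p | 2s | $0.05 |
| 720p | 3s | $0.10 |
| 720p | 4s | $0.15 |
| 720p | 5s | $0.20 |
| 720p | 6s | $0.25 |
| 720p | 7s | $0.30 |
| 720p | 8s | $0.35 |
| 720p | 9s | $0.45 |
| 720p | 10s | $0.50 |
| 1080p | 1s | $0.175 |
| 1080p | 2s | $0.225 |
| 1080p | 3s | $0.275 |
| 1080p | 4s | $0.325 |
| 1080p | 5s | $0.375 |
| 1080p | 6s | $0.425 |
| 1080p | 7s | $0.475 |
| 1080p | 8s | $0.525 |
| 1080p | 9s | $0.625 |
| 1080p | 10s | $0.725 |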

Billing Rules

540p: $0.03 for 1s, +$0.01/s up to 8s, then $0.20 for 9s, $0.30 for 10s

720p: $0.04 for 1s, $0.05 for 2s, then +$0.05/s from 3s up to 8s, then $0.45 for 9s, $0.50 for 10s

1080p: $0.175 for 1s, then +$0.05/s up to 8s, then +$0.10/s for 9s-10s
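
To estimate cost before submitting, the pricing table can be encoded as a lookup. This is a minimal sketch: the values are copied from the pricing table above, while the names `PRICES_USD`, `price_usd`, and `batch_cost_usd` are hypothetical. `Decimal` is used so that sums of prices such as $0.175 stay exact.

```python
from decimal import Decimal
from typing import Dict, Iterable, List, Tuple

# Price in USD per run, indexed by duration (1s..10s), copied from the pricing table.
PRICES_USD: Dict[str, List[str]] = {
    "540p": ["0.03", "0.04", "0.05", "0.06", "0.07", "0.08", "0.09", "0.10", "0.20", "0.30"],
    "720p": ["0.04", "0.05", "0.10", "0.15", "0.20", "0.25", "0.30", "0.35", "0.45", "0.50"],
    "1080p": ["0.175", "0.225", "0.275", "0.325", "0.375", "0.425", "0.475", "0.525", "0.625", "0.725"],
}


def price_usd(resolution: str, duration: int) -> Decimal:
    """Return the listed price for one run at the given resolution and duration."""
    if resolution not in PRICES_USD:
        raise ValueError("resolution must be 540p, 720p, or 1080p")
    if not 1 <= duration <= 10:
        raise ValueError("duration must be between 1 and 10 seconds")
    return Decimal(PRICES_USD[resolution][duration - 1])


def batch_cost_usd(runs: Iterable[Tuple[str, int]]) -> Decimal:
    """Sum the listed prices for a batch of (resolution, duration) runs."""
    return sum((price_usd(r, d) for r, d in runs), Decimal("0"))


print(price_usd("720p", 5))                              # 0.20
print(batch_cost_usd([("540p", 8), ("1080p", 10)]))      # 0.825
```

The table shows a price jump at 9s for every resolution, so an 8-second clip is noticeably cheaper per second than a 9- or 10-second one.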

Best Use Cases

  • Quick Iterations — Faster than Pro for rapid creative exploration.
  • Social Media Content — Create engaging video clips from photos.
  • Product Demos — Animate product images with cinematic motion.
  • Transitions — Generate smooth video transitions from key frames.
  • Storytelling — Transform story moments into dynamic video.

Pro Tips

  • Use high-quality, well-lit images for best results.
  • Be specific in your prompt about camera movement (pan, zoom, dolly).
  • Start with "auto" amplitude, then adjust if motion is too subtle or strong.
  • Turbo is generally faster than Pro at the same settings.
  • Use smaller amplitude for portraits to avoid unnatural distortion.

Notes

  • Maximum duration is 10 seconds per generation.
  • Actual runtime depends on resolution, duration, movement amplitude, and queue load.
  • Ensure you have the right to use the uploaded image in your project.
  • Image URL must be accessible for preview to display correctly.


Image To Video Q2 Turbo API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/vidu/image-to-video-q2-turbo with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Image To Video Q2 Turbo below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/vidu/image-to-video-q2-turbo" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "duration": 5,
    "resolution": "720p",
    "bgm": true,
    "movement_amplitude": "auto",
    "seed": -1
  }'

# The response includes a prediction id. Set REQUEST_ID to that id, then poll for the result:
curl "https://api.wavespeed.ai/api/v3/predictions/$REQUEST_ID/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output URL from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules have no top-level await, so run inside an async function.
async function main() {
  const result = await client.run("vidu/image-to-video-q2-turbo", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "duration": 5,
    "resolution": "720p",
    "bgm": true,
    "movement_amplitude": "auto",
    "seed": -1
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "vidu/image-to-video-q2-turbo",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "duration": 5,
        "resolution": "720p",
        "bgm": True,  # Python boolean, not JSON's lowercase true
        "movement_amplitude": "auto",
        "seed": -1,
    },
)

print(output["outputs"][0])  # → URL of the generated output
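
For callers who use the REST endpoint directly rather than the SDK, the poll step from the quick start can be sketched with the Python standard library. This is an illustration under stated assumptions, not the official client: the result URL is the one from the HTTP example, but reading the status from `data.status`, the `"failed"` status name, and the `error` field are assumptions, as are the helper names `fetch_result` and `wait_for_output`.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id: str, api_key: str) -> Dict[str, Any]:
    """GET the prediction result once and decode the JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def wait_for_output(
    fetch: Callable[[], Dict[str, Any]],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch() until the prediction completes, then return data.outputs[0]."""
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch().get("data", {})
        status = data.get("status")  # assumed location of the status field
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed name of the failure status
            raise RuntimeError(f"prediction failed: {data.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not complete in time")
        sleep(interval_s)


# Usage, with request_id taken from the submit response:
# url = wait_for_output(lambda: fetch_result(request_id, api_key))
```

Passing `fetch` as a callable keeps the loop independent of the HTTP layer, so it can be exercised with canned responses and no network access.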

Image To Video Q2 Turbo API — Frequently asked questions

What is the Image To Video Q2 Turbo API?

Image To Video Q2 Turbo is a Vidu model that generates video from a single image, exposed as a REST API on WaveSpeedAI. It turns the image into smooth, cinematic motion, and the Turbo pipeline is tuned for fast turnaround. You can call it programmatically or try it from the playground above.

How do I call the Image To Video Q2 Turbo API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/vidu/vidu-image-to-video-q2-turbo.

How much does Image To Video Q2 Turbo cost per run?

The playground lists Image To Video Q2 Turbo at $0.10 per run. The final charge scales with the resolution and duration you set: per the pricing table, it ranges from $0.03 (540p, 1s) to $0.725 (1080p, 10s). The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Image To Video Q2 Turbo accept?

Key inputs: `prompt`, `image`, `resolution`, `duration`, `movement_amplitude`, `seed`, `bgm`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/vidu/vidu-image-to-video-q2-turbo.

How long does Image To Video Q2 Turbo take to generate?

Average end-to-end generation time on WaveSpeedAI is around 42 seconds per request, measured across recent runs. Actual time also depends on resolution, duration, and movement amplitude. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Image To Video Q2 Turbo outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Vidu). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.