Vidu Q2 Pro turns a single still image into smooth, cinematic image-to-video with stable motion, clean edges, and consistent lighting. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
En attente
$0.15par exécution·~66 / $10
Post-apocalyptic steam locomotive barrels through haunted canyon at night. Black iron engine belches thick white steam, headlamp cuts fog. Tattered zombies claw at red freight cars. Lightning forks illuminate skeletal trees and abandoned wooden trestles. Camera slow tracking alongside train, sparks fly from wheels. Photorealistic CGI, desaturated blue-gray grade, volumetric fog
A cinematic stop-motion style animation, the camera slowly pans forward toward the eerie house. The rider on the horse leans closer, the horse breathes visibly in the cold air, while the masked figure gestures toward the door. Subtle fog drifts across the ground, warm light flickers from the windows, evoking a mysterious and melancholic tone.
A cinematic sci-fi scene inside a cluttered laboratory filled with books, wires, and old machinery. A man in a long coat walks slowly through the warm, dust-filled room toward the glowing window. The camera tracks gently forward, revealing intricate cables on the ceiling and the soft orange light flickering across metal surfaces. Papers rustle, and a faint breeze moves through the space. Atmospheric lighting, volumetric dust, high detail, cinematic depth, 4K quality.
A young boy in a flowing red cape and green jacket stands defiantly on a misty orange-yellow plain, facing a massive, menacing red oni demon with glowing red eyes, sharp horns, and jagged teeth. The demon rears up aggressively, tentacles writhing from its body, while the boy raises his hands in a magical gesture, wind swirling around him. Dynamic camera slowly circles from low angle to high, building tension, in vibrant hand-drawn anime style with bold red and orange hues, ethereal blue accents in the background. Smooth 8-second animation, 1080p, cinematic lighting.
Gothic ruined library under full moon. A pale girl in white uniform reads on velvet throne. Spectral blue phoenix perches on chair, translucent wolf circles her feet. Torn pages orbit like snowflakes. Moonbeam pierces cracked dome, dust motes sparkle. Camera slow dolly from bookshelf to girl’s face, soft parallax on spirits. Photorealistic dark fantasy, cyan-moonlight grade, volumetric god rays
Vidu Q2 Pro Image-to-Video turns a single still image into a smooth, cinematic video. Built for creators who need stable motion, clean edges, and consistent lighting — with optional background music for social-ready clips.
Crisp motion from one frame Generates natural camera moves and subject animation without warping or distortion.
Identity and detail preservation Protects faces, hair, hands, and thin structures throughout the animation.
Layout-aware dynamics Respects depth and parallax for believable foreground/background movement.
Optional background music Auto-add BGM for social-ready clips without post-production.
Motion control Adjust movement amplitude from subtle to dramatic based on your creative needs.
| Parameter | Required | Description |
|---|---|---|
| image | Yes | Reference image to animate (PNG/JPG/WebP) |
| prompt | Yes | Describe desired motion, mood, and camera movement |
| duration | No | Video length in seconds (1–10, default: 4) |
| resolution | No | Output resolution: 540p, 720p, or 1080p |
| movement_amplitude | No | Motion intensity: auto, small, medium, or large |
| bgm | No | Enable background music |
| seed | No | Random seed for reproducibility (-1 for random) |
| Resolution | Duration | Price |
|---|---|---|
| 540p | 1s | $0.04 |
| 540p | 2s | $0.05 |
| 540p | 3s | $0.075 |
| 540p | 4s | $0.10 |
| 540p | 5s | $0.125 |
| 540p | 6s | $0.15 |
| 540p | 7s | $0.175 |
| 540p | 8s | $0.20 |
| 540p | 9s | $0.225 |
| 540p | 10s | $0.25 |
| 720p | 1s | $0.075 |
| 720p | 2s | $0.125 |
| 720p | 3s | $0.175 |
| 720p | 4s | $0.225 |
| 720p | 5s | $0.275 |
| 720p | 6s | $0.325 |
| 720p | 7s | $0.375 |
| 720p | 8s | $0.425 |
| 720p | 9s | $0.475 |
| 720p | 10s | $0.525 |
| 1080p | 1s | $0.275 |
| 1080p | 2s | $0.35 |
| 1080p | 3s | $0.425 |
| 1080p | 4s | $0.50 |
| 1080p | 5s | $0.575 |
| 1080p | 6s | $0.65 |
| 1080p | 7s | $0.725 |
| 1080p | 8s | $0.80 |
| 1080p | 9s | $0.875 |
| 1080p | 10s | $0.95 |
540p: $0.04 for 1s, $0.05 for 2s, then +$0.025 per second
720p: $0.075 for 1s, then +$0.05 per second
1080p: $0.275 for 1s, then +$0.075 per second
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/vidu/image-to-video-q2-pro with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Image To Video Q2 Pro below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/vidu/image-to-video-q2-pro" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"movement_amplitude": "auto",
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("vidu/image-to-video-q2-pro", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"movement_amplitude": "auto",
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"vidu/image-to-video-q2-pro",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"duration": 5,
"resolution": "720p",
"bgm": true,
"movement_amplitude": "auto",
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputImage To Video Q2 Pro is a Vidu model for video generation from images, exposed as a REST API on WaveSpeedAI. Vidu Q2 Pro turns a single still image into smooth, cinematic image-to-video with stable motion, clean edges, and consistent lighting. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/vidu/vidu-image-to-video-q2-pro.
Image To Video Q2 Pro starts at $0.15 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `resolution`, `duration`, `seed`, `bgm`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/vidu/vidu-image-to-video-q2-pro.
Average end-to-end generation time on WaveSpeedAI is around 300 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Vidu). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.