Vidu Q3 Pro Image-to-Video animates still images with high-quality motion via viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.
Idle
$0.25per run·~40 / $10
Use the input image as the first frame. Preserve the same woman, face identity, suitcase, outfit, bus stop, lighting, rainy atmosphere, and composition. Light rain falls gently, reflections shimmer on the wet street, and distant bus headlights slowly approach. The woman slightly tightens her grip on the suitcase and looks toward the road. The camera slowly pushes in, creating a quiet emotional cinematic moment. Realistic motion, stable identity, no flicker, no distortion, no morphing.
Vidu Q3 Pro Image-to-Video transforms a reference image into a generated video clip with prompt-guided motion, controllable duration, resolution options, movement amplitude settings, and optional audio generation. It is designed for cinematic image animation, commercial visuals, social content, and other image-driven video generation workflows.
Image-guided video generation Start from a reference image and generate motion while preserving the core scene, subject, and visual identity.
Prompt-based motion control Use a text prompt to describe the motion, atmosphere, camera feel, and overall scene progression.
Flexible movement intensity
Adjust movement_amplitude to control how subtle or dramatic the motion should be.
Optional audio generation Enable generated audio and optional background music for a more complete output.
Multiple resolution options
Choose between 540p, 720p, and 1080p depending on your quality and cost needs.
Production-ready API Suitable for animated keyframes, marketing content, social video, concept visualization, and cinematic motion design workflows.
| Parameter | Required | Description |
|---|---|---|
| image | Yes | Reference image used as the starting visual input for video generation. |
| prompt | Yes | Positive prompt describing the desired motion, style, and scene behavior. |
| resolution | No | Output resolution: 540p, 720p (default), or 1080p. |
| duration | No | Video duration in seconds. Range: 1–16. Default: 5. |
| movement_amplitude | No | Motion intensity: auto, small, medium, or large. Default: auto. |
| generate_audio | No | Whether to generate audio. Default: true. |
| bgm | No | Whether to include background music in the generated output. Default: true. |
| seed | No | Random seed for reproducibility. Use -1 for random generation. |
540p, 720p, or 1080p based on your quality and budget needs.1 and 16 seconds.small, medium, or large, or leave it on auto.-1 for random output, or a fixed seed for more reproducible generations.A cinematic travel shot of a lone figure standing on a desert road at sunset, warm natural light, subtle wind moving clothing and dust, slow camera pullback, realistic motion, stable composition
Pricing is based on duration and resolution.
| Resolution | 5s Cost | Per Second |
|---|---|---|
| 540p | $0.25 | $0.05 |
| 720p | $0.625 | $0.125 |
| 1080p | $0.75 | $0.15 |
| Resolution | 1s | 5s | 10s | 16s |
|---|---|---|---|---|
| 540p | $0.05 | $0.25 | $0.50 | $0.80 |
| 720p | $0.125 | $0.625 | $1.25 | $2.00 |
| 1080p | $0.15 | $0.75 | $1.50 | $2.40 |
540p720p costs 2.5× the base rate1080p costs 3× the base ratedurationmovement_amplitude, generate_audio, bgm, and seed do not affect pricing540p or 720p for testing, then switch to 1080p for higher-quality final outputs.small or medium movement amplitude when you want more stable and natural animation.large movement amplitude for scenes that can tolerate stronger motion.seed when you want more consistent variations.image and prompt are required.duration supports 1–16 seconds.resolution defaults to 720p.movement_amplitude defaults to auto.generate_audio and bgm default to true.seed uses -1 for random generation.duration and resolution.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/vidu/q3-pro/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Q3 Pro Image To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/vidu/q3-pro/image-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"resolution": "720p",
"duration": 5,
"movement_amplitude": "auto",
"generate_audio": true,
"bgm": true,
"seed": 0
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("vidu/q3-pro/image-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"resolution": "720p",
"duration": 5,
"movement_amplitude": "auto",
"generate_audio": true,
"bgm": true,
"seed": 0
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"vidu/q3-pro/image-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"resolution": "720p",
"duration": 5,
"movement_amplitude": "auto",
"generate_audio": true,
"bgm": true,
"seed": 0
}
)
print(output["outputs"][0]) # → URL of the generated outputQ3 Pro Image To Video is a Vidu model for video generation from images, exposed as a REST API on WaveSpeedAI. Vidu Q3 Pro Image-to-Video animates still images with high-quality motion via viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-pro-image-to-video.
Q3 Pro Image To Video starts at $0.25 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `resolution`, `duration`, `seed`, `bgm`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-pro-image-to-video.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (Vidu). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.