Vidu Q3 Pro Image to Video | Fast Image-to-Video API

Vidu Q3 Pro Image-to-Video

Vidu Q3 Pro Image-to-Video transforms a reference image into a generated video clip with prompt-guided motion, controllable duration, resolution options, movement amplitude settings, and optional audio generation. It is designed for cinematic image animation, commercial visuals, social content, and other image-driven video generation workflows.

Why Choose This?

Image-guided video generation: Start from a reference image and generate motion while preserving the core scene, subject, and visual identity.
Prompt-based motion control: Use a text prompt to describe the motion, atmosphere, camera feel, and overall scene progression.
Flexible movement intensity: Adjust movement_amplitude to control how subtle or dramatic the motion should be.
Optional audio generation: Enable generated audio and optional background music for a more complete output.
Multiple resolution options: Choose between 540p, 720p, and 1080p depending on your quality and cost needs.
Production-ready API: Suitable for animated keyframes, marketing content, social video, concept visualization, and cinematic motion design workflows.

Parameters

Parameter	Required	Description
image	Yes	Reference image used as the starting visual input for video generation.
prompt	Yes	Positive prompt describing the desired motion, style, and scene behavior.
resolution	No	Output resolution: `540p`, `720p` (default), or `1080p`.
duration	No	Video duration in seconds. Range: `1–16`. Default: `5`.
movement_amplitude	No	Motion intensity: `auto`, `small`, `medium`, or `large`. Default: `auto`.
generate_audio	No	Whether to generate audio. Default: `true`.
bgm	No	Whether to include background music in the generated output. Default: `true`.
seed	No	Random seed for reproducibility. Use `-1` for random generation.
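
The parameter table above can be turned into a small client-side check before a request is sent. The following is a minimal sketch, not part of the WaveSpeedAI SDK: `build_payload` is a hypothetical helper, the allowed values and defaults are copied from the table, and two points are assumptions (that `duration` must be a whole number of seconds, and that `-1` is the default `seed`).

```python
ALLOWED_RESOLUTIONS = {"540p", "720p", "1080p"}
ALLOWED_AMPLITUDES = {"auto", "small", "medium", "large"}

# Defaults as listed in the parameter table.
# Assumption: seed defaults to -1 (the table only says -1 means "random").
DEFAULTS = {
    "resolution": "720p",
    "duration": 5,
    "movement_amplitude": "auto",
    "generate_audio": True,
    "bgm": True,
    "seed": -1,
}


def build_payload(image, prompt, **overrides):
    """Merge overrides into the documented defaults and validate the result."""
    if not image or not prompt:
        raise ValueError("both image and prompt are required")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    payload = {"image": image, "prompt": prompt, **DEFAULTS, **overrides}

    if payload["resolution"] not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {payload['resolution']}")
    if payload["movement_amplitude"] not in ALLOWED_AMPLITUDES:
        raise ValueError(f"unsupported movement_amplitude: {payload['movement_amplitude']}")

    duration = payload["duration"]
    # Assumption: duration is an integer number of seconds in the range 1-16.
    if isinstance(duration, bool) or not isinstance(duration, int) or not 1 <= duration <= 16:
        raise ValueError("duration must be an integer between 1 and 16")
    return payload
```

Calling `build_payload("https://example.com/your-input.jpg", "slow camera pullback", resolution="1080p")` returns a dictionary that can be sent as the JSON request body; invalid values raise `ValueError` locally instead of costing a round trip to the API.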

How to Use

Upload your image — provide the reference image you want to animate.
Write your prompt — describe the motion, scene progression, style, and atmosphere you want.
Choose resolution — use 540p, 720p, or 1080p based on your quality and budget needs.
Set duration — choose the desired video length between 1 and 16 seconds.
Adjust movement amplitude (optional) — use small, medium, or large, or leave it on auto.
Configure audio options (optional) — enable or disable generated audio and background music.
Set a seed (optional) — use -1 for random output, or a fixed seed for more reproducible generations.
Submit — run the model and download the generated video.

Example Prompt

A cinematic travel shot of a lone figure standing on a desert road at sunset, warm natural light, subtle wind moving clothing and dust, slow camera pullback, realistic motion, stable composition

Pricing

Pricing is based on duration and resolution.

Resolution	5s Cost	Per Second
540p	$0.25	$0.05
720p	$0.625	$0.125
1080p	$0.75	$0.15

Example Costs

Resolution	1s	5s	10s	16s
540p	$0.05	$0.25	$0.50	$0.80
720p	$0.125	$0.625	$1.25	$2.00
1080p	$0.15	$0.75	$1.50	$2.40

Billing Rules

Base price is $0.25 per 5 seconds at 540p
720p costs 2.5× the base rate
1080p costs 3× the base rate
Pricing scales linearly with duration
movement_amplitude, generate_audio, bgm, and seed do not affect pricing
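
The billing rules above reduce to a per-second rate multiplied by duration. Here is a minimal sketch of that arithmetic; `estimate_cost` is a hypothetical helper, the rates are copied from the pricing table, and it assumes durations are billed in whole seconds.

```python
from decimal import Decimal

# Per-second rates in USD, taken from the pricing table.
PER_SECOND_USD = {
    "540p": Decimal("0.05"),    # base rate: $0.25 per 5 seconds
    "720p": Decimal("0.125"),   # 2.5x the base rate
    "1080p": Decimal("0.15"),   # 3x the base rate
}


def estimate_cost(resolution, duration_seconds):
    """Return the estimated charge in USD for one generation."""
    if resolution not in PER_SECOND_USD:
        raise ValueError(f"unsupported resolution: {resolution}")
    if not 1 <= duration_seconds <= 16:
        raise ValueError("duration must be between 1 and 16 seconds")
    # Pricing scales linearly with duration; other parameters do not affect it.
    return PER_SECOND_USD[resolution] * duration_seconds
```

For example, `estimate_cost("720p", 10)` equals `Decimal("1.25")`, matching the $1.25 entry in the example costs table. `Decimal` is used instead of `float` so that currency values stay exact.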

Best Use Cases

Image animation — Turn still images into short video clips with controlled motion.
Marketing creatives — Create animated hero visuals, ad content, and branded motion assets.
Social media video — Generate short-form animated clips for posts, reels, and promotional content.
Concept visualization — Bring keyframes, illustrations, or photo concepts to life with motion.
Atmospheric storytelling — Add subtle camera motion and scene dynamics to still images.
Audio-enabled video generation — Produce outputs with generated audio and background music for richer presentation.

Pro Tips

Use a clean, high-quality reference image for better stability and subject preservation.
Be specific in your prompt about motion, camera behavior, and atmosphere.
Start with 540p or 720p for testing, then switch to 1080p for higher-quality final outputs.
Use small or medium movement amplitude when you want more stable and natural animation.
Keep large movement amplitude for scenes that can tolerate stronger motion.
Reuse the same seed when you want more consistent variations.
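
The tips on testing at a lower resolution and reusing a seed can be combined into a draft-then-final workflow. The sketch below is illustrative only: `draft_and_final` is a hypothetical helper, and it is an assumption (not stated on this page) that the same seed yields similar motion across resolutions.

```python
def draft_and_final(image, prompt, seed, duration=5, movement_amplitude="small"):
    """Return (draft, final) request bodies that differ only in resolution."""
    if seed < 0:
        # -1 means "random", which would defeat the purpose of reusing a seed.
        raise ValueError("use a fixed, non-negative seed for comparable runs")
    common = {
        "image": image,
        "prompt": prompt,
        "duration": duration,
        "movement_amplitude": movement_amplitude,
        "seed": seed,
    }
    draft = {**common, "resolution": "540p"}    # cheaper test render
    final = {**common, "resolution": "1080p"}   # higher-quality final render
    return draft, final
```

Submit the draft body first, adjust the prompt or movement amplitude until the motion looks right, then submit the final body with the same seed.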

Notes

Both image and prompt are required.
duration supports 1–16 seconds.
resolution defaults to 720p.
movement_amplitude defaults to auto.
generate_audio and bgm default to true.
seed uses -1 for random generation.
Pricing depends only on duration and resolution.

Related Models

vidu/q3-pro/start-end-to-video: another Vidu video generation model that may be useful when you need different quality, speed, or workflow trade-offs.

Q3 Pro Image To Video API — Quick start

Get a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/vidu/q3-pro/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status changes to completed, then read the output URL from data.outputs[0]. The examples below show this flow for Q3 Pro Image To Video.

HTTP example

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/vidu/q3-pro/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "720p",
    "duration": 5,
    "movement_amplitude": "auto",
    "generate_audio": true,
    "bgm": true,
    "seed": 0
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules do not allow top-level await, so run inside an async function.
async function main() {
  const result = await client.run("vidu/q3-pro/image-to-video", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "720p",
    "duration": 5,
    "movement_amplitude": "auto",
    "generate_audio": true,
    "bgm": true,
    "seed": 0
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "vidu/q3-pro/image-to-video",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "resolution": "720p",
        "duration": 5,
        "movement_amplitude": "auto",
        "generate_audio": True,  # Python booleans are capitalized
        "bgm": True,
        "seed": 0,
    },
)

print(output["outputs"][0])  # → URL of the generated output
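
If you prefer not to use the SDK, the polling step from the HTTP example can be sketched with the Python standard library. This is a minimal sketch under stated assumptions: the response body is assumed to be JSON with `status` and `outputs` nested under `data` (as implied by `data.outputs[0]`), `"failed"` is an assumed failure status, and `fetch_result` and `wait_for_output` are hypothetical helper names.

```python
import json
import time
import urllib.request

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id, api_key):
    """GET the prediction result and return the parsed JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def wait_for_output(request_id, api_key, fetch=fetch_result,
                    interval=2.0, max_wait=600.0, sleep=time.sleep):
    """Poll until status is "completed", then return data.outputs[0]."""
    deadline = time.monotonic() + max_wait
    while True:
        data = fetch(request_id, api_key).get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed name of the failure status
            raise RuntimeError(f"prediction failed: {data.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction {request_id} did not finish in {max_wait}s")
        sleep(interval)
```

Pass the prediction id returned by the submission request as `request_id`. The `fetch` and `sleep` parameters exist so the loop can be exercised with stubs instead of real network calls.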

Q3 Pro Image To Video API — Frequently asked questions

What is the Q3 Pro Image To Video API?

Q3 Pro Image To Video is a Vidu model that generates video from images, exposed as a REST API on WaveSpeedAI. It animates still images with high-quality motion via viduq3-pro, producing clips of 1–16 seconds. Billing follows Vidu's published Q3-pro per-second rates by resolution. You can call it programmatically or try it from the playground above.

How do I call the Q3 Pro Image To Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-pro-image-to-video.

How much does Q3 Pro Image To Video cost per run?

The base price for Q3 Pro Image To Video is $0.25 for a 5-second clip at 540p. The final charge scales linearly with duration and depends on resolution, so a longer or higher-resolution clip costs more than a minimal one; the other parameters do not affect pricing. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Q3 Pro Image To Video accept?

Key inputs: `prompt`, `image`, `resolution`, `duration`, `movement_amplitude`, `generate_audio`, `bgm`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/vidu/vidu-q3-pro-image-to-video.

How do I get started with the Q3 Pro Image To Video API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Q3 Pro Image To Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Vidu). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.
