Browse ModelsViduVidu Reference To Video Q2

Vidu Reference To Video Q2

Vidu Reference To Video Q2

Playground

Try it on WavespeedAI!

Vidu Q2 is an Image-to-Video and Reference-to-Video model that emphasizes subtle facial expressions and smooth push-pull camera moves for natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Vidu Q2 Reference-to-Video

Vidu Q2 Reference-to-Video transforms one or multiple input images into expressive, cinematic videos. It excels at producing subtle facial motion, natural body dynamics, and camera-aware storytelling — ideal for turning still portraits or concept images into smooth motion clips.


Why Choose This?

  • Smooth motion realism Subtle micro-expressions, eye movements, and breathing motions reproduced authentically.

  • Cinematic camera dynamics Built-in control of push/pull, pan, tilt, and zoom effects for scene depth and emotional tone.

  • Multiple-image reference support Upload up to 7 reference images to guide pose, lighting, or perspective transitions.

  • Flexible composition Choose from multiple aspect ratios (16:9, 9:16, 4:3, 3:4, 1:1) for any platform.

  • Motion amplitude control Select auto, small, medium, or large to define the strength and style of movement.

  • High fidelity output Consistent lighting, identity preservation, and accurate reference adherence.


Parameters

ParameterRequiredDescription
promptYesDescribe the scene, action, or mood
imagesYesReference images (up to 7 images)
aspect_ratioNoAspect ratio: 16:9, 9:16, 4:3, 3:4, or 1:1
resolutionNoOutput resolution: 540p, 720p, or 1080p
durationNoVideo length in seconds (1–10)
movement_amplitudeNoMotion intensity: auto, small, medium, or large
seedNoRandom seed for reproducibility (-1 for random)

How to Use

  1. Upload reference images — add up to 7 images to guide the generation.
  2. Write your prompt — describe the scene, action, camera motion, or mood.
  3. Choose aspect ratio — select based on your target platform.
  4. Set resolution — 540p, 720p, or 1080p based on quality needs.
  5. Set duration — choose video length from 1 to 10 seconds.
  6. Adjust movement amplitude — auto for portraits, medium/large for action.
  7. Run — submit and download your video.

Pricing

ResolutionDurationPrice
540p1s$0.075
540p2s$0.10
540p3s$0.125
540p4s$0.15
540p5s$0.175
540p6s$0.20
540p7s$0.225
540p8s$0.25
540p9s$0.35
540p10s$0.45
720p1s$0.125
720p2s$0.15
720p3s$0.175
720p4s$0.20
720p5s$0.225
720p6s$0.25
720p7s$0.275
720p8s$0.30
720p9s$0.40
720p10s$0.50
1080p1s$0.375
1080p2s$0.425
1080p3s$0.475
1080p4s$0.525
1080p5s$0.575
1080p6s$0.625
1080p7s$0.675
1080p8s$0.725
1080p9s$0.825
1080p10s$0.925

Billing Rules

540p: $0.075 for 1s, +$0.025/s up to 8s, then $0.35 for 9s, $0.45 for 10s

720p: $0.125 for 1s, +$0.025/s up to 8s, then $0.40 for 9s, $0.50 for 10s

1080p: $0.375 for 1s, +$0.05/s up to 8s, then $0.825 for 9s, $0.925 for 10s


Best Use Cases

  • Filmmakers and Storytellers — Bring still characters or concept art to life with controlled, cinematic motion.
  • Advertising Creators — Generate short motion ads with precise control over composition and intensity.
  • Artists and Illustrators — Animate hand-drawn or AI-generated portraits into dynamic living forms.
  • Game and Animation Studios — Prototype visual narratives quickly using character or environment references.

Pro Tips

  • Use consistent lighting and angles among reference images for smoother transitions.
  • Write prompts that define camera motion, emotion, or scene tone clearly.
  • “auto” movement amplitude works best for portrait-style animation.
  • Use “medium” or “large” amplitude for full-body or action scenes.
  • For cinematic looks, pair 16:9 with 1080p and descriptive atmosphere prompts.

Notes

  • Maximum 7 reference images per generation.
  • Maximum duration is 10 seconds.
  • If using image URLs, ensure they are publicly accessible.
  • Successfully loaded images will display as thumbnails in the interface.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/vidu/reference-to-video-q2" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "movement_amplitude": "auto",
    "seed": 0
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The positive prompt for the generation.
imagesarrayYes[]1 ~ 7 itemsReference images for video generation. Requirements: 1. Accept 1-7 images; 2. Images can be URLs or Base64 encoded
aspect_ratiostringNo16:916:9, 9:16, 4:3, 3:4, 1:1The aspect ratio of the generated media.
resolutionstringNo720p540p, 720p, 1080pThe resolution of the generated media.
durationnumberNo51 ~ 10The duration of the generated media in seconds.
movement_amplitudestringNoautoauto, small, medium, largeThe movement amplitude of objects in the frame. Defaults to auto, accepted value: auto, small, medium, large.
seedintegerNo--1 ~ 2147483647The random seed to use for the generation.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.