WaveSpeed.ai
Inicio/Explorar/Vidu Models/vidu/reference-to-video-q2
image-to-video

image-to-video

Vidu Q2 Reference To Video

vidu/reference-to-video-q2

Vidu Q2 is an Image-to-Video and Reference-to-Video model that emphasizes subtle facial expressions and smooth push-pull camera moves for natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

preview

Idle

Tu solicitud costará $0.1 por ejecución.

Con $1 puedes ejecutar este modelo aproximadamente 10 veces.

Una cosa más::

EjemplosVer todo

README

Vidu Q2 Reference-to-Video

Vidu Q2 Reference-to-Video transforms one or multiple input images into expressive, cinematic videos. It excels at producing subtle facial motion, natural body dynamics, and camera-aware storytelling — ideal for turning still portraits or concept images into smooth motion clips.

Why Choose This?

  • Smooth motion realism Subtle micro-expressions, eye movements, and breathing motions reproduced authentically.

  • Cinematic camera dynamics Built-in control of push/pull, pan, tilt, and zoom effects for scene depth and emotional tone.

  • Multiple-image reference support Upload up to 7 reference images to guide pose, lighting, or perspective transitions.

  • Flexible composition Choose from multiple aspect ratios (16:9, 9:16, 4:3, 3:4, 1:1) for any platform.

  • Motion amplitude control Select auto, small, medium, or large to define the strength and style of movement.

  • High fidelity output Consistent lighting, identity preservation, and accurate reference adherence.

Parameters

ParameterRequiredDescription
promptYesDescribe the scene, action, or mood
imagesYesReference images (up to 7 images)
aspect_ratioNoAspect ratio: 16:9, 9:16, 4:3, 3:4, or 1:1
resolutionNoOutput resolution: 540p, 720p, or 1080p
durationNoVideo length in seconds (1–10)
movement_amplitudeNoMotion intensity: auto, small, medium, or large
seedNoRandom seed for reproducibility (-1 for random)

How to Use

  1. Upload reference images — add up to 7 images to guide the generation.
  2. Write your prompt — describe the scene, action, camera motion, or mood.
  3. Choose aspect ratio — select based on your target platform.
  4. Set resolution — 540p, 720p, or 1080p based on quality needs.
  5. Set duration — choose video length from 1 to 10 seconds.
  6. Adjust movement amplitude — auto for portraits, medium/large for action.
  7. Run — submit and download your video.

Pricing

ResolutionDurationPrice
540p1s$0.075
540p2s$0.10
540p3s$0.125
540p4s$0.15
540p5s$0.175
540p6s$0.20
540p7s$0.225
540p8s$0.25
540p9s$0.35
540p10s$0.45
720p1s$0.125
720p2s$0.15
720p3s$0.175
720p4s$0.20
720p5s$0.225
720p6s$0.25
720p7s$0.275
720p8s$0.30
720p9s$0.40
720p10s$0.50
1080p1s$0.375
1080p2s$0.425
1080p3s$0.475
1080p4s$0.525
1080p5s$0.575
1080p6s$0.625
1080p7s$0.675
1080p8s$0.725
1080p9s$0.825
1080p10s$0.925

Billing Rules

540p: $0.075 for 1s, +$0.025/s up to 8s, then $0.35 for 9s, $0.45 for 10s

720p: $0.125 for 1s, +$0.025/s up to 8s, then $0.40 for 9s, $0.50 for 10s

1080p: $0.375 for 1s, +$0.05/s up to 8s, then $0.825 for 9s, $0.925 for 10s

Best Use Cases

  • Filmmakers and Storytellers — Bring still characters or concept art to life with controlled, cinematic motion.
  • Advertising Creators — Generate short motion ads with precise control over composition and intensity.
  • Artists and Illustrators — Animate hand-drawn or AI-generated portraits into dynamic living forms.
  • Game and Animation Studios — Prototype visual narratives quickly using character or environment references.

Pro Tips

  • Use consistent lighting and angles among reference images for smoother transitions.
  • Write prompts that define camera motion, emotion, or scene tone clearly.
  • "auto" movement amplitude works best for portrait-style animation.
  • Use "medium" or "large" amplitude for full-body or action scenes.
  • For cinematic looks, pair 16:9 with 1080p and descriptive atmosphere prompts.

Notes

  • Maximum 7 reference images per generation.
  • Maximum duration is 10 seconds.
  • If using image URLs, ensure they are publicly accessible.
  • Successfully loaded images will display as thumbnails in the interface.

Related Models