vidu/reference-to-video-2.0

Create videos that align with reference subjects—like characters, objects, and environments—using the world’s first Multi-Entity Consistency feature.

image-to-video

new

preview
preview

Idle

https://d2g64w682n9w0w.cloudfront.net/media/images/1745492395158998117_CPokhd95.webp

Your request will cost $0.2 per video,
For $1 you can run this model approximately 5 times.

ExamplesView more examples

README

Vidu2.0 Reference to Video maintains the character and beauty of reference images for video production. The model keeps the facial and visual consistency of avatars, characters and logos.

Key Features

  • Identity-locked generation
  • Smooth temporal transitions
  • Consistent character motion
  • Visual style adherence

ComfyUI

vidu 2.0 Reference to Video is available on ComfyUI, providing local inference capabilities through a node-based workflow, ensuring flexible and efficient image generation on your system.

Use Cases

  • Digital influencers & avatars
  • Story-driven video characters
  • Fashion or cosplay generation
  • Personalization in marketing

Accelerated Inference

Our accelerated inference approach leverages advanced optimization technology from WavespeedAI. This innovative fusion technique significantly reduces computational overhead and latency, enabling rapid image generation without compromising quality. The entire system is designed to efficiently handle large-scale inference tasks while ensuring that real-time applications achieve an optimal balance between speed and accuracy. For further details, please refer to the blog post.