Home/Explore/Hunyuan Video Models/wavespeed-ai/hunyuan-video/i2v

image-to-video

wavespeed-ai/hunyuan-video/i2v

HunyuanVideo is an advanced image-to-video generation model that can create high-quality videos from text descriptions.

Doc

Hint: You can drag and drop a file or click to upload

preview
If set to true, the safety checker will be enabled.

Idle

Your request will cost $0.4 per run.

For $10 you can run this model approximately 25 times.

One more thing:

ExamplesView all

README

HunyuanVideo

HunyuanVideo is an advanced image-to-video generation model that can create high-quality videos from text descriptions. It features a comprehensive framework that integrates image-video joint model training and efficient infrastructure for large-scale model training and inference.

Model Description

This model is trained on a spatial-temporally compressed latent space and uses a large language model for text encoding. According to professional human evaluation results, HunyuanVideo outperforms previous state-of-the-art models in terms of text alignment, motion quality, and visual quality.

Key features

  • 🎨 High-quality video generation from text descriptions
  • 📐 Support for various aspect ratios and resolutions
  • ✍️ Advanced prompt handling with a built-in rewrite system
  • 🎯 Stable motion generation and temporal consistency