bannerbanner
Join Waitlist
Home/Explore/Hunyuan Video Models/wavespeed-ai/hunyuan-video/i2v

image-to-video

Hunyuan I2V | Image To Video Generation From Images And Text Prompts | WaveSpeedAI

wavespeed-ai/hunyuan-video/i2v

Hunyuan i2v turns images and text prompts into high-quality videos, generating coherent short clips from descriptive inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Hint: You can drag and drop a file or click to upload

preview

Idle

Your request will cost $0.4 per run.

For $10 you can run this model approximately 25 times.

One more thing::

ExamplesView all

README

HunyuanVideo

HunyuanVideo is an advanced image-to-video generation model that can create high-quality videos from text descriptions. It features a comprehensive framework that integrates image-video joint model training and efficient infrastructure for large-scale model training and inference.

Model Description

This model is trained on a spatial-temporally compressed latent space and uses a large language model for text encoding. According to professional human evaluation results, HunyuanVideo outperforms previous state-of-the-art models in terms of text alignment, motion quality, and visual quality.

Key features

  • 🎨 High-quality video generation from text descriptions
  • 📐 Support for various aspect ratios and resolutions
  • ✍️ Advanced prompt handling with a built-in rewrite system
  • 🎯 Stable motion generation and temporal consistency