Hunyuan I2V | Image To Video Generation From Images And Text Prompts

WaveSpeedAI × WAN: SpeedUp 2nd - In CharacterJoin

Dashboard Explore Inspiration

Home/Explore/Hunyuan Video Models/wavespeed-ai/hunyuan-video/i2v

image-to-video

wavespeed-ai/hunyuan-video/i2v

Hunyuan i2v turns images and text prompts into high-quality videos, generating coherent short clips from descriptive inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Documentation

Enable Safety Checker

Idle

Your request will cost $0.4 per run.

For $10 you can run this model approximately 25 times.

One more thing::

ExamplesView all

README

HunyuanVideo

HunyuanVideo is an advanced image-to-video generation model that can create high-quality videos from text descriptions. It features a comprehensive framework that integrates image-video joint model training and efficient infrastructure for large-scale model training and inference.

Model Description

This model is trained on a spatial-temporally compressed latent space and uses a large language model for text encoding. According to professional human evaluation results, HunyuanVideo outperforms previous state-of-the-art models in terms of text alignment, motion quality, and visual quality.

Key features

🎨 High-quality video generation from text descriptions
📐 Support for various aspect ratios and resolutions
✍️ Advanced prompt handling with a built-in rewrite system
🎯 Stable motion generation and temporal consistency