HunyuanVideo-1.5 Text-to-Video
HunyuanVideo-1.5 is Tencent’s lightweight text-to-video generation model that delivers state-of-the-art visual quality and motion coherence with only 8.3B parameters. It is designed to be both powerful and efficient, making high-quality video generation accessible for everyday creators and production workflows on WaveSpeedAI.
Key Features
- High-quality video generation directly from text prompts
- Lightweight 8.3B parameters for fast inference on consumer-grade GPUs
- Video durations: 5 s, 8 s, and 10 s
- Strong motion coherence and stable subject identity
Pricing
| Resolution | Price per second |
|---|
| 480p | $0.02 / s |
| 720p | $0.04 / s |
How to Use
- Write your text prompt describing the scene, characters, motion, camera movement, and overall style.
- Select the duration: 5 s, 8 s, or 10 s.
- Optionally tweak inference steps or seed to balance speed, quality, and reproducibility.
- Run the job from the WaveSpeedAI interface.
- Preview the generated clip and download it from the dashboard.
Tips for Best Results
- Be explicit: describe who is in the scene, what they are doing, where they are, and how the camera moves.
- Mention style and mood (for example, “cinematic lighting,” “handheld documentary,” “anime style,” “neon cyberpunk city”).
- Shorter clips (5–8 s) generally produce the most coherent and visually stable results.
- Reuse similar prompts and seeds when you want a series of related shots that share style and characters.
Upscaling for Higher Quality
After generating your base video with HunyuanVideo-1.5, you can use WaveSpeedAI’s dedicated video super-resolution models to enhance clarity and sharpness:
Generate efficiently at 480p or 720p, then upscale to higher resolutions for a better final viewing experience.