Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only

Hunyuan Models

Tencent's Hunyuan delivers state-of-the-art video and image generation with 3D awareness and temporal consistency.

Tencent's Hunyuan delivers state-of-the-art video and image generation with 3D awareness and temporal consistency.

All Models

20 models
text-to-3d

wavespeed-ai/hunyuan-3d-v3.1/text-to-3d-rapid

Hunyuan 3D V3.1 Rapid is a fast text-to-3D generation model that quickly creates 3D models from text descriptions. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

wavespeed-ai/hunyuan-image-3-instruct/edit

Hunyuan Image 3.0 Instruct Edit – instruction-based image editing with natural language prompts, supporting up to 2 reference images for precise modifications. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-image

wavespeed-ai/hunyuan-image-3-instruct/text-to-image

Hunyuan Image 3.0 Instruct text-to-image model from Tencent with high-quality image generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan-3d-v3.1/image-to-3d-rapid

Hunyuan 3D V3.1 Rapid is a fast image-to-3D generation model, quickly converting 2D images into 3D models. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-dubbing

wavespeed-ai/hunyuan-video-foley

HunyuanVideo-Foley generates realistic Foley and ambient audio from an uploaded video using a text prompt to describe desired sounds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

wavespeed-ai/hunyuan-image-2.1

HunyuanImage-2.1 is an efficient diffusion text-to-image model producing high-resolution 2K images with detailed, photorealistic results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

digital-human

wavespeed-ai/hunyuan-avatar

Hunyuan Avatar creates audio-driven talking or singing videos from one image + audio, in 480p/720p up to 120s (starts at $0.15/5s). Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/hunyuan-video/i2v

Hunyuan i2v turns images and text prompts into high-quality videos, generating coherent short clips from descriptive inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/hunyuan-video/t2v

Hunyuan Video (t2v) is an advanced text-to-video model that generates high-quality videos from text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/hunyuan-video-1.5/image-to-video

HunyuanVideo-1.5 (i2v) is a lightweight 8.3B parameter image-to-video model that generates high-quality videos from images with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/hunyuan-video-1.5/text-to-video

HunyuanVideo-1.5 (t2v) is a lightweight 8.3B parameter text-to-video model that generates high-quality videos with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d-v2-multi-view

Hunyuan3D V2 Multi-View is Tencent's image-to-3D generative model on WaveSpeedAI that builds 3D reconstructions from multi-view images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d-v3/image-to-3d

Transform your photos into ultra-high-resolution 3D models in seconds with Tencent's Hunyuan3D V3 Image to 3D. Film-quality geometry with PBR textures from single or multi-view images, ready for games, e-commerce, and 3D printing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d-v3/sketch-to-3d

Transform your sketches into detailed 3D models with Tencent's Hunyuan3D V3. Convert hand-drawn sketches and concept art into high-quality 3D assets with textures, perfect for rapid prototyping and game development. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-3d

wavespeed-ai/hunyuan3d-v3/text-to-3d

Turn text prompts into detailed, fully-textured 3D models with Tencent's Hunyuan3D V3. Generate high-quality 3D assets with PBR materials from simple descriptions, ready for Unity, Unreal, and Blender. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d/v2-base

Hunyuan3D-V2-Base is a state-of-the-art Image-to-3D model by Tencent that turns images into 3D assets for visualization and content. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d/v2-mini

Hunyuan3D-V2-Mini is a Tencent image-to-3D generative model available on WaveSpeedAI. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d/v2-multi-view

Hunyuan3D V2 Multi-View generates accurate 3D reconstructions from multiple images. Tencent-developed and available on WaveSpeedAI. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/hunyuan3d/v2.1

Tencent Hunyuan3D v2.1 is a scalable 3D asset-creation system that advances state-of-the-art 3D generation for asset workflows. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

wavespeed-ai/hunyuan-image-3

HunyuanImage-3.0: An AR multimodal model unifying understanding & generation. Its Text-To-Image module rivals closed-source leaders. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Hunyuan Models

Video & Audio

  1. hunyuan-video-foley

AI video foley and sound design that automatically adds footsteps, ambience, and effects while matching the timing and context of your footage.

Video Generation (2D)

  1. hunyuan-video/i2v

Image-to-video generator that turns a single frame into a short, smooth clip while preserving composition and scene structure.

  1. hunyuan-video/t2v

Text-to-video model for quickly generating stylized clips from natural-language prompts.

  1. hunyuan-video-1.5/image-to-video

Upgraded image-to-video model with stronger temporal stability, richer detail, and more cinematic camera motion.

  1. hunyuan-video-1.5/text-to-video

Version 1.5 text-to-video model offering better prompt adherence, cleaner structure, and higher overall visual quality.

Image & Avatar

  1. hunyuan-image-2.1

Efficient general-purpose image generator that produces clean, realistic images from text prompts for a wide range of use cases.

  1. hunyuan-avatar

Portrait-focused model for stylized avatars and character art, with vivid color, expressive faces, and consistent personal style.

  1. hunyuan-image-3 & instruct

Advanced image model with stronger 3D awareness and scene composition, ideal for intricate diorama-style and environment-heavy visuals.

3D Models

  1. hunyuan3d-v2-multi-view

3D multi-view generator that produces consistent renders of a character or object from multiple angles for downstream 3D work.

  1. hunyuan3d/v2-base

Base 3D generator that creates solid, stylized shapes suitable for further sculpting, shading, or printing.

  1. hunyuan3d/v2-mini

Lightweight 3D model specialized for compact, toy-like characters and small, stylized objects.

  1. hunyuan3d/v2-multi-view

Multi-view variant of the v2 3D model that outputs coherent character views across several camera positions.

  1. hunyuan3d/v2.1

Enhanced v2.1 3D generator with higher geometric fidelity and cleaner structure for polished stylized characters.