Wan 2.1 Video Models

WAN 2.1, optimized by WaveSpeedAI, delivers state-of-the-art AI content generation with real-time performance.

All Models

35 models
video-to-video

wavespeed-ai/wan-2.1/mocha

MoCha performs Video-To-Video character swaps using reference images, replacing a video's character without per-frame pose or depth maps. Ready-to-use REST inference API, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-flf2v

Wan-2.1 FLF2V converts a start and end frame into a smooth, coherent video sequence, bridging frames with realistic motion transitions. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/i2v-720p-lora-ultra-fast

WAN 2.1 i2v 720p is an ultra-fast Image-to-Video model that turns images into 720P videos and supports custom LoRAs for style control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

digital-human

wavespeed-ai/wan-2.1/multitalk

MultiTalk (WAN 2.1) is an audio-driven AI that turns a single image and audio into talking or singing conversational videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/ditto

Wan2.1-DITTO is a unified video-to-video model for realistic style transfer and reenactment, replicating holistic movement and expressions across frames. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/synthetic-to-real-ditto

WAN 2.1 Synthetic To Real Ditto mirrors motion and facial expressions in video-to-video synthetic-to-real conversion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

training

wavespeed-ai/wan-2.1-14b-lora-trainer

Train custom Wan 2.1 LoRA models 10x faster. Style training, character training, object training. From concept to model in minutes, not hours. Upload a ZIP file containing images to start!

lora-support

wavespeed-ai/wan-2.1/i2v-480p-lora

Generate unlimited 480P AI videos with WAN 2.1 Image-to-Video and custom LoRA support for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1/i2v-480p

Wan 2.1 i2v-480p turns images into unlimited 480p AI videos with the Wan 2.1 image-to-video model, perfect for fast content creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1/i2v-720p

WAN 2.1 i2v converts images into unlimited 720P AI videos for scalable content generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

wavespeed-ai/wan-2.1/text-to-image

Wan 2.1 Text-to-Image delivers ultra-realistic photographic images by adapting the Wan 2.1 video model for SOTA visual fidelity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1/i2v-480p-ultra-fast

Wan 2.1 i2v 480p Ultra-Fast enables unlimited image-to-video generation at 480p for fast, reliable video creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/wan-2.1/t2v-720p

WAN 2.1 T2V 720P offers text-to-video 720p generation from prompts, enabling unlimited AI video creation for social and marketing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/text-to-image-lora

Wan 2.1 Text-to-Image LoRA repurposes Wan 2.1 to create ultra-realistic images with exceptional detail and LoRA fine-tuning support. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1/i2v-720p-lora

Wan 2.1 i2v-720p generates image-to-video outputs at 720p and supports custom LoRA adapters for personalized styles and fine-tuning. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1/i2v-720p-ultra-fast

WAN 2.1 Image-to-Video (i2v) 720P Ultra-Fast converts images into 720P videos with ultra-fast inference and supports unlimited AI video generation for high-throughput workflows. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

wavespeed-ai/wan-2.1-14b-vace

WAN 2.1 VACE is an all-in-one video model supporting Reference-to-Video (Image-to-Video), V2V, Masked V2V and Move/Swap/Animate capabilities. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/i2v-480p-lora-ultra-fast

Wan 2.1 i2v 480p Ultra-Fast generates unlimited image-to-video content at 480p, supporting custom LoRAs for style personalization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/t2v-480p-lora

Wan 2.1 t2v-480p-lora generates unlimited 480P text-to-video outputs with custom LoRAs for personalized styles and precise control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/v2v-480p-ultra-fast

Ultra-fast Wan 2.1 Video-to-Video (v2v) model for generating unlimited AI videos at 480p from existing video inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/wan-2.1/t2v-480p

Wan 2.1 creates unlimited text-to-video content at 480P from simple text prompts, ideal for prototyping and content generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/t2v-720p-lora

Wan 2.1 Text-to-Video 720P creates 720P videos from text prompts and supports custom LoRAs for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/v2v-480p

WAN 2.1 V2V (video-to-video) converts source clips into unlimited AI-generated 480p videos for scalable content creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/wan-2.1/t2v-480p-ultra-fast

WAN 2.1 T2V 480p Ultra-Fast turns text prompts into unlimited 480p AI videos with ultra-fast throughput and reliable 480p output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/t2v-720p-lora-ultra-fast

WAN 2.1 Text-to-Video 720P delivers unlimited ultra-fast videos from text prompts and supports custom LoRAs for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/t2v-480p-lora-ultra-fast

WAN 2.1 T2V 480p delivers ultra-fast text-to-video generation with custom LoRA support for unlimited 480p AI videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/v2v-720p

Wan 2.1 V2V converts source videos into AI 720p outputs for scalable video-to-video production and unlimited video generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

wavespeed-ai/wan-2.1/t2v-720p-ultra-fast

WAN 2.1 Text-to-Video generates high-quality 720P videos from text prompts with an ultra-fast pipeline for unlimited AI videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/wan-2.1/v2v-720p-ultra-fast

Ultra-fast Wan 2.1 V2V generates unlimited 720P video-to-video conversions and supports custom LoRAs for personalized styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/v2v-720p-lora

Wan 2.1 V2V 720P LoRA converts source videos into 720P AI-enhanced video-to-video edits with support for custom LoRA personalization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/v2v-720p-lora-ultra-fast

Wan 2.1 V2V 720p LoRA Ultra-Fast converts videos to 720p with custom LoRA support and lets you generate unlimited AI videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

alibaba/wan-2.1/i2v-plus-720p

WAN 2.1 i2v-plus 720P turns still images into smooth image-to-video clips, enabling unlimited AI videos from image inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

alibaba/wan-2.1/t2v-plus-720p

WAN 2.1 T2V Plus (720p) turns text prompts into high-quality 720p videos and supports unlimited AI video generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/v2v-480p-lora-ultra-fast

Wan 2.1 V2V 480p is an ultra-fast video-to-video model that generates unlimited AI videos and supports custom LoRAs for personalization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

lora-support

wavespeed-ai/wan-2.1/v2v-480p-lora

WAN 2.1 V2V 480p LoRA generates unlimited 480p video-to-video edits with custom LoRA support for tailored styles. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
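
Many of the model IDs listed above appear to follow a regular shape: a task code (t2v, i2v, v2v), a resolution, and optional -lora and -ultra-fast suffixes. The sketch below decodes that shape so a variant can be picked programmatically. The pattern is inferred from this listing rather than taken from any documented naming scheme, and `parse_variant` and `Variant` are hypothetical names introduced for illustration. IDs that do not fit the pattern (for example mocha, multitalk, or the i2v-plus models) are rejected rather than guessed at.

```python
import re
from typing import NamedTuple

# Task codes as they appear in the model IDs of this listing.
TASKS = {"t2v": "text-to-video", "i2v": "image-to-video", "v2v": "video-to-video"}

# Assumed shape of the last path segment: <task>-<resolution>[-lora][-ultra-fast]
SLUG_PATTERN = re.compile(r"(t2v|i2v|v2v)-(\d+p)(-lora)?(-ultra-fast)?")


class Variant(NamedTuple):
    task: str
    resolution: str
    lora: bool
    ultra_fast: bool


def parse_variant(model_id: str) -> Variant:
    """Decode a model ID such as 'wavespeed-ai/wan-2.1/i2v-720p-lora-ultra-fast'."""
    slug = model_id.rsplit("/", 1)[-1]  # keep only the final path segment
    match = SLUG_PATTERN.fullmatch(slug)
    if match is None:
        raise ValueError(f"model ID does not follow the assumed pattern: {model_id}")
    task_code, resolution, lora, ultra_fast = match.groups()
    return Variant(TASKS[task_code], resolution, lora is not None, ultra_fast is not None)


if __name__ == "__main__":
    print(parse_variant("wavespeed-ai/wan-2.1/i2v-720p-lora-ultra-fast"))
    print(parse_variant("wavespeed-ai/wan-2.1/t2v-480p"))
```

Rejecting unknown shapes with a `ValueError` keeps the helper from silently mislabeling one-off models whose IDs carry no task code or resolution.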

WAN 2.1 Model Collection

WAN 2.1, developed by Alibaba's Tongyi Lab and optimized by WaveSpeedAI, is a suite of AI content generation models. It includes multiple specialized versions, among them the 14B-parameter model and custom LoRA training.

Core Models:

WAN 2.1 Base

  1. State-of-the-art video generation
  2. Multi-modal content creation
  3. Real-time processing capability

WAN 2.1 14B VACE

  1. Enhanced visual understanding
  2. Advanced parameter optimization
  3. Superior output quality

WAN 2.1 LoRA Trainer

  1. Custom model fine-tuning
  2. Specialized training capabilities
  3. Personalized content generation
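
The trainer listing above says training starts from a ZIP file containing images. A minimal sketch of preparing such an archive with Python's standard library follows. The accepted image formats and the flat archive layout are assumptions, not stated on this page, and `build_training_zip` is a hypothetical helper name.

```python
import zipfile
from pathlib import Path

# Assumed set of image formats; the page does not say which ones are accepted.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".webp"}


def build_training_zip(image_dir, zip_path):
    """Pack the images directly under image_dir into a flat ZIP; return their names."""
    image_dir = Path(image_dir)
    images = sorted(
        p for p in image_dir.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    if not images:
        raise ValueError(f"no images found in {image_dir}")
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for image in images:
            # arcname drops the directory prefix so files sit at the archive root
            archive.write(image, arcname=image.name)
    return [image.name for image in images]
```

Calling `build_training_zip("my_style_images", "dataset.zip")` would write `dataset.zip` next to the script; both paths here are placeholders. Non-image files in the folder are skipped rather than uploaded.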

WAN FLF2V

  1. Specialized video synthesis
  2. Optimized workflow integration
  3. Enhanced performance

Key Features

Comprehensive Task Support

  1. Text-to-Video (T2V)
  2. Image-to-Video (I2V)
  3. Video Editing
  4. Text-to-Image (T2I)
  5. Video-to-Audio (V2A)

Technical Advantages

WaveSpeedAI Optimization

  1. Ultra-fast inference engine
  2. Real-time generation
  3. Enterprise-grade reliability

This advanced suite combines Alibaba's innovative research with WaveSpeedAI's optimization technology, delivering professional-grade content generation with exceptional speed and quality.