Home/Explore/Wan 2.2 Video Models/wavespeed-ai/wan-2.2/fun-control

video-to-video

wavespeed-ai/wan-2.2/fun-control

Wan2.2-Fun-Control is a next-generation video generation and control model launched by Alibaba PAI team. Through innovative Control Codes mechanism combined with deep learning and multi-modal conditional inputs, it can generate high-quality videos that comply with preset control conditions. The model is released under the Apache 2.0 license and supports commercial use. Our endpoint starts with $0.2 per 5 seconds (480p) or $0.4 per 5 seconds (720p) video generation and supports a maximum generation length of 120 seconds.

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

Idle

Your request will cost $0.2 per run.

For $10 you can run this model approximately 50 times.

One more thing:

ExamplesView all

README

Wan2.2-Fun-Control

What is Wan2.2-Fun-Control?

Wan2.2-Fun-Control is a next-generation video generation and control model launched by Alibaba PAI team. Through innovative Control Codes mechanism combined with deep learning and multi-modal conditional inputs, it can generate high-quality videos that comply with preset control conditions. The model is released under the Apache 2.0 license and supports commercial use.

Key Features:

  • Multi-modal Control: Supports multiple control conditions including Canny (line art), Depth, OpenPose (human pose), MLSD (geometric edges), and trajectory control
  • High-Quality Video Generation: Based on the Wan2.2 architecture, outputs film-level quality videos
  • Multi-language Support: Supports multi-language prompts including Chinese and English

Pricing

Our endpoint starts with $0.2 per 5 seconds (480p) or $0.4 per 5 seconds (720p) video generation and supports a maximum generation length of 120 seconds.

Tips

The composition style, as well as the camera position and human body pose of the reference image and the video should be as consistent as possible; otherwise, the probability of generation failure will increase significantly.

The aspect ratio of the input image and video should be the same to achieve the best output.