Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only

Wan 2.7 Models

Wan 2.7 Models unify text-, image-, and reference-driven video generation with native, synchronized audio in one pass—delivering sharper detail, smoother cinematic motion, and more consistent camera language for production-ready storytelling at scale.

Wan 2.7 Models unify text-, image-, and reference-driven video generation with native, synchronized audio in one pass—delivering sharper detail, smoother cinematic motion, and more consistent camera language for production-ready storytelling at scale.

All Models

9 models
text-to-video

alibaba/wan-2.7/text-to-video

WAN 2.7 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

alibaba/wan-2.7/image-to-video

WAN 2.7 converts images into videos (720p/1080p) with optional audio, supporting first and last frame control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

alibaba/wan-2.7/reference-to-video

WAN 2.7 Reference-to-Video turns character, prop, or scene references from images or videos into new video shots with preserved identity, style, and layout plus smooth, coherent motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-to-video

alibaba/wan-2.7/video-edit

WAN 2.7 Video Edit performs prompt-driven video editing with multi-image reference support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-extend

alibaba/wan-2.7/video-extend

WAN 2.7 Video Extend extends existing videos with optional last frame control and audio support, supporting 720p/1080p output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-image

alibaba/wan-2.7/image-edit-pro

WAN 2.7 Image Edit Pro performs prompt-driven image editing with multi-image reference support and up to 2K output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

alibaba/wan-2.7/image-edit

WAN 2.7 Image Edit performs prompt-driven image editing with support for multiple-image references. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

alibaba/wan-2.7/text-to-image

WAN 2.7 Text-to-Image generates high-quality images from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

alibaba/wan-2.7/text-to-image-pro

WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Wan 2.7 Models

Alibaba's Wan 2.7 is a production-ready suite of AI image-generation and image-editing endpoints, featuring a built-in thinking mode for enhanced reasoning and higher-quality outputs. Wan 2.7 covers two core workflows: text-to-image generation and image editing, each available in standard and pro tiers, offering flexible quality-cost trade-offs.

Wan 2.7 Series — Text-to-Image & Image Editing API

Wan 2.7 offers four focused endpoints for generating images from text or editing existing images—ideal for commercial pipelines, creative production, and repeatable content workflows.

  1. Wan 2.7 Text-to-Image — Generate high-quality images from text prompts using built-in thinking mode for smarter composition and accurate prompt adherence.
  2. Wan 2.7 Image Edit — Edit and refine images with precise, controlled modifications while preserving structure and subject consistency.
  3. Wan 2.7 Text-to-Image Pro — Pro tier with up to 4K (4096×4096) output for print-ready and large-format assets.
  4. Wan 2.7 Image Edit Pro — Pro tier image editing with up to 2K resolution for high-fidelity results.

Key Features

  1. Thinking Mode — Built-in chain-of-thought reasoning delivers better prompt adherence, more coherent compositions, and fewer artifacts.
  2. Up to 4K Image Output — Pro tier supports up to 4096×4096 resolution, suitable for print and large-format display.
  3. Multi-Image Reference Editing — Upload 1–9 reference images for style, subject, and background guidance in editing workflows.
  4. Standard & Pro Tiers — Choose cost-efficient standard or higher-quality pro endpoints based on your use case.
  5. Seed-Based Reproducibility — Flexible sizing and seed control for fast iteration and repeatable creative testing.