Alibaba WAN 2.6 | Text-to-Image Generator

Alibaba Wan 2.6 Text-to-Image

Alibaba Wan 2.6 Text-to-Image (alibaba/wan-2.6/text-to-image) is Alibaba’s text-to-image generation model for creating high-quality visuals from a single natural-language prompt. It’s built for practical creative workflows—concept art, product visuals, portraits, and stylized imagery—where you want strong prompt adherence plus flexible custom sizing.

Why it stands out

Fast, one-shot text-to-image generation Generate an image in a single run for quick ideation and production workflows.
Custom width × height output Set width and height directly (within the endpoint’s limits) to match banners, thumbnails, posters, or social formats.
Prompt expansion for better results Enable prompt expansion to automatically enrich short prompts with useful detail for more coherent compositions.
Seeded iteration Use a fixed seed to refine style and layout with more repeatable variations.

Parameters

Parameter	Description
prompt*	Text description of the image you want to generate.
width	Output width (within allowed limits).
height	Output height (within allowed limits).
enable_prompt_expansion	Toggle prompt expansion to enrich short prompts.
seed	Set a fixed seed for more repeatable iterations (-1 for random).

How to use

Write a clear prompt (subject + setting + style).
Choose width and height that match your target aspect ratio.
Turn on enable_prompt_expansion if your prompt is short or under-specified.
Set a seed if you want repeatable iterations (keep the same seed while you tweak the prompt).
Click Run, review the result, and iterate.

Prompt tips

Start with subject + environment + style: “A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography.”
Add camera / composition when framing matters: “wide shot, shallow depth of field, 35mm film look.”
Keep instructions positive and specific (what you want to see, not what you fear).

Pricing

$0.03 per generated image

Notes

Output sizing is limited by the endpoint’s current constraints (for example, width/height bounds and aspect-ratio limits). If a size fails, reduce resolution or choose a more standard aspect ratio.
Enabling prompt expansion can improve quality for short prompts, but may add a little latency.
Returned image URLs may be time-limited—save outputs if you need long-term storage.

Related Models

Alibaba Wan 2.5 Text-to-Image — A proven Wan text-to-image model for reliable, cost-stable AI image generation with a similar prompt-first workflow.
ByteDance Seedream V4 Text-to-Image — A style-consistent text-to-image generator for posters, campaigns, and high-volume brand-friendly illustration batches.
FLUX.2 Turbo Edit — A fast natural-language image editing model for precise image-to-image transformations, brand color control, and iterative creative revisions.
Google Nano Banana Pro Edit — High-fidelity prompt-based image editing for composition-preserving changes, product visuals, and reliable on-image text handling.

ExamplesView all

README