Alibaba Wan 2.6 Text-to-Image
Alibaba Wan 2.6 Text-to-Image (alibaba/wan-2.6/text-to-image) is Alibaba’s text-to-image generation model for creating high-quality visuals from a single natural-language prompt. It’s built for practical creative workflows—concept art, product visuals, portraits, and stylized imagery—where you want strong prompt adherence plus flexible custom sizing.
Why it stands out
- Fast, one-shot text-to-image generation: Generate an image in a single run for quick ideation and production workflows.
- Custom width × height output: Set width and height directly (within the endpoint’s limits) to match banners, thumbnails, posters, or social formats.
- Prompt expansion for better results: Enable prompt expansion to automatically enrich short prompts with useful detail for more coherent compositions.
- Seeded iteration: Use a fixed seed to refine style and layout with more repeatable variations.
Parameters
| Parameter | Description |
|---|---|
| prompt* | Text description of the image you want to generate. |
| width | Output width (within allowed limits). |
| height | Output height (within allowed limits). |
| enable_prompt_expansion | Toggle prompt expansion to enrich short prompts. |
| seed | Set a fixed seed for more repeatable iterations (-1 for random). |
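As a rough illustration, the parameters above can be assembled into a request body like this. This is a minimal sketch, not the official client: `build_payload` is a hypothetical helper, the defaults and the 1280 × 720 size are assumptions, and the request URL and authentication are left out because this page does not specify them.

```python
from typing import Any, Dict, Optional


def build_payload(
    prompt: str,
    width: Optional[int] = None,
    height: Optional[int] = None,
    enable_prompt_expansion: bool = False,
    seed: int = -1,
) -> Dict[str, Any]:
    """Assemble a request body from the parameters listed in the table."""
    if not prompt.strip():
        raise ValueError("prompt is required and must not be empty")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "enable_prompt_expansion": enable_prompt_expansion,
        "seed": seed,  # -1 requests a random seed, per the table
    }
    # Sizes the caller did not set are omitted so the endpoint can apply
    # its own defaults (assumed behaviour; the real limits are not listed here).
    if width is not None:
        payload["width"] = width
    if height is not None:
        payload["height"] = height
    return payload


# Hypothetical example values; check the endpoint's size limits before use.
example = build_payload(
    "A modern tea shop interior, warm afternoon light",
    width=1280,
    height=720,
    seed=42,
)
```

Keeping validation in one small function makes it easier to catch an empty prompt before a paid run is submitted.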
How to use
- Write a clear prompt (subject + setting + style).
- Choose width and height that match your target aspect ratio.
- Turn on enable_prompt_expansion if your prompt is short or under-specified.
- Set a seed if you want repeatable iterations (keep the same seed while you tweak the prompt).
- Click Run, review the result, and iterate.
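If you drive the model from a script rather than the Run button, the "same seed, tweaked prompt" step above might look like the following sketch. `seeded_variants` is a hypothetical helper, the seed and the 1024 × 1024 size are arbitrary assumptions, and only the request bodies are built here; how they are submitted depends on your client.

```python
from typing import Any, Dict, List


def seeded_variants(
    prompts: List[str], seed: int, width: int, height: int
) -> List[Dict[str, Any]]:
    """Build one request body per prompt, all sharing the same fixed seed."""
    if seed == -1:
        # -1 means "random" in the parameter table, which defeats the purpose.
        raise ValueError("pick a fixed seed; -1 requests a random one")
    return [
        {"prompt": prompt, "width": width, "height": height, "seed": seed}
        for prompt in prompts
    ]


variants = seeded_variants(
    [
        "A modern tea shop interior, warm afternoon light",
        "A modern tea shop interior, cool morning light",
    ],
    seed=1234,
    width=1024,
    height=1024,
)
```

Holding the seed and size constant means the differences between results are more likely to come from the prompt change itself.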
Prompt tips
- Start with subject + environment + style: “A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography.”
- Add camera / composition when framing matters: “wide shot, shallow depth of field, 35mm film look.”
- Keep instructions positive and specific: describe what you want to see rather than what you want to avoid.
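The subject + environment + style pattern from these tips can be kept consistent across a batch with a small helper. This is only a sketch; `compose_prompt` and its argument names are illustrative and not part of the model's API.

```python
from typing import Optional


def compose_prompt(
    subject: str,
    environment: str,
    style: str,
    camera: Optional[str] = None,
) -> str:
    """Join the prompt parts in a fixed order, skipping empty ones."""
    parts = [subject, environment, style]
    if camera:
        parts.append(camera)  # framing hints go last, only when they matter
    return ", ".join(part.strip() for part in parts if part.strip())


prompt = compose_prompt(
    subject="A modern tea shop interior",
    environment="warm afternoon light, minimalist wood design",
    style="cinematic photography",
    camera="wide shot, shallow depth of field, 35mm film look",
)
```

Changing one argument at a time makes it easier to see which part of the prompt drives a change in the output.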
Pricing
- $0.03 per generated image
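For budgeting a batch, the per-image price translates into a simple multiplication. The sketch below assumes the flat $0.03 rate applies to every generated image; `estimate_cost` is a hypothetical helper, and `Decimal` is used to avoid floating-point rounding in currency amounts.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.03")  # USD, taken from the pricing line above


def estimate_cost(image_count: int) -> Decimal:
    """Return the estimated cost in USD for a number of generated images."""
    if image_count < 0:
        raise ValueError("image_count must not be negative")
    return PRICE_PER_IMAGE * image_count


batch_cost = estimate_cost(250)  # 250 images at $0.03 each is $7.50
```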
Notes
- Output sizing is limited by the endpoint’s current constraints (for example, width/height bounds and aspect-ratio limits). If a size fails, reduce resolution or choose a more standard aspect ratio.
- Enabling prompt expansion can improve quality for short prompts, but may add a little latency.
- Returned image URLs may be time-limited, so save outputs if you need long-term storage.
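Because returned URLs may expire, one option is to copy each result to your own storage as soon as it arrives. The sketch below uses only the Python standard library; `save_image` is a hypothetical helper, and it assumes you already have the image URL as a plain string, since the response format is not described on this page.

```python
import shutil
import urllib.request
from pathlib import Path


def save_image(url: str, destination: Path, timeout: float = 30.0) -> Path:
    """Stream the file at `url` to `destination` and return the saved path."""
    # Create the target folder if it does not exist yet.
    destination.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        with open(destination, "wb") as out_file:
            # Copy in chunks so large images are not held fully in memory.
            shutil.copyfileobj(response, out_file)
    return destination
```

A call such as `save_image(image_url, Path("outputs/tea-shop.png"))` would then keep a local copy, where `image_url` stands for whatever URL your run returned.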
Related Models
- Alibaba Wan 2.5 Text-to-Image — A proven Wan text-to-image model for reliable, cost-stable AI image generation with a similar prompt-first workflow.
- ByteDance Seedream V4 Text-to-Image — A style-consistent text-to-image generator for posters, campaigns, and high-volume brand-friendly illustration batches.
- FLUX.2 Turbo Edit — A fast natural-language image editing model for precise image-to-image transformations, brand color control, and iterative creative revisions.
- Google Nano Banana Pro Edit — High-fidelity prompt-based image editing for composition-preserving changes, product visuals, and reliable on-image text handling.