Wan 2.6 Text to Image | High-Quality Text-to-Image API

alibaba /

Wan 2.6 Text-to-Image generates high-quality images from natural-language prompts with strong prompt adherence and clean composition. It supports multiple aspect ratios and size control, seed-based reproducibility, and flexible styles (photorealistic to illustrative) for ads, product shots, and social visuals. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.

$0.03 per run (about 33 runs per $1)

Examples

An extreme close-up documentary shot of a human face in brutal Arctic cold, eyelashes completely frozen and coated in thick ice crystals, frozen breath crystallizing in the air, skin slightly red from negative 50°C temperatures, hyper-realistic cinematic lighting, shallow depth of field, every frost particle sharply detailed, realistic cold blue color tones, shot on an ARRI Alexa 65 with a macro lens, natural film grain, Netflix-style documentary realism.

a small girl with black twin-tail hair, sitting with her legs drawn together in front of her, smoking a cigarette, angel wings attached to her back, gently fluttering, flat solid gray background, no gradient, uniform monochrome, 3D pixel art style, voxel art, blocky geometry, anime-style character design, stylized proportions, minimal facial detail, low-resolution yet three-dimensional pixels, minimalistic composition, quiet and subdued mood, slightly surreal atmosphere, cinematic framing, soft but gloomy lighting --ar 58:77 --video 1

Jumping wolf motif that is one colour. The wolf is in similar style as Jankovics Marcell's Fehérlófia. As the wolf body looks like as flames. the wolf, standing in a snowy mountain landscape, minimalist ink sketch style, black and white only, sharp eyes, calm but tense posture, hand-drawn animation look, no fur details, abstract form, high contrast, rough texture --ar 1:1

dark fantasy 1980s DVD screengrab of a crusader raising his sword in a traditional early middle ages church ar 3:2 --ar 1:1

A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography, medium shot, shallow depth of field, 35mm look, clean lines, natural shadows, soft highlights, cozy seating, neatly arranged tea bar, high detail Negative prompt: blurry, low-res, watermark, text, logo, cluttered background, overexposed, underexposed, distortion, fisheye, noise

A mix collage with rapper, diamond, concert, neons, scratch paper, lyrics on paper, racing cars, money, and girls with a futuristic vibe

README

Wan 2.6 Text-to-Image

Wan 2.6 Text-to-Image (/wan-2.6/text-to-image) is Alibaba’s text-to-image generation model for creating high-quality visuals from a single natural-language prompt. It’s built for practical creative workflows (concept art, product visuals, portraits, and stylized imagery) where you want strong prompt adherence plus flexible custom sizing.

Why it stands out

Fast, one-shot text-to-image generation: Generate an image in a single run for quick ideation and production workflows.
Custom width × height output: Set width and height directly (within the endpoint’s limits) to match banners, thumbnails, posters, or social formats.
Prompt expansion for better results: Enable prompt expansion to automatically enrich short prompts with useful detail for more coherent compositions.
Seeded iteration: Use a fixed seed to refine style and layout with more repeatable variations.

Parameters

Parameter	Description
prompt*	Text description of the image you want to generate.
width	Output width (within allowed limits).
height	Output height (within allowed limits).
enable_prompt_expansion	Toggle prompt expansion to enrich short prompts.
seed	Set a fixed seed for more repeatable iterations (-1 for random).
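
The parameters above can be collected into a request body before calling the API. The following is a minimal sketch, not the endpoint's documented schema: it assumes the body is a flat JSON object keyed by the parameter names in the table, that the asterisk on prompt marks it as required, and that enable_prompt_expansion defaults to off. The helper name build_payload is hypothetical.

```python
from typing import Any, Dict, Optional


def build_payload(
    prompt: str,
    width: Optional[int] = None,
    height: Optional[int] = None,
    enable_prompt_expansion: bool = False,  # assumed default, not documented here
    seed: int = -1,
) -> Dict[str, Any]:
    """Assemble a request body using the parameter names from the table."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")

    payload: Dict[str, Any] = {
        "prompt": prompt.strip(),
        "enable_prompt_expansion": enable_prompt_expansion,
        "seed": seed,  # -1 requests a random seed, per the table
    }
    # width/height are only sent when given; omitting them is assumed to
    # leave the endpoint's default size in effect.
    if width is not None:
        payload["width"] = int(width)
    if height is not None:
        payload["height"] = int(height)
    return payload
```

For example, build_payload("a red bicycle", 1024, 768, seed=7) yields a dictionary with all five keys, ready to be serialized with json.dumps.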

How to use

Write a clear prompt (subject + setting + style).
Choose width and height that match your target aspect ratio.
Turn on enable_prompt_expansion if your prompt is short or under-specified.
Set a seed if you want repeatable iterations (keep the same seed while you tweak the prompt).
Click Run, review the result, and iterate.
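
Steps 2 and 4 above can be sketched in code: derive a width and height from a target aspect ratio, then hold the seed fixed while the prompt changes. This is an illustration under stated assumptions. The 1280-pixel long side and the rounding to multiples of 16 are placeholders, not documented limits of this endpoint, and the function names are hypothetical.

```python
from typing import Dict, List, Tuple


def size_for_aspect_ratio(
    ratio_w: int,
    ratio_h: int,
    long_side: int = 1280,  # assumed target, not an endpoint limit
    multiple: int = 16,     # assumed rounding step
) -> Tuple[int, int]:
    """Return (width, height) close to ratio_w:ratio_h with the given long side."""
    if ratio_w <= 0 or ratio_h <= 0:
        raise ValueError("aspect ratio terms must be positive")
    longest = max(ratio_w, ratio_h)

    def snap(term: int) -> int:
        # Scale the ratio term, then round to the nearest allowed multiple.
        return max(multiple, round(term * long_side / longest / multiple) * multiple)

    return snap(ratio_w), snap(ratio_h)


def seeded_variants(
    prompts: List[str], seed: int, width: int, height: int
) -> List[Dict[str, object]]:
    """Build one request body per prompt, all sharing the same fixed seed."""
    if seed == -1:
        raise ValueError("-1 means random; pick a fixed seed for repeatable runs")
    return [
        {"prompt": p, "seed": seed, "width": width, "height": height}
        for p in prompts
    ]
```

With these defaults, size_for_aspect_ratio(16, 9) returns (1280, 720). Because of the rounding, ratios that do not divide evenly (such as 3:2) come out slightly off the exact ratio.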

Prompt tips

Start with subject + environment + style: “A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography.”
Add camera / composition when framing matters: “wide shot, shallow depth of field, 35mm film look.”
Keep instructions positive and specific (what you want to see, not what you fear).
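
The subject + environment + style pattern from the tips above can be made into a small helper when prompts are built programmatically. This is only a sketch of one way to join the pieces. The function name compose_prompt and its argument names are hypothetical, and the comma-separated layout simply mirrors the example prompts in the tips.

```python
def compose_prompt(
    subject: str,
    environment: str = "",
    style: str = "",
    camera: str = "",
) -> str:
    """Join prompt fragments in subject, environment, style, camera order."""
    if not subject.strip():
        raise ValueError("a subject is required")
    # Trim whitespace and trailing punctuation so fragments join cleanly.
    parts = [p.strip().rstrip(",. ") for p in (subject, environment, style, camera)]
    return ", ".join(p for p in parts if p)


prompt = compose_prompt(
    subject="A modern tea shop interior",
    environment="warm afternoon light",
    style="minimalist wood design, cinematic photography",
    camera="wide shot, shallow depth of field, 35mm film look",
)
```

Empty fragments are skipped, so the camera argument can be left out when framing does not matter.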

Pricing

$0.03 per generated image
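
For budgeting, the per-image price translates directly into batch costs. The sketch below assumes the flat $0.03 rate stated above applies to every image, with no volume discounts or other fees (neither is mentioned on this page). Decimal is used to avoid floating-point rounding in currency arithmetic.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.03")  # USD, from the pricing line above


def batch_cost(image_count: int) -> Decimal:
    """Total cost in USD for a number of generated images."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    return PRICE_PER_IMAGE * image_count


def images_within_budget(budget_usd) -> int:
    """How many whole images fit in a budget given in USD."""
    return int(Decimal(str(budget_usd)) // PRICE_PER_IMAGE)
```

Under these assumptions, 100 images cost $3.00 and a $1 budget covers 33 images.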

Notes

Output sizing is limited by the endpoint’s current constraints (for example, width/height bounds and aspect-ratio limits). If a size fails, reduce resolution or choose a more standard aspect ratio.
Enabling prompt expansion can improve quality for short prompts, but may add a little latency.
Returned image URLs may be time-limited—save outputs if you need long-term storage.
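
Two of the notes above lend themselves to a short sketch: stepping down to smaller sizes when a requested size is rejected, and saving an output locally before its URL expires. Both functions are illustrative. The 0.75 shrink factor, the multiple-of-16 rounding, and the number of fallback steps are assumptions rather than documented endpoint behavior, and the function names are hypothetical.

```python
import shutil
import urllib.request
from typing import List, Tuple


def fallback_sizes(
    width: int,
    height: int,
    steps: int = 3,
    factor: float = 0.75,  # assumed shrink factor per retry
    multiple: int = 16,    # assumed rounding step
) -> List[Tuple[int, int]]:
    """List progressively smaller sizes to try if the original size fails."""
    sizes = []
    w, h = width, height
    for _ in range(steps):
        # Shrink both sides, rounding down to the nearest allowed multiple.
        w = max(multiple, int(w * factor) // multiple * multiple)
        h = max(multiple, int(h * factor) // multiple * multiple)
        sizes.append((w, h))
    return sizes


def save_image(url: str, path: str) -> None:
    """Download a returned image URL to local storage."""
    with urllib.request.urlopen(url) as response, open(path, "wb") as out:
        shutil.copyfileobj(response, out)
```

Rounding down can shift the aspect ratio slightly at smaller sizes, so the fallback list is a starting point rather than an exact rescale.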

Related Models

Wan 2.5 Text-to-Image — A proven Wan text-to-image model for reliable, cost-stable AI image generation with a similar prompt-first workflow.
Seedream V4 Text-to-Image — A style-consistent text-to-image generator for posters, campaigns, and high-volume brand-friendly illustration batches.
FLUX.2 Turbo Edit — A fast natural-language image editing model for precise image-to-image transformations, brand color control, and iterative creative revisions.
Google Nano Banana Pro Edit — High-fidelity prompt-based image editing for composition-preserving changes, product visuals, and reliable on-image text handling.

Accessibility: The AI models used on this site are provided by third parties.