WAN 2.6 Text-to-Image generates high-quality images from natural-language prompts with strong prompt adherence and clean composition. It supports multiple aspect ratios and size control, seed-based reproducibility, and flexible styles (photorealistic to illustrative) for ads, product shots, and social visuals. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.

$0.03 per run · ~33 runs per $1

An extreme close-up documentary shot of a human face in brutal Arctic cold, eyelashes completely frozen and coated in thick ice crystals, frozen breath crystallizing in the air, skin slightly red from negative 50°C temperatures, hyper-realistic cinematic lighting, shallow depth of field, every frost particle sharply detailed, realistic cold blue color tones, shot on an ARRI Alexa 65 with a macro lens, natural film grain, Netflix-style documentary realism.

a small girl with black twin-tail hair, sitting with her legs drawn together in front of her, smoking a cigarette, angel wings attached to her back, gently fluttering, flat solid gray background, no gradient, uniform monochrome, 3D pixel art style, voxel art, blocky geometry, anime-style character design, stylized proportions, minimal facial detail, low-resolution yet three-dimensional pixels, minimalistic composition, quiet and subdued mood, slightly surreal atmosphere, cinematic framing, soft but gloomy lighting --ar 58:77 --video 1

Jumping wolf motif that is one colour. The wolf is in similar style as Jankovics Marcell's Fehérlófia. As the wolf body looks like as flames. the wolf, standing in a snowy mountain landscape, minimalist ink sketch style, black and white only, sharp eyes, calm but tense posture, hand-drawn animation look, no fur details, abstract form, high contrast, rough texture --ar 1:1

dark fantasy 1980s DVD screengrab of a crusader raising his sword in a traditional early middle ages church ar 3:2 --ar 1:1

A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography, medium shot, shallow depth of field, 35mm look, clean lines, natural shadows, soft highlights, cozy seating, neatly arranged tea bar, high detail Negative prompt: blurry, low-res, watermark, text, logo, cluttered background, overexposed, underexposed, distortion, fisheye, noise

A mix collage with rapper, diamond, concert, neons, scratch paper, lyrics on paper, racing cars, money, and girls with a futuristic vibe
Wan 2.6 Text-to-Image (/wan-2.6/text-to-image) is Alibaba's text-to-image generation model for creating high-quality visuals from a single natural-language prompt. It is built for practical creative workflows (concept art, product visuals, portraits, and stylized imagery) where you want strong prompt adherence plus flexible custom sizing.
Fast, one-shot text-to-image generation: Generate an image in a single run for quick ideation and production workflows.
Custom width × height output: Set width and height directly (within the endpoint's limits) to match banners, thumbnails, posters, or social formats.
Prompt expansion for better results: Enable prompt expansion to automatically enrich short prompts with useful detail for more coherent compositions.
Seeded iteration: Use a fixed seed to refine style and layout with more repeatable variations.
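Seeded iteration can be sketched as a set of request inputs that share one seed while only the prompt wording changes. This is a minimal illustration, not part of the WaveSpeedAI SDK: `seeded_variations` is a hypothetical helper, the field names are taken from the request examples on this page, and turning prompt expansion off so that the prompt text is the only thing that varies is an assumption about how expansion behaves.

```python
from typing import Dict, List, Union

Payload = Dict[str, Union[str, int, bool]]


def seeded_variations(
    base_prompt: str,
    modifiers: List[str],
    seed: int,
    size: str = "1024*1024",
) -> List[Payload]:
    """Build one request input per modifier, all sharing the same fixed seed."""
    if seed < 0:
        # -1 means "random" for this model, which would defeat the comparison.
        raise ValueError("use a fixed, non-negative seed for repeatable iterations")
    return [
        {
            "prompt": f"{base_prompt}, {modifier}",
            "size": size,
            # Assumption: expansion may rewrite the prompt, so keep it off here.
            "enable_prompt_expansion": False,
            "seed": seed,
        }
        for modifier in modifiers
    ]


batch = seeded_variations(
    "A modern tea shop interior",
    ["warm afternoon light", "overcast morning light"],
    seed=1234,
)
for payload in batch:
    print(payload["seed"], payload["prompt"])
```

Each dictionary can then be submitted as the JSON body of a separate run, so differences between outputs come from the prompt change rather than a new random seed.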
| Parameter | Description |
|---|---|
| prompt* | Text description of the image you want to generate. |
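| width | Output width (within allowed limits). The request examples on this page pass both dimensions as one `size` string such as `1024*1024`. |
| height | Output height (within allowed limits). See the note on `size` under width. |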
| enable_prompt_expansion | Toggle prompt expansion to enrich short prompts. |
| seed | Set a fixed seed for more repeatable iterations (-1 for random). |
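The request examples on this page send the dimensions as a single `size` string (`"1024*1024"`) rather than separate width and height fields. A small builder can bridge the two views. This is a sketch under stated assumptions: `build_input` is a hypothetical helper, the `size` string is assumed to be ordered width first and height second, and the endpoint's actual size limits are not checked because they are not listed here.

```python
from typing import Dict, Union

Payload = Dict[str, Union[str, int, bool]]


def build_input(
    prompt: str,
    width: int,
    height: int,
    seed: int = -1,
    enable_prompt_expansion: bool = False,
) -> Payload:
    """Assemble the JSON input for a Wan 2.6 Text-to-Image request."""
    if not prompt.strip():
        raise ValueError("prompt is required")
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive integers")
    return {
        "prompt": prompt,
        # Assumption: "size" is formatted as "<width>*<height>".
        "size": f"{width}*{height}",
        "enable_prompt_expansion": enable_prompt_expansion,
        "seed": seed,  # -1 requests a random seed
    }


payload = build_input("A cinematic shot of a city at sunset, soft golden light", 1024, 1024)
print(payload)
```

The resulting dictionary matches the shape of the JSON body used in the cURL, JavaScript, and Python examples.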
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/alibaba/wan-2.6/text-to-image with your input as JSON. The endpoint returns a prediction ID; poll the prediction endpoint until status changes to "completed", then read the output URL from data.outputs[0]. Examples for Wan 2.6 Text-to-Image follow in cURL, JavaScript, and Python.

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.6/text-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "enable_prompt_expansion": false,
    "seed": -1
  }'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].

// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules have no top-level await, so the call runs inside an async function.
async function main() {
  const result = await client.run("alibaba/wan-2.6/text-to-image", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "enable_prompt_expansion": false,
    "seed": -1
  });
  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "alibaba/wan-2.6/text-to-image",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "enable_prompt_expansion": False,  # Python boolean, serialized as JSON false
        "seed": -1,
    },
)
print(output["outputs"][0])  # → URL of the generated output

Wan 2.6 Text To Image is an Alibaba model for image generation, exposed as a REST API on WaveSpeedAI. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.6-text-to-image.
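The submit-then-poll flow described above can be sketched as a small standard-library helper. This is a minimal illustration, not WaveSpeedAI's client: `make_fetcher` and `wait_for_output` are hypothetical names, the `status` field is assumed to sit under `data` next to `outputs`, and the `"failed"` status value is an assumption, so check the API reference for the exact response shape.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

# Result endpoint as shown in the cURL example on this page.
RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def make_fetcher(request_id: str, api_key: str) -> Callable[[], Dict[str, Any]]:
    """Return a function that performs one GET against the result endpoint."""
    def fetch() -> Dict[str, Any]:
        req = urllib.request.Request(
            RESULT_URL.format(request_id=request_id),
            headers={"Authorization": f"Bearer {api_key}"},
        )
        with urllib.request.urlopen(req, timeout=30) as resp:
            return json.load(resp)
    return fetch


def wait_for_output(
    fetch_result: Callable[[], Dict[str, Any]],
    interval_s: float = 1.0,
    timeout_s: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until status is "completed", then return data.outputs[0]."""
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch_result().get("data", {})  # assumption: fields live under "data"
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumption: name of the failure status
            raise RuntimeError(f"prediction failed: {data}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction not completed after {timeout_s} s")
        sleep(interval_s)
```

In real use, something like `wait_for_output(make_fetcher(request_id, api_key))` would block until the image URL is available. Passing the fetch function in as a parameter keeps the loop testable without network access.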
Wan 2.6 Text To Image starts at $0.030 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
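The arithmetic behind the base price can be sketched as follows. It uses only the $0.030 starting figure quoted above, so the results are a lower bound on cost and an upper bound on run count; the function names are illustrative, and the real per-call charge is whatever the prediction record reports.

```python
from decimal import Decimal

BASE_PRICE_USD = Decimal("0.03")  # starting price per run quoted on this page


def max_runs_for_budget(budget_usd: str, price_per_run: Decimal = BASE_PRICE_USD) -> int:
    """Whole runs that fit in a budget when every run costs the base price."""
    return int(Decimal(budget_usd) // price_per_run)


def min_cost_for_runs(runs: int, price_per_run: Decimal = BASE_PRICE_USD) -> Decimal:
    """Lowest possible charge for a number of runs (base price only)."""
    return price_per_run * runs


print(max_runs_for_budget("1"))   # whole runs per dollar at the base price
print(min_cost_for_runs(100))     # minimum cost of 100 runs in USD
```

`Decimal` is used instead of floats so that amounts such as 0.03 are represented exactly and the division does not round unexpectedly.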
Key inputs: `prompt`, `size`, `seed`, `enable_prompt_expansion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.6-text-to-image.
Average end-to-end generation time on WaveSpeedAI is around 9 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Alibaba). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.