Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only
Home/Explore/Wan 2.7 Models/alibaba/wan-2.7/text-to-image-pro
text-to-image

text-to-image

Alibaba WAN 2.7 Pro

alibaba/wan-2.7/text-to-image-pro

Alibaba WAN 2.7 Text-to-Image Pro generates high-quality images up to 4K from text prompts with thinking mode for enhanced image quality. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input
width
height
2048 × 2048 px
Range: 512 - 8192
Enable thinking mode for enhanced reasoning and better image quality. Increases generation time.

Idle

Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom

Your request will cost $0.075 per run.

For $1 you can run this model approximately 13 times.

One more thing:

ExamplesView all

two young people eating dessert together, close-up shot, wide angle lens, exaggerated perspective, sitting at an outdoor table, feeding each other with spoons, playful expressions, summer vibe, bright sunlight, pastel umbrellas above, blue sky, casual candid moment, lifestyle photography, vibrant colors, high contrast, natural skin texture, modern editorial style, high detail
Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom
[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography

README

Alibaba Wan 2.7 Text-to-Image Pro

Alibaba Wan 2.7 Text-to-Image Pro (alibaba/wan-2.7/text-to-image-pro) is the professional tier of Alibaba's WanXiang 2.7 text-to-image generation model, supporting output resolutions up to 4K (4096×4096). Combined with built-in thinking mode for enhanced reasoning, it delivers higher-fidelity compositions ideal for print-ready assets, large-format posters, detailed product visuals, and any workflow where resolution and quality are the priority.

Why it stands out

  • Up to 4K resolution output Generate images up to 4096×4096 total pixels (512–8192 per dimension)—ideal for print, large-format displays, and high-DPI screens where standard 1024px output falls short. Aspect ratio: 1:8–8:1.

  • Thinking mode for smarter generation Built-in thinking mode enables the model to reason about prompt intent before generating, producing more coherent compositions and better prompt adherence.

  • Fast, one-shot text-to-image generation Generate an image in a single run for quick ideation and production workflows.

  • Custom size output Set output size directly (512–8192 per dimension) to match banners, thumbnails, posters, or social formats. Total pixels must be between 768×768 and 4096×4096, with aspect ratio between 1:8 and 8:1.

  • Seeded iteration Use a fixed seed to refine style and layout with more repeatable variations.

Parameters

ParameterDescription
prompt*Text description of the image you want to generate.
sizeOutput size in pixels (widthheight). Range: 512–8192 per dimension. Default: 10241024. Total pixels: 768×768–4096×4096. Aspect ratio: 1:8–8:1.
thinking_modeEnable thinking mode for enhanced reasoning and better image quality (default: true). Increases generation time.
seedSet a fixed seed for more repeatable iterations (-1 for random).

How to use

  1. Write a clear prompt (subject + setting + style).
  2. Choose a size that matches your target aspect ratio and resolution needs (e.g. 20482048, 40962048 for ultra-wide, 2048*4096 for tall posters).
  3. Leave thinking_mode enabled (default) for best quality, or disable it for faster generation.
  4. Set a seed if you want repeatable iterations (keep the same seed while you tweak the prompt).
  5. Click Run, review the result, and iterate.

Prompt tips

  • Start with subject + environment + style: "A modern tea shop interior, warm afternoon light, minimalist wood design, cinematic photography."
  • Add camera / composition when framing matters: "wide shot, shallow depth of field, 35mm film look."
  • Keep instructions positive and specific (what you want to see, not what you fear).
  • With thinking mode enabled, the model handles short or ambiguous prompts better—but detailed prompts still yield the best results.
  • For 4K outputs, include fine detail cues (textures, materials, lighting) to take full advantage of the higher resolution.

Pricing

  • $0.075 per generated image

Notes

  • Output size is 512–8192 pixels per dimension. Total pixels must be between 768×768 and 4096×4096, with aspect ratio between 1:8 and 8:1.
  • Thinking mode is enabled by default and improves quality, but adds some latency. Disable it if speed is the priority.
  • Higher resolutions (e.g. 4096×4096) will take longer to generate than standard sizes.
  • Returned image URLs may be time-limited—save outputs if you need long-term storage.

Related Models