WAN 2.5 Text-to-Image turns text prompts into AI-generated images with the WAN 2.5 model for on-demand image creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle

$0.03per run·~33 / $1

A Sumi-e Inspired Watercolor portrayal of warrior, blending traditional East Asian ink wash techniques with modern watercolor splashes. Use primarily red with accents of yellow for a minimalist yet expressive composition

Futuristic landmark tower with parametric façade, mirror metal + glass, sunrise above cloud sea, dramatic skyline, competition-grade visualization, realistic materials, cinematic depth

Summer Heatwave – 4-Scene Storyboard Frame 1 – Golden Beach Arrival Prompt: “Cinematic wide shot: a sunlit tropical beach with turquoise waves, palm trees swaying, golden sand glowing in the afternoon sun. A stylish young woman in a bright bikini walks toward the shore, her silhouette outlined by the sunlight.” Frame 2 – Racing the Waves Prompt: “Tracking side shot: the bikini woman runs joyfully along the shoreline, splashing water as waves crash behind her. Bright lens flare, playful energy, cinematic slow motion effect.” Frame 3 – Sunset Ride Prompt: “Dynamic cinematic shot: the woman riding a red convertible along a coastal highway, hair flowing in the wind, ocean waves glimmering beside her. The sky burns orange and pink with sunset.” Frame 4 – Night Glow Party Prompt: “Cinematic overhead shot: the beach at night, glowing lanterns and neon lights, the woman dancing barefoot in the sand surrounded by friends, laughter, tropical cocktails in hand, vibrant colors and festive atmosphere.”

Frame 1 – The Misty Temple Prompt: “A ruined ancient temple hidden in a foggy forest, moonlight filtering through broken stone pillars, giant statues half-buried in moss. Dark and mystical atmosphere, cinematic wide shot.” Frame 2 – The Warrior Appears Prompt: “Over-the-shoulder shot of a monkey warrior with golden eyes and fur, wearing battle-worn armor, holding a heavy staff across his back. The temple ruins glow faintly with candlelight, mist swirling around his silhouette, dramatic lighting.” Frame 3 – The Demon Awakens Prompt: “Low-angle shot of a colossal demon beast emerging from rubble, black scales glistening with crimson glow, tusks dripping firelight. Debris falling, dust clouds rising, cinematic epic fantasy scene.” Frame 4 – Clash of Power Prompt: “Slow-motion cinematic shot of the monkey warrior leaping with staff raised, striking against the demon’s horn. Sparks, shockwaves, temple walls cracking. Wide IMAX style composition, dynamic action freeze.” Frame 5 – Final Victory Prompt: “Cinematic wide shot: the monkey warrior standing on the fallen demon’s body, staff planted firmly, moonlight shining on his golden eyes. Broken temple pillars frame the scene, mist and blood on the ground, epic dark fantasy style.”

Scene 1 – The Mountain Awakening High atop a mist-shrouded mountain, an ancient stone statue of a monkey cracks open as golden light bursts forth. The Monkey King emerges, staff in hand, fur bristling with divine energy, eyes glowing with fire. Vast landscape of jagged peaks, swirling clouds, and flying cranes, cinematic wide shot, mythic and majestic. Scene 2 – The Demons’ Ambush In a bamboo forest lit by moonlight, monstrous demons with twisted horns and glowing eyes leap from the shadows. The Monkey King spins his golden staff, creating arcs of blazing light that cut through the darkness. Dynamic combat scene, cinematic motion, sparks and energy bursting in midair. Scene 3 – The Celestial Challenge On a heavenly battlefield above the clouds, celestial soldiers clad in golden armor march in formation, their spears glowing like lightning. The Monkey King, defiant and fearless, faces them atop a floating lotus platform, his staff growing to a colossal size. Epic scale, divine architecture in the background, thunder and fire in the skies. Scene 4 – The King’s Ascendance After the battle, the Monkey King rises above the battlefield, fur ablaze with golden flames, staff glowing like a pillar of the sun. A halo of divine light surrounds him as the heavens tremble. Cinematic climax, overwhelming mythic power, ultra-detailed, radiant and transcendent.

Scene 1 – The Forgotten Temple In a dense jungle lit by shafts of golden sunlight, an ancient stone temple lies half-buried in vines and moss. Strange glowing symbols flicker faintly across the stone, pulsing like a heartbeat. A lone explorer in futuristic armor approaches cautiously, their figure dwarfed by the monumental structure. Wide cinematic shot, rich detail, suspenseful atmosphere. Scene 2 – The Awakening Inside the temple’s heart, the explorer touches a crystalline relic set in a pedestal. Instantly, beams of radiant light erupt from the walls, and the temple begins to shift, stones floating into the air. A massive holographic star map unfolds, showing countless galaxies swirling in motion. Close dramatic shot, brilliant cosmic colors, sense of awe and revelation. Scene 3 – The Trial of Light The star map opens into a portal, pulling the explorer into a vast cosmic arena made of floating fragments of worlds. Colossal guardians of light and shadow emerge, circling the explorer like living constellations. The hero battles through swirling storms of energy and collapsing platforms. Wide dynamic shot, surreal scale, epic tension, cosmic grandeur. Scene 4 – The Ascension Victorious, the explorer floats at the center of the arena as the guardians dissolve into radiant energy, merging with their body. They transform into a luminous being, wings of starlight unfolding as they gaze upon a newborn galaxy forming before them. Cinematic final shot, breathtaking, transcendent, overwhelming beauty of creation.

A high-impact and cinematic push-in shot of a Tyrannosaurus Rex roaring ferociously on a chaotic battlefield. The camera dollys in rapidly, slightly tilting up to emphasize the creature's immense size and power. The T-Rex stomps the ground, causing a violent screen shake, while its deafening roar sends a shockwave through the air. As it moves, debris from explosions fall down around it, with fires flickering and growing in the background. Dramatic sunlight rays pierce through the stormy clouds, casting a powerful lens flare, and dust particles float in the air as a raw and chaotic atmosphere envelops the entire scene. A dynamic shot of a lone samurai warrior running at high speed through a field of cosmos flowers. The camera, in a fast-paced tracking shot, follows the subject from behind, getting low to the ground. The warrior's hair and clothes are whipping dramatically in the wind. As they run, the flowers and their petals fly up and swirl around the character, creating a dreamy and chaotic visual. The sun is setting behind them, casting a warm glow and lens flare that enhances the epic and cinematic feel of the scene.
![Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom](https://static.wavespeed.ai/examples/7dda39ba73ae4767890a930a4706e648/1.png)
Lookbook photo of a model wearing a [dark leather bomber jacket], minimalist studio, soft side light, confident pose, magazine cover aesthetic, clean backdrop, subtle film grain, high fashion. With a "Fashion Magazine" book name in art word style at the bottom
![[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography](https://static.wavespeed.ai/examples/eabe7de4f93b426ab342e7fb6142f62e/1.png)
[High-end Wireless Headphones], centered on pure white background, studio high-key lighting, crisp hard shadow, commercial packshot, 35mm perspective, ultra-sharp details, subtle floor reflection, dust-free, 8k, realistic product photography

Scandinavian living room, pale wood floor, off-white walls, furniture: a light gray sectional sofa, a minimalist round coffee table, and a simple black floor lamp, large window natural light, balanced composition, architectural wide-angle render, tidy styling, cozy and airy

A full-body shot of an Asian model with sharp, defined facial features and wet, slicked-back hair, her gaze is firm and direct. She is wearing an asymmetrical, sculptural, heavy-fabric pure white dress. The background is a minimalist concrete architecture with harsh light and shadow play. Dramatic, hard lighting, like intense afternoon sun, creates clean, long shadows. In the style of Peter Lindbergh, shot on a Hasselblad medium format camera, black and white photography, fine-grained, hyperrealistic, 8K.

A fit man, mid-workout, is captured in a dynamic shot as he drinks water. Sweat glistens on his muscular physique, prominently highlighting his defined abs and biceps. The water he's drinking is dramatically rendered, showing the splash and movement as it enters his mouth, conveying refreshment and exertion.

An exhausted ballerina in a haute couture gown, sitting in an empty subway car, her pointe shoes resting on the seat beside her. The city's nightscape rushes by outside the window. The atmosphere is quiet and lonely after a performance. In the style of candid documentary photography, lit by the carriage's fluorescent lights, cool color palette, photorealistic detail, 8K.

An astronaut in a spacesuit walking on the red desert of Mars, a giant dust storm rising behind, Earth is a tiny blue dot in the vast black sky. Cinematic, wide-angle shot, 4K, Interstellar movie style.

A professional man in his 40s, neatly groomed beard, wearing a tailored gray suit, standing confidently in a modern office lobby, realistic corporate headshot, clean lighting, natural expression.

A young businessman standing near a modern glass building, wearing a dark suit and tie, looking confident, golden hour lighting, realistic fashion photography, urban city background, professional but natural style.

A young male model standing in front of a minimalist concrete wall, wearing oversized streetwear hoodie, cargo pants, and white sneakers, hands in pockets, casual but stylish pose, editorial fashion photography, hyper realistic details

A group of friends laughing together at an outdoor café in the city, one wearing sunglasses, another holding a smartphone, stylish but casual outfits, natural light, authentic street photography style.

Close-up portrait of a model whose face is partially covered in flowing liquid metal or an iridescent, second-skin-like substance. She has otherworldly, light purple eyes and stares directly into the camera. The background is completely blurred out, leaving only a soft halo of light. The lighting is even and ethereal, as if from a bioluminescent source. Inspired by the style of Nick Knight, the image emphasizes surreal textures and subtle color gradients, exceptionally sharp, with breathtaking detail, 16K.

A fair-skinned model with classical beauty, lounging on a velvet chaise lounge, surrounded by old books and withered roses. She is wearing a baroque-style lace gown, her expression is languid and contemplative. The scene is a dim, old library, with a single stream of Rembrandt-style light from a side window illuminating her face and figure. Composition inspired by a John William Waterhouse painting, rich in narrative. The overall tones are deep and heavy, with strong chiaroscuro, creating an oil painting texture and detail.
WAN 2.5 is a cutting-edge text-to-image model on Cloud’s DashScope. It generates high-quality, detailed images directly from text prompts and supports multiple output resolutions.
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Wan 2.5 Text To Image below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/text-to-image" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1024*1024",
"enable_prompt_expansion": false,
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("alibaba/wan-2.5/text-to-image", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1024*1024",
"enable_prompt_expansion": false,
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"alibaba/wan-2.5/text-to-image",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1024*1024",
"enable_prompt_expansion": false,
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputWan 2.5 Text To Image is a Alibaba model for image generation, exposed as a REST API on WaveSpeedAI. WAN 2.5 Text-to-Image turns text prompts into AI-generated images with the WAN 2.5 model for on-demand image creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.5-text-to-image.
Wan 2.5 Text To Image starts at $0.030 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `size`, `seed`, `negative_prompt`, `enable_prompt_expansion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-wan-2.5-text-to-image.
Average end-to-end generation time on WaveSpeedAI is around 12 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Alibaba). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.