WaveSpeed.ai

WAN 2.2 Text-To-Image Realism

wavespeed-ai/wan-2.2/text-to-image-realism

WAN 2.2 delivers ultra-realistic text-to-image generation, converting prompts into photorealistic images with high fidelity and detail. It is offered as a ready-to-use REST inference API with no cold starts and affordable pricing.

Input

width × height: 1280 × 720 px (range: 256 - 1536 per dimension)
If set to true, the request waits until the result has been generated and uploaded before returning, so the result is delivered directly in the response. This property is only available through the API.
If enabled, the output is encoded as a BASE64 string instead of a URL. This property is only available through the API.
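
Since the output can arrive either as a URL or as a BASE64 string, client code has to handle both shapes. The following is a minimal sketch under stated assumptions: the helper name `decode_output` is hypothetical, and the exact form of the BASE64 value (bare string or `data:` URI) is not specified on this page, so both are accepted.

```python
import base64
from typing import Optional


def decode_output(value: str) -> Optional[bytes]:
    """Return image bytes for a BASE64 output, or None when the output is a URL.

    Assumption: a BASE64 output is either a bare base64 string or a
    "data:<mime>;base64,<payload>" URI. The page does not say which.
    """
    if value.startswith(("http://", "https://")):
        # URL output: the caller downloads the file separately.
        return None
    if value.startswith("data:") and "," in value:
        # Drop the "data:image/...;base64," prefix if one is present.
        value = value.split(",", 1)[1]
    # validate=True raises binascii.Error on characters outside the alphabet.
    return base64.b64decode(value, validate=True)
```

A caller would write the returned bytes to a `.jpeg` or `.png` file, or fetch the URL when the function returns `None`.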

Your request will cost $0.025 per run.

For $1 you can run this model approximately 40 times.

Examples

A group of four women are seated around a wooden picnic table outdoors at a backyard gathering. The woman in the foreground, light-skinned and young adult, has shoulder-length light brown hair and a friendly, smiling expression. She's wearing a white, sleeveless top with small, pearl-like embellishments on the shoulders.  The background shows several people in light-colored clothing, and a long wooden table with wine bottles, glasses, candles, and food on it. The setting is a backyard at twilight, with trees and a string of outdoor lights creating a warm ambiance.  A large white paper lantern hangs above the gathering. The composition is casual, focusing on the smiling woman seated at the table.  The perspective is from slightly above and a bit to the left of the woman's position. The lighting is soft, warm, and ambient, highlighting the faces and creating a welcoming atmosphere.  Colors are muted and natural tones with pops of light from the candles and lights.  The overall style is casual and social, evoking a friendly backyard dinner party.
action shot of a teenage boy skateboarding on the street, wearing a loose hoodie and cargo pants. the photo captures him mid-air during a jump, full of energy. background is a city graffiti wall, dynamic composition, slight motion blur. street photography style, wide-angle lens, bright sunlight, high contrast, vivid colors.
close-up portrait of an elegant 30s man with gold-rimmed glasses, sitting in a vintage study bathed in afternoon sunlight. he is holding a thick old book, looking directly at the camera with a focused gaze. dust particles dancing in the sunbeam. warm and grainy atmosphere, film photography aesthetic, shot on a Leica camera, soft light, visible skin texture, shallow depth of field.
(masterpiece, best quality, cinematic lighting, absurdres), 1boy, handsome young man, short black hair, serious expression, wearing a black hoodie and cargo pants, leaning against a graffiti wall on a city street at night, neon lights, rim light, detailed clothing texture, dynamic angle, full body shot
(masterpiece, best quality, ultra-detailed, photo-realistic:1.2), 1girl, a beautiful korean woman, long wavy brown hair, soft smile, looking at viewer, wearing a cozy beige sweater, sitting at a wooden table in a sunlit cafe, holding a coffee cup, detailed eyes, beautiful detailed face, soft lighting, depth of field, bokeh, window light
(masterpiece, best quality, 8k, ultra-detailed), 1girl, beautiful detailed face, long hair blowing in the wind, wearing a simple white one-piece dress, standing on a rooftop at sunset, (golden hour lighting:1.3), lens flare, dramatic shadows, looking into the distance, melancholic atmosphere, from side.
portrait of a kind, smiling elderly Chinese grandmother with wrinkles. she is sitting on a wicker chair in her courtyard. Sunlight filters through grapevine leaves, casting dappled light on her weathered face. documentary photography style, emphasizing realism, every wrinkle and spot on her skin is clearly visible, warm and soulful eyes, rustic texture.
masterpiece, award-winning photograph, ultra-realistic, (photorealistic:1.4), 8K, RAW photo, extremely detailed, soulful and poignant. An elderly 75-year-old Japanese woodworker, his weathered face a topographical map of his life with deep wrinkles, intensely focused yet kind eyes shining in the dim light. He wears a faded indigo traditional samue, its fabric softened by countless washes, cuffs frayed. He is hunched over a heavy, century-old keyaki wood workbench, scarred with cuts and littered with wood shavings. His gnarled, skillful hands, calloused and stained, are carefully carving an intricate pattern onto a piece of fragrant hinoki cypress wood with a family heirloom chisel, its blade gleaming. Hand-forged tools hang neatly on a pegboard wall behind him, a small shinto shrine sits in a corner. The scene is solely illuminated by a single, unshielded incandescent bulb, casting a pool of intense, warm light and long, dramatic shadows, highlighting the rich texture of the wood, the weathered skin, and the dust motes dancing in the golden beam. Macro shot focusing on his hands and the carving tool, extremely shallow depth of field. Shot on a Canon EOS R5 with a 100mm f/2.8L macro lens.
Potential Pulitzer Prize-winning documentary photograph, raw emotion, urban poetry, masterpiece, (photorealistic:1.3), Kodak T-MAX 400 film aesthetic. A young, androgynous street performer, around 22, with a pale face that accentuates their magenta hair peeking from under a pilled wool beanie. Their long, nimble fingers, calloused at the tips from years of playing, a tarnished silver ring on their thumb. They sit on a graffiti-covered crate on a gritty NYC subway platform, wearing a worn-out denim jacket over a hoodie. With eyes closed, they draw a hauntingly beautiful melody from a weathered cello, its body covered in cracks and scratches, the sound cutting through the station's ambient noise. The air smells of ozone, damp concrete, and faint fast food. Behind them, a tattered Broadway show poster peels from the tiled wall. You can almost feel the low rumble of the approaching train vibrating through the platform. A dramatic interplay of light: the sickly, greenish flicker of overhead fluorescent tubes versus the warm, golden, cinematic beam from the approaching train's headlight. Shot on a Leica M6 with a 35mm Summicron lens.
An artistic, cover-worthy photograph for a culinary magazine like 'Le Chef', masterpiece, (photorealistic:1.3), ultra-detailed. A 45-year-old male French pâtissier, the quintessential perfectionist with an intensely focused gaze, a faint dusting of cocoa powder on one eyebrow. He wears a starched, crisp white chef's uniform and a pristine apron, complete with a tall toque blanche. He stands at his immaculate stainless steel pastry station, a marble slab for chocolate work gleaming beside him. Holding his breath, he uses a pair of delicate tweezers to place a single, perfect, dew-kissed raspberry atop an elaborate dessert sculpture of chocolate mousse, mirror glaze, and hair-thin spun sugar. His rock-steady hands are a stark contrast to the dessert's fragile beauty. A focused, warm, golden light from the overhead heat lamp illuminates the pass, making the dessert look irresistible and catching a single, tiny bead of sweat on the chef's temple. The background is a softly blurred high-end kitchen with hanging copper pots.
An authentic and joyful commercial lifestyle photograph for an artisan brand, masterpiece, (photorealistic:1.3), warm and inviting, ultra-detailed, 8K. A charming female potter in her early 30s, her hair tied in a messy bun with a headband, a single, endearing smudge of dry clay on her cheek. Her smile is genuine and infectious. She wears a practical canvas apron over a comfortable linen shirt. She stands proudly in her beautiful, sun-drenched pottery studio. The shelves behind her are artfully arranged with her own handcrafted ceramics – mugs, bowls, and vases in earthy tones. Warm, golden hour sunlight streams through a large window, creating a beautiful atmospheric haze and making dust particles sparkle in the air. She is holding her favorite, newly-fired mug in both hands, presenting it to the camera like a treasure, her expression radiating the pure joy of creation and inviting the viewer to share in her passion.
An atmospheric and deeply heartwarming photograph celebrating friendship and communal joy, masterpiece, (photorealistic:1.3), ultra-detailed, 8K. A diverse group of 4-5 friends in their early 30s are gathered around a low table on the 'engawa' (wooden veranda) of a traditional Tokyo house on a cool evening. A string of warm, glowing Edison bulbs is strung up above them. In the center of the table, a communal pot of sukiyaki (hot pot) is simmering, sending up inviting clouds of steam, revealing tofu, shiitake mushrooms, and marbled beef. Everyone is engaged, either reaching with chopsticks into the pot or raising their glasses in a toast, their faces illuminated with genuine laughter and relaxed happiness. One woman is seen affectionately placing a piece of food into her friend's bowl. The scene is warmly lit by the steam from the hot pot and the overhead string lights, casting a soft, flattering glow on everyone and creating an atmosphere of intimate, cozy camaraderie.

README

Wan 2.2 Text-to-Image Realism

Generate photorealistic images from detailed text descriptions with Wan 2.2 Realism. This specialized model excels at creating lifelike scenes, authentic human subjects, and natural environments — perfect for when you need images that look like real photographs.

Why It Looks Great

  • Photorealistic focus: Optimized specifically for realistic, photograph-like outputs.
  • Detailed human rendering: Excels at natural skin tones, expressions, and group compositions.
  • Custom dimensions: Precise control over width and height for any aspect ratio.
  • High resolution support: Generate images at 1280×720 and beyond; the input form accepts 256 to 1536 pixels for each of width and height.
  • Prompt Enhancer: Built-in tool to refine and expand your descriptions automatically.
  • Reproducible results: Use the seed parameter to recreate exact outputs or explore variations.

Parameters

| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Detailed text description of the realistic image you want to generate. |
| size | No | Custom dimensions with separate width and height controls. |
| width | No | Output width in pixels (e.g., 1280). |
| height | No | Output height in pixels (e.g., 720). |
| seed | No | Random seed for reproducibility. Use -1 for random. |
| output_format | No | Output file format: jpeg or png. Default: jpeg. |
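
To make the parameter list concrete, here is a minimal sketch of a client-side helper that validates these values and assembles a JSON-ready dictionary. The function name `build_payload` is hypothetical. The 256 to 1536 limits come from the input form on this page, and sending `width` and `height` as separate keys is an assumption about the wire format, which the page does not spell out.

```python
from typing import Any, Dict

# Limits taken from the input form ("Range: 256 - 1536").
MIN_SIDE, MAX_SIDE = 256, 1536
ALLOWED_FORMATS = ("jpeg", "png")


def build_payload(
    prompt: str,
    width: int = 1280,
    height: int = 720,
    seed: int = -1,  # -1 asks for a random seed
    output_format: str = "jpeg",
) -> Dict[str, Any]:
    """Validate the documented parameters and return a JSON-ready dict."""
    if not prompt.strip():
        raise ValueError("prompt is required")
    for name, value in (("width", width), ("height", height)):
        if not MIN_SIDE <= value <= MAX_SIDE:
            raise ValueError(
                f"{name} must be between {MIN_SIDE} and {MAX_SIDE}, got {value}"
            )
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"output_format must be one of {ALLOWED_FORMATS}")
    # Assumption: width and height travel as separate keys.
    return {
        "prompt": prompt,
        "width": width,
        "height": height,
        "seed": seed,
        "output_format": output_format,
    }
```

Reusing the same prompt, dimensions, and a fixed seed (any value other than -1) is what the table describes as the way to reproduce a result.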

How to Use

  1. Write your prompt — describe the scene in detail, including people, setting, lighting, and atmosphere.
  2. Use Prompt Enhancer (optional) — click to automatically enrich your description.
  3. Set dimensions — adjust width and height sliders to your desired resolution.
  4. Set seed (optional) — use -1 for random, or a specific number to reproduce results.
  5. Choose output format — select jpeg for smaller files or png for higher quality.
  6. Run — click the button to generate.
  7. Download — preview and save your realistic image.
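
The steps above describe the web form. The same run could be driven through the REST API mentioned on this page, roughly as sketched below. Everything about the transport here is an assumption to be checked against the provider's API reference: the base URL is passed in by the caller, the model path is appended to it, and Bearer-token authentication is assumed. Only the model path itself is taken from this page.

```python
import json
import urllib.request
from typing import Any, Dict

# Model path as shown on this page.
MODEL_ID = "wavespeed-ai/wan-2.2/text-to-image-realism"


def build_request(
    api_base: str, api_key: str, payload: Dict[str, Any]
) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the JSON payload.

    Assumptions: the endpoint is <api_base>/<MODEL_ID> and the API key is
    sent as a Bearer token.
    """
    url = f"{api_base.rstrip('/')}/{MODEL_ID}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(request: urllib.request.Request, timeout: float = 60.0) -> Dict[str, Any]:
    """Send the request and parse the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from `submit` makes it possible to inspect or log the request before any network call is made.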

Pricing

Flat rate per image generation.

| Output | Cost |
|---|---|
| Per image | $0.025 |

Examples

| Images Generated | Total Cost |
|---|---|
| 1 | $0.025 |
| 10 | $0.25 |
| 40 | $1.00 |
| 100 | $2.50 |
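
The totals above are simply the image count multiplied by the flat $0.025 rate. A small sketch of that arithmetic follows; the helper names are hypothetical, and `Decimal` is used so that fractional cents do not pick up floating-point rounding errors.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.025")  # flat rate in USD, from the pricing table


def total_cost(images: int) -> Decimal:
    """Cost in USD of generating the given number of images."""
    if images < 0:
        raise ValueError("images must be non-negative")
    return PRICE_PER_IMAGE * images


def images_for_budget(budget_usd: str) -> int:
    """Number of whole images a budget covers, e.g. images_for_budget("1")."""
    return int(Decimal(budget_usd) // PRICE_PER_IMAGE)
```

With these helpers, a $1 budget works out to 40 images, matching the figure quoted on the page.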

Best Use Cases

  • Lifestyle & Stock Photography — Generate authentic-looking lifestyle scenes and stock imagery.
  • Group Portraits — Create realistic multi-person compositions with natural interactions.
  • Environmental Scenes — Produce believable outdoor settings, gatherings, and events.
  • Marketing & Advertising — Generate photorealistic visuals for campaigns without photoshoots.
  • Concept Visualization — Visualize realistic scenarios for presentations and pitches.

Example Prompts

  • "A group of four women are seated around a wooden picnic table outdoors at a backyard gathering. The woman in the foreground, light-skinned and young adult, has shoulder-length light brown hair and a friendly, smiling expression. She's wearing a white, sleeveless top."
  • "Professional headshot of a middle-aged businessman in a navy suit, soft studio lighting, neutral gray background, confident expression"
  • "Family enjoying breakfast in a sunny modern kitchen, natural morning light through windows, warm and authentic atmosphere"
  • "Two colleagues having a conversation in a contemporary office space, natural poses, professional but relaxed mood"
  • "Street photographer capturing city life, candid moment, golden hour lighting, urban background with bokeh"

Pro Tips for Best Results

  • Be extremely detailed — describe physical features, clothing, expressions, and positioning.
  • Include lighting details — "natural sunlight", "soft studio lighting", "golden hour".
  • Specify skin tones, ages, and distinguishing features for accurate human rendering.
  • Describe the environment and background to ground the scene in reality.
  • Use landscape dimensions (1280×720) for group scenes, portrait (720×1280) for individual shots.
  • The more specific your prompt, the more realistic and controlled the output.
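
One way to apply these tips consistently is to assemble prompts from named parts and pick the orientation from the number of subjects. The sketch below is illustrative only: `compose_prompt` and `pick_dimensions` are hypothetical helpers, and the comma-joined layout is just one reasonable convention, not a format the model requires.

```python
from typing import Iterable, Tuple


def compose_prompt(
    subject: str,
    details: Iterable[str] = (),
    environment: str = "",
    lighting: str = "",
    style: str = "",
) -> str:
    """Join the non-empty prompt parts into one comma-separated description."""
    parts = [subject, *details, environment, lighting, style]
    return ", ".join(part.strip() for part in parts if part and part.strip())


def pick_dimensions(num_people: int) -> Tuple[int, int]:
    """Landscape (1280x720) for group scenes, portrait (720x1280) for one person."""
    return (1280, 720) if num_people > 1 else (720, 1280)
```

For example, `compose_prompt("portrait of a potter", ["clay smudge on cheek"], environment="sunlit studio", lighting="golden hour")` produces a single description covering subject, distinguishing features, environment, and lighting, and `pick_dimensions(1)` gives the portrait size for that individual shot.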

Notes

  • This model is optimized for realism — for artistic or stylized outputs, consider other models.
  • Higher detail in prompts generally produces more accurate and realistic results.
  • Generation time may vary based on resolution and current queue load.
  • For multi-person scenes, describe each person's position and appearance clearly.