google/gemini-2.5-flash-image/text-to-image

Google Gemini 2.5 Flash Image is an image generation and editing model with advanced features and creative control.

If set to true, the function waits for the image to be generated and uploaded before returning, so the image is available directly in the response. This property is only available through the API.
If enabled, the output is encoded as a BASE64 string instead of being returned as a URL. This property is only available through the API.
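
Because the output can arrive either as a URL or as a BASE64 string, client code usually has to handle both shapes. The following is a minimal sketch under stated assumptions: the function name `output_to_bytes` is hypothetical, the output is assumed to be a single string, and the optional `data:` URI prefix handling is a guess at a common convention rather than documented behavior of this API.

```python
import base64
from urllib.request import urlopen


def output_to_bytes(output: str) -> bytes:
    """Return raw image bytes from an output that is a URL or a BASE64 string.

    Hypothetical helper: the shape of the real API response is an assumption.
    """
    # URL output: download the image that was uploaded by the service.
    if output.startswith(("http://", "https://")):
        with urlopen(output) as resp:
            return resp.read()

    # Assumed convention: tolerate a "data:image/png;base64," style prefix.
    if output.startswith("data:"):
        output = output.split(",", 1)[1]

    # BASE64 output: decode strictly so malformed input raises an error
    # instead of being silently truncated.
    return base64.b64decode(output, validate=True)
```

Writing the result to disk is then a single call such as `Path("image.png").write_bytes(output_to_bytes(output))`, where the file name and extension are again only illustrative.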

Your request will cost $0.038 per run.

For $1 you can run this model approximately 26 times.
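
The run count follows from dividing the budget by the per-run price and rounding down. A small sketch of that arithmetic, assuming a flat $0.038 per run with no discounts, taxes, or minimum charges (the helper names are illustrative only):

```python
from decimal import Decimal

# Per-run price in USD as quoted on this page.
COST_PER_RUN = Decimal("0.038")


def runs_for_budget(budget_usd: str) -> int:
    """Whole number of runs a budget covers at the quoted per-run price."""
    # Floor division drops the partial run that the budget cannot pay for.
    return int(Decimal(budget_usd) // COST_PER_RUN)


def cost_of_runs(n: int) -> Decimal:
    """Total cost in USD of n runs at the quoted per-run price."""
    return COST_PER_RUN * n


print(runs_for_budget("1"))   # 26
print(cost_of_runs(100))      # 3.800
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.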

Examples

A gentle and soft digital watercolor illustration of a young woman seen from behind, holding a large, beautiful bouquet of wildflowers. She is standing in a vast field of tall grass at sunset. The colors are soft and blended, with translucent layers. The sky is a wash of pastel pink, orange, and lavender. The focus is on the delicate textures of the flowers and the soft, flowing lines of her hair and dress. The mood is serene, romantic, and deeply calming.
A young woman in her mid-20s sitting by a café window, wearing a beige sweater, holding a cup of coffee, natural morning sunlight on her face, candid street photography style.
A cheerful group of college friends taking a selfie in a park, casual outfits, green grass and blue sky behind them, authentic lifestyle photo.
A jogger running along a riverside path in the early morning, wearing sportswear, light fog over the water.
Epic fantasy art of a majestic white dragon with pearlescent scales, landing gracefully in a colossal underground cavern. The rider, a female elf in polished silver armor, dismounts. The cavern walls are lined with giant, glowing amethyst and quartz crystals that emit a soft, multi-colored ambient light. A serene underground lake reflects the shimmering crystals. The scene is peaceful and awe-inspiring, showcasing a hidden sanctuary. The crystals' light reflects beautifully off the dragon's scales and the elf's armor. Highly detailed, focusing on the interplay of light and reflective surfaces.
An ultra-photorealistic shot, capturing a lazy ginger cat sleeping peacefully on a stack of old books next to a window. Soft morning sunlight streams in, illuminating dust motes dancing in the air and highlighting the fine texture of the cat's fur and the worn paper of the books. A half-empty ceramic mug of tea sits nearby. The focus is sharp on the cat, with the rest of the cozy room gently blurred in the background. Shot with a Canon EOS R5, 50mm f/1.8 lens, natural lighting, serene and quiet mood.
A beautiful anime scene in the style of Studio Ghibli. A young girl in a simple summer dress sits alone on a weathered wooden bench at a rural bus stop in the Japanese countryside. It's late afternoon, and the sky is a soft orange. The bus stop is surrounded by lush, overgrown greenery and towering, vibrant green camphor trees. A single cicada is visible on the signpost. The atmosphere is peaceful, slightly nostalgic, and filled with the quiet hum of summer. Beautifully detailed hand-drawn background, soft, warm color palette.
An interior scene in a cozy, cluttered attic art studio, Hayao Miyazaki aesthetic. Sunlight filters through a large, round window, illuminating a wooden desk covered in art supplies: watercolor palettes, jars of brushes, scattered sketches, and a half-finished painting of a landscape. A warm cup of tea steams gently. The room is filled with books, hanging plants, and interesting trinkets. The feeling is one of creative solitude and peaceful messiness. Rich details, warm and inviting light.
A cinematic film still of a person walking alone down a dirt path in a dense, misty forest at dawn. Sunbeams (volumetric light) cut through the thick fog and canopy of tall pine trees, creating a magical, ethereal effect. The person is a small figure in the grand landscape, wearing a warm coat. The color grading is slightly desaturated with cool, muted tones, evoking a sense of quiet introspection and profound peace. Shot on Kodak Portra 400 film for a soft, grainy texture.
A vibrant, high-angle shot of a Solarpunk city sanctuary. Buildings are constructed with smooth, white bio-concrete and flowing organic shapes, seamlessly integrated with vertical gardens and cascading waterfalls. On a massive rooftop terrace, a diverse community of people tends to a lush hydroponic farm under a geodesic glass dome. Elegant, petal-shaped solar panels track the sun. Small transport drones hum quietly, carrying produce. The lighting is bright, clean, and optimistic, conveying a sense of community and harmony. Hyper-detailed, 8K

README

Gemini 2.5 Flash Image is Google’s state-of-the-art image generation and editing model. It is a new variant of the Gemini 2.5 family, designed for fast, conversational, multi-turn creative workflows. The model is available to developers through the Gemini API, Google AI Studio, and Vertex AI.

Key Features

  • Native Image Generation and Editing: Gemini 2.5 Flash Image is a multimodal model that natively understands and generates images. This allows for a seamless, unified workflow for creating and editing visuals.
  • Multi-image Fusion: This powerful feature allows you to combine multiple input images into a single, cohesive, new visual. For example, you can integrate a product into a new scene or restyle a room by merging images of different furniture and decor.
  • Character and Style Consistency: A significant advancement is the ability to maintain a consistent character, object, or style across multiple prompts and images. This is essential for storytelling, branding, and generating a series of cohesive assets without needing time-consuming fine-tuning.
  • Conversational Editing: The model enables precise, targeted edits using natural language. You can make specific changes like blurring a background, removing an object, altering a subject’s pose, or colorizing a black-and-white photo by simply describing the desired outcome.
  • Visual Reasoning: Gemini 2.5 Flash Image benefits from the Gemini model’s deep world knowledge. It can go beyond simple photorealism to perform complex tasks that require genuine understanding, such as interpreting hand-drawn diagrams, assisting with educational queries, and following multi-step instructions.
  • SynthID Watermarking: To promote responsible AI and transparency, all images created or edited with Gemini 2.5 Flash Image are embedded with an invisible digital watermark from SynthID. This watermark helps identify the content as AI-generated or edited.