Vidu Q3 और Q3 Pro मॉडल पर 50% छूट · केवल WaveSpeedAI | 20 मई – 2 जून

Nano Banana 2 Text to Image

google /

Google Nano Banana 2 (Gemini 3.1 Flash Image) delivers Pro-quality image generation at Flash speed with 512px to 4K resolution support. Features include improved text rendering, character consistency for up to 5 characters, and real-world knowledge integration. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image
Input
If enabled, the model will use web search to enhance the generation with real-time information.
If enabled, the model will use image search to enhance the generation with real-time information.
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. Due to variability in Google’s underlying compute resources, inference times can fluctuate significantly. As a result, synchronous requests may hit timeouts.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

A young woman with short red hair and round glasses wearing a dark green trench coat, standing in a rainy Tokyo street at night. Neon signs reflected in puddles. Cinematic still, anamorphic lens look.

$0.07per run·~14 / $1

Next:

ExamplesView all

A young woman with short red hair and round glasses wearing a dark green trench coat, standing in a rainy Tokyo street at night. Neon signs reflected in puddles. Cinematic still, anamorphic lens look.

A young woman with short red hair and round glasses wearing a dark green trench coat, standing in a rainy Tokyo street at night. Neon signs reflected in puddles. Cinematic still, anamorphic lens look.

Related Models

README

Google Nano Banana 2 Text-to-Image

Nano Banana 2 Text-to-Image (Gemini 3.1 Flash Image) is Google’s lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts. It transforms words into expressive, realistic images with remarkable clarity, composition, and style diversity — all within seconds.

Why Choose This?

  • Fast generation Optimized for speed while maintaining high visual quality.

  • Multi-resolution support Output in 1K, 2K, or 4K resolution based on your needs.

  • Flexible aspect ratios Multiple options including 1:1, 3:2, 2:3, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, and 8:1.

  • Cinematic quality Excels at photorealistic scenes with atmospheric lighting and anamorphic lens effects.

  • Prompt Enhancer Built-in tool to automatically improve your descriptions.

  • Format choice Export in PNG or JPEG format.

Parameters

ParameterRequiredDescription
promptYesText description of the desired image
aspect_ratioNoAspect ratio: 1:1, 3:2, 2:3, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1
resolutionNoOutput resolution: 0.5k, 1k (default), 2k, 4k
enable_web_searchNoEnable web search to enhance generation with real-time info (default: false)
enable_image_searchNoEnable image search to enhance generation with real-time info (default: false)
output_formatNoOutput format: png (default), jpeg

How to Use

  1. Write your prompt — describe the image in detail, including style, lighting, and composition.
  2. Choose aspect ratio (optional) — select a preset or leave empty for default.
  3. Select resolution — choose 1K, 2K, or 4K based on your needs.
  4. Choose output format — PNG for transparency support, JPEG for smaller file size.
  5. Use Prompt Enhancer (optional) — click to automatically refine your description.
  6. Run — submit and download your generated image.

Pricing

ResolutionCost
0.5k$0.045
1k$0.07
2k$0.105
4k$0.14
Web search+$0.014
Image search+$0.014

Best Use Cases

  • Cinematic Imagery — Generate atmospheric scenes with anamorphic lens effects and film-like quality.
  • Character Portraits — Create detailed character illustrations with specific attributes.
  • Social Media Content — Produce eye-catching visuals for various platforms with flexible aspect ratios.
  • Marketing & Ads — Generate professional images for campaigns and promotions.
  • Concept Art — Visualize creative ideas with photorealistic rendering.

Pro Tips

  • Use cinematic language like "anamorphic lens look," "neon signs reflected in puddles," or "rainy Tokyo street at night" for atmospheric results.
  • Be specific about character attributes (hair color, clothing, accessories) for consistent results.
  • Match aspect ratio to your target platform: 9:16 for Stories/Reels, 16:9 for banners, 1:1 for feeds.
  • Try the Prompt Enhancer to automatically improve your descriptions.
  • 2K outputs are charged at 1.5× the standard rate; 4K at 2× the standard rate.

Notes

  • Prompt is the only required field.
  • If aspect_ratio is not selected, the model uses a default ratio.
  • 2K resolution costs 1.5× and 4K resolution costs 2× the standard rate.
  • Ensure your prompts comply with Google's Safety Guidelines.

Related Models

Accessibility:This website uses AI models provided by third parties.

Nano Banana 2 Text To Image API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/google/nano-banana-2/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Nano Banana 2 Text To Image below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/google/nano-banana-2/text-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "1:1",
    "resolution": "1k",
    "enable_web_search": false,
    "enable_image_search": false,
    "output_format": "png",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("google/nano-banana-2/text-to-image", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "aspect_ratio": "1:1",
        "resolution": "1k",
        "enable_web_search": false,
        "enable_image_search": false,
        "output_format": "png",
        "enable_sync_mode": false,
        "enable_base64_output": false
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "google/nano-banana-2/text-to-image",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "1:1",
    "resolution": "1k",
    "enable_web_search": false,
    "enable_image_search": false,
    "output_format": "png",
    "enable_sync_mode": false,
    "enable_base64_output": false
}
)

print(output["outputs"][0])  # → URL of the generated output

Nano Banana 2 Text To Image API — Frequently asked questions

What is the Nano Banana 2 Text To Image API?

Nano Banana 2 Text To Image is a Google model for image generation, exposed as a REST API on WaveSpeedAI. Google Nano Banana 2 (Gemini 3.1 Flash Image) delivers Pro-quality image generation at Flash speed with 512px to 4K resolution support. Features include improved text rendering, character consistency for up to 5 characters, and real-world knowledge integration. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Nano Banana 2 Text To Image API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/google/google-nano-banana-2-text-to-image.

How much does Nano Banana 2 Text To Image cost per run?

Nano Banana 2 Text To Image starts at $0.070 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Nano Banana 2 Text To Image accept?

Key inputs: `prompt`, `aspect_ratio`, `resolution`, `enable_base64_output`, `enable_image_search`, `enable_sync_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/google/google-nano-banana-2-text-to-image.

How do I get started with the Nano Banana 2 Text To Image API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Nano Banana 2 Text To Image outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Google). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.