FLUX.2 [pro] Text-to-Image | Fast Unlimited Image Generation | WaveSpeedAI

wavespeed-ai/flux-2-pro/text-to-image

Text-to-image generation with FLUX.2 [pro] from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities.

width
height
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
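
As a rough sketch, the options above might be assembled into a JSON request body as shown here. Every key name (`prompt`, `width`, `height`, `enable_sync_mode`, `enable_base64_output`) and both helper functions are assumptions made for illustration; this page does not confirm the actual schema, so check the API reference before relying on them.

```python
import base64
import json


def build_request_body(prompt, width, height, sync=False, as_base64=False):
    """Assemble a JSON request body. All key names are assumed, not confirmed."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return json.dumps({
        "prompt": prompt,
        "width": width,
        "height": height,
        # Assumed name: wait for generation and upload before responding.
        "enable_sync_mode": sync,
        # Assumed name: return a BASE64 string instead of a URL.
        "enable_base64_output": as_base64,
    })


def decode_base64_output(encoded):
    """Turn a BASE64 output string into raw image bytes.

    Tolerates an optional "data:...;base64," prefix, which is an assumption
    about how the string might be delivered.
    """
    if encoded.startswith("data:") and "," in encoded:
        encoded = encoded.split(",", 1)[1]
    return base64.b64decode(encoded)
```

With BASE64 output enabled, the decoded bytes can be written straight to a file, which avoids a second download step at the cost of a larger response.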

Your request will cost $0.03 per run.

For $1 you can run this model approximately 33 times.

Examples

A photograph of a vintage, beige CRT monitor sitting on a cluttered wooden desk in a dimly lit room. The curved screen glows with monochromatic green text. Displayed on the screen is Python code:

import flux_api
def generate_image(prompt):
    print(f"Processing: {prompt}")
    # Connecting to core...
    return True

Below the code, a blinking cursor sits next to a command prompt: `user@retro-pc:~/dev$ _`. The screen shows scan lines, slight flickering distortion, and reflections of the room lights on the curved glass surface. A mechanical keyboard with beige and grey keycaps sits in front of it. Film grain.
A girl taking a mirror selfie in a bathroom. She is holding an iPhone in her right hand. We see her back in the foreground, and her front reflection in the mirror. The phone screen in the reflection is visible and displays a text message bubble saying "I AM AI". The focus is on the reflection in the mirror. The background shows tiled walls and towels. The geometry of the reflection must perfectly match her pose.
A nature photograph captures a defensive standoff between a North American porcupine and a nine-banded armadillo in a dry, rocky riverbed. The porcupine is turned away, with hundreds of long, sharp quills fully erected, creating a textured defensive halo. The armadillo is curled halfway into a ball, showing the intricate, leathery texture of its armored bands and scales. Dust is kicking up slightly around them. Harsh afternoon sunlight casting long shadows. Sharp focus on the textures of both animals.
A comic book page layout drawn in a gritty noir graphic novel style with heavy black ink shadows. The page is divided into three horizontal panels separated by white gutters.
Top Panel: A close-up of detective's eyes looking suspicious through blinds. Text bubble says "HE'S LATE."
Middle Panel: A wide shot of a rainy, dark alleyway with a single figure standing under a streetlamp. Sound effect text "TIP TAP TIP TAP" near their feet.
Bottom Panel: A gloved hand holding a smoking revolver. Text bubble says "...TOO LATE."
The art style is consistent across all panels.
A symmetrical studio photograph of two identical, transparent glass chemical flasks sitting side-by-side on a white laboratory bench.
The LEFT flask contains a clear, still blue liquid and has a label reading "SOLUTION A: STABLE".
The RIGHT flask contains the same blue liquid, but it is vigorously bubbling and boiling, with steam rising from the neck. It has an identical label that reads "SOLUTION B: REACTING".
The lighting is clean and clinical. The focus must clearly show the difference in the liquid's state.

README

FLUX.2 [pro] — Text-to-Image

High-capacity, production-grade generation with cinematic visual quality. FLUX.2 [pro] is the flagship of the FLUX.2 family, built for maximum fidelity, stronger global coherence, and polished outputs suitable for hero images, key art, and high-stakes campaigns.

Built for

  • Hero shots and key visuals
  • Commercial campaigns and product imagery
  • High-resolution, detail-critical use cases
  • Teams prioritising quality over raw speed
  • Domain-specific and brand-specific fine-tuning at the top tier

What This Means for You

• Flagship-level visual quality

Generates sharp, detailed, and globally coherent images that are suited for front-page creatives, campaign assets, and final deliverables.

• Strong prompt adherence

Handles complex, multi-part prompts with improved consistency across characters, objects, layout, and style, reducing the need for post-selection or heavy manual curation.

• Open-source backbone

Built on the FLUX.2 ecosystem and open tooling, enabling deeper inspection, integration, and extension inside custom pipelines and internal platforms.

• Fine-tuning and LoRA friendly

Acts as a powerful base for LoRA and other lightweight adapters when you need top-tier quality combined with brand-specific or domain-specific customisation.

• Efficient for its class

While more compute-intensive than dev or flex, it remains efficient relative to other high-end diffusion models, making recurring production use feasible.

• Flexible output formats

Supports common outputs such as JPEG and PNG, ready for design tools, web delivery, print preparation, and downstream processing.
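
When downstream tooling needs to know which of the two formats it received, one option is to check the file signature instead of trusting a file extension. The sketch below is an illustration only: the function name is hypothetical, and it relies on the standard PNG signature and the usual JPEG start-of-image bytes, not on anything specific to this service.

```python
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"  # fixed 8-byte PNG file signature
JPEG_PREFIX = b"\xff\xd8\xff"         # JPEG start-of-image marker plus next marker byte


def sniff_image_format(data: bytes) -> str:
    """Return "png", "jpeg", or "unknown" based on the leading bytes."""
    if data.startswith(PNG_SIGNATURE):
        return "png"
    if data.startswith(JPEG_PREFIX):
        return "jpeg"
    return "unknown"
```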

• Reproducible generations

Seed control allows consistent reruns and controlled variations, which is essential for A/B testing, creative approvals, and iterative refinement loops.
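
A minimal sketch of how seed control could support the workflows above: hold the seed fixed while changing the prompt for an A/B comparison, or hold the prompt fixed while stepping the seed for controlled variations. The `seed` and `prompt` key names and both helper functions are assumptions for illustration, not a documented schema.

```python
def ab_pair(prompt_a, prompt_b, seed):
    """Two requests that differ only in prompt, sharing one seed (assumed key)."""
    return (
        {"prompt": prompt_a, "seed": seed},
        {"prompt": prompt_b, "seed": seed},
    )


def seeded_variations(prompt, base_seed, count):
    """Requests that share a prompt but use consecutive seeds."""
    if count < 1:
        raise ValueError("count must be at least 1")
    return [{"prompt": prompt, "seed": base_seed + i} for i in range(count)]
```

Recording the seed next to each approved image makes it possible to rerun the same request later during an approval or refinement loop.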

Pricing

Simple per-image billing:

  • $0.03 per generated image
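
For budgeting, the per-image price translates directly into a run count. The sketch below uses the $0.03 figure from this page; the function names are hypothetical, and `Decimal` is used so that currency arithmetic avoids binary floating-point rounding.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.03")  # USD per generated image, as listed above


def images_for_budget(budget_usd):
    """Whole number of images a budget covers at the listed price."""
    return int(Decimal(str(budget_usd)) // PRICE_PER_IMAGE)


def cost_for_images(count):
    """Total cost in USD for a given number of images."""
    return PRICE_PER_IMAGE * count
```

For example, `images_for_budget(1)` gives 33, matching the "approximately 33 times" figure for $1.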

FLUX.2 family on WaveSpeedAI

Use FLUX.2 [pro] Text-to-Image together with the rest of the FLUX.2 lineup for a complete generate-and-edit stack.

More Image Tools on WaveSpeedAI

  • Nano Banana Pro – Google’s Gemini-based text-to-image model for sharp, coherent, prompt-faithful visuals that work great for ads, keyframes, and product shots.
  • Seedream V4 – ByteDance’s style-consistent, multi-image generator ideal for posters, campaigns, and large batches of on-brand illustrations.
  • Qwen Edit Plus – an enhanced Qwen-based image editor for precise inpainting, cleanup, and local style changes while preserving overall composition.