Home/Explore/Flux Image Tools/wavespeed-ai/flux-2-flex/text-to-image
text-to-image

text-to-image

FLUX.2 [flex] Text-to-Image | Fast Unlimited Image Generation | WaveSpeedAI

wavespeed-ai/flux-2-flex/text-to-image

Text-to-image generation with FLUX.2 [flex] from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities.

width
height
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

A vintage 1860s sepia-toned daguerreotype photograph. It depicts a serious Abraham Lincoln sitting in a formal chair, wearing his traditional suit and top hat, but he is wearing large, colorful, modern DJ headphones around his neck and holding a shiny silver microphone. The photo has scratches, dust particles, and heavy vignette typical of the 19th century, but the modern equipment looks physically present in the scene, not just pasted on.

Your request will cost $0.012 per run.

For $1 you can run this model approximately 83 times.

One more thing::

ExamplesView all

A vintage 1860s sepia-toned daguerreotype photograph. It depicts a serious Abraham Lincoln sitting in a formal chair, wearing his traditional suit and top hat, but he is wearing large, colorful, modern DJ headphones around his neck and holding a shiny silver microphone. The photo has scratches, dust particles, and heavy vignette typical of the 19th century, but the modern equipment looks physically present in the scene, not just pasted on.
A breathtaking photograph of "The Vertical Forest City." Enormous, futuristic residential skyscrapers built completely out of intertwining massive tree roots, glass, and polished wood, rising from a misty jungle canyon. Waterfalls cascade from the upper balconies of the buildings. Suspended glass bridges connect the towers. People are visible gardening on their plant-covered terraces. The architecture looks organic yet structurally impossible, bathed in warm sunset light.
A hyper-realistic close-up photograph of a master watchmaker's hands working on a complex mechanical watch movement. The watchmaker is using fine tweezers to carefully place a tiny ruby jewel bearing into the gears. We can see every wrinkle on the fingers, the tension in the skin, fingerprint ridges, and oil stains. The watch gears, springs, and tiny screws are rendered with immense mechanical precision under a magnifying lamp. The focus must be absolutely critical on the point where the tweezers touch the ruby.
A hyper-realistic studio shot of a transparent glass skull filled with colorful jelly beans. The skull is placed inside a cube made of clear ice. We can see the distorted refraction of the jelly beans through both the ice and the glass skull. Lighting is coming from behind, creating a glowing effect through the sweets. Water droplets are melting off the ice cube onto a black reflective surface.
A detailed, deconstructed technical blueprint of a futuristic sci-fi drone engine, drawn in white lines on a dark blue grid background. The schematic includes exploded views of gears and rotors. Specific components are labeled with text: "TURBINE V8", "INTAKE MANIFOLD", and "FUEL CELL". The drawing style is precise, engineering CAD style, with measurements and dashed lines indicating assembly.

README

FLUX.2 [flex] — Text-to-Image

Versatile, style-rich open-source generation with professional visual quality at high speed. FLUX.2 [flex] offers a more expressive variant of the FLUX.2 family while staying lean enough for rapid iteration and large-scale use.

Built for

  • Creative exploration and style-heavy projects
  • High-volume generation where variety matters
  • Teams optimising the speed–quality–diversity balance
  • Domain-specific and brand-specific fine-tuning

Richer Style Range with Fast Turnaround

FLUX.2 [flex] keeps the streamlined core of FLUX.2 but is tuned for broader aesthetics, stronger stylisation, and more dynamic compositions. You get production-ready images at speeds that support tight feedback loops, while having extra room to push mood, colour, and framing.

What This Means for You

• Broader aesthetic range

Generates images with more varied styles and moods, making it easier to explore different visual directions from a single prompt.

• Optimised speed–quality balance

Delivers results faster than heavyweight flagship models while maintaining sharpness, coherence, and detail suitable for professional use.

• Open-source backbone

Built on open tooling and community-driven development, so you can inspect, extend, and integrate it deeply into custom stacks.

• Fine-tuning friendly

Works well as a base model for LoRA or other lightweight adapters, letting you specialise it for specific genres, subjects, or branded looks without full retraining.

• Efficient resource usage

The lean architecture keeps GPU requirements modest, making large batches, automated pipelines, and internal tools more cost-effective.

• Flexible output formats

Supports common outputs such as JPEG and PNG for direct use in web, design, and product workflows.

• Reproducible generations

Seed control enables consistent re-runs and controlled variations—crucial for experimentation, A/B testing, and iterative creative work.

Pricing

Simple per-image billing:

  • $0.012 per generated image

FLUX.2 family on WaveSpeedAI

Combine FLUX.2 [flex] with the rest of the FLUX.2 lineup for a complete creation and editing workflow:

More Image Tools on WaveSpeedAI

  • Nano Banana Pro – Google’s Gemini-based text-to-image model for sharp, coherent, prompt-faithful visuals that work great for ads, keyframes, and product shots.
  • Seedream V4 – ByteDance’s style-consistent, multi-image generator ideal for posters, campaigns, and large batches of on-brand illustrations.
  • Qwen Edit Plus – an enhanced Qwen-based image editor for precise inpainting, cleanup, and local style changes while preserving overall composition.