← Blog

Introducing Baidu ERNIE Image Turbo on WaveSpeedAI

ERNIE Image Turbo — Baidu's 8-step distilled text-to-image model. Native Chinese/English/Japanese, fast generation, $0.03/image. Now on WaveSpeedAI.

4 min read
Wavespeed Ai Ernie Image Text To Image Turbo
Wavespeed Ai Ernie Image Text To Image Turbo ERNIE Image Turbo — Baidu's 8-step distilled text-to-image m...
Try it
Introducing Baidu ERNIE Image Turbo on WaveSpeedAI

Baidu’s Multilingual Image Model, Now at Turbo Speed

Full-quality image generation is great for finals — but for ideation, iteration, and real-time experiences, you want the same model with a fraction of the latency. We’re excited to announce that Baidu ERNIE Image Turbo is now live on WaveSpeedAI — a distilled 8-step variant of ERNIE Image that trades a tiny amount of detail for a huge speedup, at the same low per-image price.

What Is Baidu ERNIE Image Turbo?

ERNIE Image Turbo is the fast-inference variant of Baidu’s flagship text-to-image model, ERNIE Image. Through step distillation, Baidu compressed the generation pipeline to just 8 inference steps while preserving the core strengths that make ERNIE Image unique: native Chinese, English, and Japanese prompt understanding; flexible sizing; and LLM-enhanced prompt expansion.

If ERNIE Image is for “final pixel quality,” ERNIE Image Turbo is for “as fast as your users can type.”

Key Features

8-Step Distilled Inference A fraction of the compute of the full-quality variant, with output quality that’s still production-viable for the vast majority of use cases.

Same Native Multilingual Prompts Chinese (简体中文), English, and Japanese (日本語) — all first-class. Turbo doesn’t sacrifice language fidelity for speed.

LLM-Enhanced Prompt Expansion Short prompts still get the ERNIE-powered auto-expansion, so brief inputs produce detailed outputs.

Flexible Sizing Free aspect ratios and resolutions — portrait, landscape, square, custom.

Low Latency, Low Cost Cheap enough for ideation, fast enough for chat-based creative UIs, live demos, and iterative refinement.

Real-World Use Cases

Interactive Creative Apps

Build tools where users type a prompt and see results in seconds, not tens of seconds. Essential for chat-style creative UIs, design copilots, and text-adventure visuals.

Chinese/Japanese Social Content at Scale

Batch-generate large volumes of localized social media visuals without burning through a bigger budget.

Concept Exploration

Sweep through 20 variations of a concept in the time it takes full-quality ERNIE Image to render 5. Pick a winner, then re-render at full quality.

Product Listings and Thumbnails

For e-commerce, game asset, and UGC platforms — high-volume image generation at an unbeatable price.

Live Demos and User-Facing Previews

Keep users engaged with fast feedback. Use Turbo for interactive preview, then offer a “render final” button that calls full ERNIE Image.

Getting Started on WaveSpeedAI

  1. Write a prompt in Chinese, English, or Japanese.
  2. Pick a size — any aspect ratio that fits your layout.
  3. Submit — outputs return in a fraction of the time of the full-quality variant.

Full API schema and an interactive playground live on the model page.

Pricing

Just $0.03 per image — the same low price as full ERNIE Image, but with Turbo’s latency profile. An unbeatable deal for high-volume, interactive, and iterative workflows.

Why Run ERNIE Image Turbo on WaveSpeedAI

  • One API for 890+ models. Swap between Turbo, full ERNIE Image, FLUX, SDXL, and more with a string change.
  • No cold starts. Turbo stays turbo under load — no warmup tax.
  • Transparent pricing. Per-image, no subscription, no minimums.
  • Production reliability. Suitable for real-time creative apps, live demos, and consumer workloads.

Pro Tips

  • Ideate in Turbo, finalize in ERNIE Image. Let users (or yourself) iterate cheaply, then re-render the winner at full quality.
  • Batch well. Turbo’s low latency shines when generating grids of variations from a single prompt.
  • Keep prompts focused. Specify subject, style, and mood; the LLM expansion handles the rest.
  • For Chinese content, write in Chinese. Skip translation — Turbo understands natively.
  • Use streaming UIs. With Turbo’s speed, preview-as-you-type patterns become practical.

Start Creating Today

ERNIE Image Turbo takes Baidu’s native multilingual image generation and makes it fast enough for real-time use — at the same per-image price as the full-quality model.

Try Baidu ERNIE Image Turbo now on WaveSpeedAI and build multilingual creative UIs that respond at human speed.