Vidu Q3 और Q3 Pro मॉडल पर 50% छूट · केवल WaveSpeedAI | 20 मई – 2 जून

Jib Mix Qwen Image Text to Image

wavespeed-ai /

Jib Mix Qwen is a next-gen Text-to-Image model optimized for producing natural, pretty faces with improved Asian facial rendering. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image
Input
width
height
1024 × 1024 px
Range: 256 - 1536
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

An Impressionist oil painting of a woman sitting in a sun-dappled garden, holding a parasol, in the style of Claude Monet. Visible, broken brushstrokes. The focus is on the fleeting play of light and shadow on her white dress and the vibrant, dappled colors of the flowers. Soft focus, bright, airy atmosphere.

$0.02per run·~50 / $1

Next:

ExamplesView all

An Impressionist oil painting of a woman sitting in a sun-dappled garden, holding a parasol, in the style of Claude Monet. Visible, broken brushstrokes. The focus is on the fleeting play of light and shadow on her white dress and the vibrant, dappled colors of the flowers. Soft focus, bright, airy atmosphere.

An Impressionist oil painting of a woman sitting in a sun-dappled garden, holding a parasol, in the style of Claude Monet. Visible, broken brushstrokes. The focus is on the fleeting play of light and shadow on her white dress and the vibrant, dappled colors of the flowers. Soft focus, bright, airy atmosphere.

A surreal oil painting of a woman’s face emerging from a swirl of colors, expressive brushstrokes, high contrast lighting, inspired by Van Gogh and Klimt, artistic depth.

A surreal oil painting of a woman’s face emerging from a swirl of colors, expressive brushstrokes, high contrast lighting, inspired by Van Gogh and Klimt, artistic depth.

A beautiful anime girl sitting under cherry blossoms, soft glowing light, expressive eyes, delicate hair strands, cinematic composition, Makoto Shinkai style, pastel tones, high detail.

A beautiful anime girl sitting under cherry blossoms, soft glowing light, expressive eyes, delicate hair strands, cinematic composition, Makoto Shinkai style, pastel tones, high detail.

A highly detailed 3D character render, stylized realism. A young female adventurer with large, expressive teal eyes and freckles. Messy copper hair. Wearing worn leather armor. Soft, flattering studio lighting. Flawless skin shading (subsurface scattering). Trending on Artstation, Unreal Engine 5, ZBrush, Pixar style.

A highly detailed 3D character render, stylized realism. A young female adventurer with large, expressive teal eyes and freckles. Messy copper hair. Wearing worn leather armor. Soft, flattering studio lighting. Flawless skin shading (subsurface scattering). Trending on Artstation, Unreal Engine 5, ZBrush, Pixar style.

A highly detailed Steampunk portrait of a female airship captain. Wearing an ornate Victorian dress combined with a leather corset and utility belt. Intricate brass goggles pushed up on her tophat. A complex clockwork mechanical arm. Background of gears and steam pipes. Warm sepia tones, imaginative, complex machinery.

A highly detailed Steampunk portrait of a female airship captain. Wearing an ornate Victorian dress combined with a leather corset and utility belt. Intricate brass goggles pushed up on her tophat. A complex clockwork mechanical arm. Background of gears and steam pipes. Warm sepia tones, imaginative, complex machinery.

A 1940s black and white film noir portrait. A mysterious woman (femme fatale) wearing a stylish hat with a veil. Her face is half-hidden in deep shadow. Dramatic "Venetian blind" shadows fall across her face and the wall behind her. Smoking a cigarette, smoke curling in the air. High contrast, grainy film texture, mysterious, chiaroscuro lighting.

A 1940s black and white film noir portrait. A mysterious woman (femme fatale) wearing a stylish hat with a veil. Her face is half-hidden in deep shadow. Dramatic "Venetian blind" shadows fall across her face and the wall behind her. Smoking a cigarette, smoke curling in the air. High contrast, grainy film texture, mysterious, chiaroscuro lighting.

An ultra-realistic, gritty portrait of a cyberpunk hacker. Her face is illuminated by the glowing blue and pink neon signs of a futuristic city street at night. Rain-slicked trench coat, intricate cybernetic implants on her cheek, intense, focused eyes. High contrast, sharp focus, digital art, cinematic atmosphere, style of Blade Runner.

An ultra-realistic, gritty portrait of a cyberpunk hacker. Her face is illuminated by the glowing blue and pink neon signs of a futuristic city street at night. Rain-slicked trench coat, intricate cybernetic implants on her cheek, intense, focused eyes. High contrast, sharp focus, digital art, cinematic atmosphere, style of Blade Runner.

An ethereal watercolor portrait of a red-haired woman, eyes closed, face tilted up. Delicate green ivy and tiny mushrooms are woven into her braided hair. Soft, earthy pastels (moss green, terracotta), translucent washes, fine ink outlines. A peaceful, listening expression, as if hearing the forest. Isolated on a white background, whimsical Art Nouveau illustration.

An ethereal watercolor portrait of a red-haired woman, eyes closed, face tilted up. Delicate green ivy and tiny mushrooms are woven into her braided hair. Soft, earthy pastels (moss green, terracotta), translucent washes, fine ink outlines. A peaceful, listening expression, as if hearing the forest. Isolated on a white background, whimsical Art Nouveau illustration.

A minimalist single-line (one-line) drawing of a person's face in profile. The entire portrait is created with one continuous, flowing black line on a plain white background. Elegant, simple, clean. Captures the essence of the form with no shading. Style of a modern tattoo design or Matisse's line drawings.

A minimalist single-line (one-line) drawing of a person's face in profile. The entire portrait is created with one continuous, flowing black line on a plain white background. Elegant, simple, clean. Captures the essence of the form with no shading. Style of a modern tattoo design or Matisse's line drawings.

A Baroque-style oil painting portrait of a nobleman with a thoughtful expression. In the style of Caravaggio. Dramatic chiaroscuro, with a single light source illuminating him against a pitch-black background (Tenebrism). Rich texture in his velvet robes and intricate lace collar. Deep, rich colors, masterful brushstrokes, profound and theatrical.

A Baroque-style oil painting portrait of a nobleman with a thoughtful expression. In the style of Caravaggio. Dramatic chiaroscuro, with a single light source illuminating him against a pitch-black background (Tenebrism). Rich texture in his velvet robes and intricate lace collar. Deep, rich colors, masterful brushstrokes, profound and theatrical.

An Art Deco portrait of a glamorous 1920s woman. Sharp, geometric bob haircut. Wearing an extravagant, sequined dress and a pearl headpiece. Posing confidently against a background of stylized, metallic gold geometric patterns (like the Chrysler Building). Bold lines, lavish ornamentation, strong symmetry, sophisticated and modern.

An Art Deco portrait of a glamorous 1920s woman. Sharp, geometric bob haircut. Wearing an extravagant, sequined dress and a pearl headpiece. Posing confidently against a background of stylized, metallic gold geometric patterns (like the Chrysler Building). Bold lines, lavish ornamentation, strong symmetry, sophisticated and modern.

A delicate watercolor portrait of a young woman in profile, her long silver hair flowing and blending with wisps of translucent clouds. Soft moonlight illuminates her face, adorned with tiny, glowing crescent moon symbols. Soft pastel blues and lavenders, elegant flowing lines, a serene and mystical expression. Clean white background, in the Art Nouveau style of Alphonse Mucha.

A delicate watercolor portrait of a young woman in profile, her long silver hair flowing and blending with wisps of translucent clouds. Soft moonlight illuminates her face, adorned with tiny, glowing crescent moon symbols. Soft pastel blues and lavenders, elegant flowing lines, a serene and mystical expression. Clean white background, in the Art Nouveau style of Alphonse Mucha.

Related Models

README

Jib-Mix-Qwen-Image (Text-to-Image)

Jib-Mix-Qwen-Image is a finely tuned text-to-image generation model based on Qwen-Image 20B (MMDiT), optimized through the Jib-Mix portrait enhancement pipeline. It specializes in realistic human faces, cinematic lighting, and vivid artistic styles, delivering professional-grade visuals from simple text prompts — no LoRA setup needed.

Why it looks great

  • Jib-Mix fine-tuning – Enhances facial structure, skin texture, and lighting realism, especially for close-ups and half-body portraits.
  • Cinematic diffusion engine – Captures lifelike depth, atmosphere, and tone with consistent color harmony.
  • Exceptional text rendering – Handles both Chinese and English typography natively, blending text naturally into the image.
  • Broad style coverage – From photorealism to anime, oil painting, 3D, or stylized artwork—one model, infinite versatility.
  • Identity consistency – Generates characters with coherent facial details and stable expressions across prompts.

Limits and Performance

  • Max resolution per job: up to 1536 × 1536 pixels
  • Output formats: JPEG / PNG / WEBP
  • Processing speed: ~5–8 seconds per image (depending on prompt complexity)
  • Prompt input: supports detailed, multi-line bilingual descriptions

Pricing

  • $0.02 per image Each image is billed individually.

How to Use

  1. Enter a prompt describing your desired image (Chinese or English).
  2. Set image size (width × height, up to 1536×1536).
  3. (Optional) Set a seed for reproducibility (-1 = random).
  4. Choose output format (JPEG / PNG / WEBP).
  5. Generate → preview → iterate with refined prompts.

Pro tips for best quality

  • Be specific — describe lighting, pose, emotion, and background for more control.
  • For portraits, include keywords like cinematic lighting, soft focus, 8K detail, professional photo.
  • Fix seed to maintain subject consistency across multiple outputs.
  • Experiment with styles (e.g., realistic, anime, oil painting, CG render) to explore model versatility.

Note

  • For best realism, ensure prompts describe camera angle, lighting, and environment — the model responds strongly to cinematic cues.
Accessibility:This website uses AI models provided by third parties.

Jib Mix Qwen Image Text To Image API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/jib-mix-qwen-image/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Jib Mix Qwen Image Text To Image below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/jib-mix-qwen-image/text-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/jib-mix-qwen-image/text-to-image", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "seed": -1,
        "output_format": "jpeg",
        "enable_sync_mode": false,
        "enable_base64_output": false
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/jib-mix-qwen-image/text-to-image",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}
)

print(output["outputs"][0])  # → URL of the generated output

Jib Mix Qwen Image Text To Image API — Frequently asked questions

What is the Jib Mix Qwen Image Text To Image API?

Jib Mix Qwen Image Text To Image is a WaveSpeedAI model for image generation, exposed as a REST API on WaveSpeedAI. Jib Mix Qwen is a next-gen Text-to-Image model optimized for producing natural, pretty faces with improved Asian facial rendering. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Jib Mix Qwen Image Text To Image API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/jib-mix-qwen-image-text-to-image.

How much does Jib Mix Qwen Image Text To Image cost per run?

Jib Mix Qwen Image Text To Image starts at $0.020 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Jib Mix Qwen Image Text To Image accept?

Key inputs: `prompt`, `size`, `seed`, `enable_base64_output`, `enable_sync_mode`, `output_format`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/jib-mix-qwen-image-text-to-image.

How long does Jib Mix Qwen Image Text To Image take to generate?

Average end-to-end generation time on WaveSpeedAI is around 7 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Jib Mix Qwen Image Text To Image outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.