Jib Mix Qwen Image Text To Image
Playground
Try it on WavespeedAI!Jib Mix Qwen — more natural pretty faces (Much better at Asian faces) model for next-gen text-to-image generation.
Features
Jib-Mix-Qwen-Image (Text-to-Image)
Jib-Mix-Qwen-Image is a finely tuned text-to-image generation model based on Qwen-Image 20B (MMDiT), optimized through the Jib-Mix portrait enhancement pipeline. It specializes in realistic human faces, cinematic lighting, and vivid artistic styles, delivering professional-grade visuals from simple text prompts — no LoRA setup needed.
Why it looks great
- Jib-Mix fine-tuning – Enhances facial structure, skin texture, and lighting realism, especially for close-ups and half-body portraits.
- Cinematic diffusion engine – Captures lifelike depth, atmosphere, and tone with consistent color harmony.
- Exceptional text rendering – Handles both Chinese and English typography natively, blending text naturally into the image.
- Broad style coverage – From photorealism to anime, oil painting, 3D, or stylized artwork—one model, infinite versatility.
- Identity consistency – Generates characters with coherent facial details and stable expressions across prompts.
Limits and Performance
- Max resolution per job: up to 1536 × 1536 pixels
- Output formats: JPEG / PNG / WEBP
- Processing speed: ~5–8 seconds per image (depending on prompt complexity)
- Prompt input: supports detailed, multi-line bilingual descriptions
Pricing
- $0.02 per image Each image is billed individually.
How to Use
- Enter a prompt describing your desired image (Chinese or English).
- Set image size (width × height, up to 1536×1536).
- (Optional) Set a seed for reproducibility (
-1= random). - Choose output format (JPEG / PNG / WEBP).
- Generate → preview → iterate with refined prompts.
Pro tips for best quality
- Be specific — describe lighting, pose, emotion, and background for more control.
- For portraits, include keywords like cinematic lighting, soft focus, 8K detail, professional photo.
- Fix seed to maintain subject consistency across multiple outputs.
- Experiment with styles (e.g., realistic, anime, oil painting, CG render) to explore model versatility.
Note
- For best realism, ensure prompts describe camera angle, lighting, and environment — the model responds strongly to cinematic cues.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/jib-mix-qwen-image/text-to-image" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"size": "1024*1024",
"seed": -1,
"output_format": "jpeg",
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| size | string | No | 1024*1024 | 256 ~ 1536 per dimension | The size of the generated media in pixels (width*height). |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
| output_format | string | No | jpeg | jpeg, png, webp | The format of the output image. |
| enable_sync_mode | boolean | No | false | - | If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. |
| enable_base64_output | boolean | No | false | - | If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |