Qwen Image Text To Image LoRA
Playground
Try it on WavespeedAI!Qwen-Image LoRA is a 20B MMDiT next-gen text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Features
Qwen-Image-LoRA
Qwen-Image-LoRA extends the base 20B MMDiT text-to-image model by allowing users to plug in custom LoRA weights (.safetensors) for fine-tuned control over style, characters, or artistic domains. This makes it a versatile tool for creators who want both world-class text rendering and personalized generation.
Why it looks great
- LoRA integration: Import external
.safetensorsLoRA weights and control blending strength viascale. - SOTA text rendering: Rivals GPT-4o in English and is best-in-class for Chinese typography.
- In-pixel text generation: Text is seamlessly integrated into images (no overlays).
- Bilingual support: Handles Chinese & English with diverse fonts and complex layouts.
- General image excellence: Photorealistic, anime, impressionist, or minimalist styles—all supported.
Limits and Performance
- Max resolution per job: up to 1024 × 1024 pixels
- LoRA path: provide
<owner>/<model-name>or external.safetensorsURL - LoRA scale: adjustable strength (default = 1.0)
- Output formats: JPEG / PNG / WEBP
- Processing speed: ~6–10 seconds per image
- Input prompt: supports multi-line descriptive text
Pricing
- $0.025 per image
- Each image is billed individually.
How to Use
-
Enter a prompt (supports detailed narrative & embedded text).
-
Set size (width & height, up to 1024×1024).
-
Add one or more LoRAs:
- Paste the path/URL of the LoRA
.safetensorsfile. - Adjust the scale (e.g.,
0.5for subtle effect,1.0for full strength).
- Paste the path/URL of the LoRA
-
(Optional) Set seed for reproducibility (
-1= random). -
Choose output format (JPEG / PNG).
-
Run → preview results → iterate with different LoRA scales.
Pro tips for best quality
- Use specific LoRAs for characters, art styles, or IP consistency.
- Combine multiple LoRAs for hybrid results (e.g., anime + steampunk).
- Adjust scale carefully—too high may distort, too low may fade.
- Lock the seed to maintain subject consistency when swapping LoRAs.
Reference
Note
- Please use wavespeed-ai/qwen-image-lora-trainer to make sure your LoRA can use in this model!
- Or the corresponding model from official platform! (Civitai or Hugging Face)
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-lora" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"size": "1024*1024",
"loras": [],
"seed": -1,
"output_format": "jpeg",
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| size | string | No | 1024*1024 | 256 ~ 1536 per dimension | The size of the generated media in pixels (width*height). |
| loras | array | No | max 3 items | List of LoRAs to apply (maximum 3). | |
| loras[].path | string | Yes | - | Path to the LoRA model | |
| loras[].scale | float | Yes | - | 0.0 ~ 4.0 | Scale of the LoRA model |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
| output_format | string | No | jpeg | jpeg, png, webp | The format of the output image. |
| enable_sync_mode | boolean | No | false | - | If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. |
| enable_base64_output | boolean | No | false | - | If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |