WaveSpeedAI APIWavespeed AIQwen Image Text To Image LoRA

Qwen Image Text To Image LoRA

Qwen Image Text To Image LoRA

Playground

Try it on WavespeedAI!

Qwen-Image LoRA is a 20B MMDiT next-gen text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Qwen-Image-LoRA

Qwen-Image-LoRA extends the base 20B MMDiT text-to-image model by allowing users to plug in custom LoRA weights (.safetensors) for fine-tuned control over style, characters, or artistic domains. This makes it a versatile tool for creators who want both world-class text rendering and personalized generation.


Why it looks great

  • LoRA integration: Import external .safetensors LoRA weights and control blending strength via scale.
  • SOTA text rendering: Rivals GPT-4o in English and is best-in-class for Chinese typography.
  • In-pixel text generation: Text is seamlessly integrated into images (no overlays).
  • Bilingual support: Handles Chinese & English with diverse fonts and complex layouts.
  • General image excellence: Photorealistic, anime, impressionist, or minimalist styles—all supported.

Limits and Performance

  • Max resolution per job: up to 1024 × 1024 pixels
  • LoRA path: provide <owner>/<model-name> or external .safetensors URL
  • LoRA scale: adjustable strength (default = 1.0)
  • Output formats: JPEG / PNG / WEBP
  • Processing speed: ~6–10 seconds per image
  • Input prompt: supports multi-line descriptive text

Pricing

  • $0.025 per image
  • Each image is billed individually.

How to Use

  1. Enter a prompt (supports detailed narrative & embedded text).

  2. Set size (width & height, up to 1024×1024).

  3. Add one or more LoRAs:

    • Paste the path/URL of the LoRA .safetensors file.
    • Adjust the scale (e.g., 0.5 for subtle effect, 1.0 for full strength).
  4. (Optional) Set seed for reproducibility (-1 = random).

  5. Choose output format (JPEG / PNG).

  6. Run → preview results → iterate with different LoRA scales.


Pro tips for best quality

  • Use specific LoRAs for characters, art styles, or IP consistency.
  • Combine multiple LoRAs for hybrid results (e.g., anime + steampunk).
  • Adjust scale carefully—too high may distort, too low may fade.
  • Lock the seed to maintain subject consistency when swapping LoRAs.

Reference


Note

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-lora" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "size": "1024*1024",
    "loras": [],
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The positive prompt for the generation.
sizestringNo1024*1024256 ~ 1536 per dimensionThe size of the generated media in pixels (width*height).
lorasarrayNomax 3 itemsList of LoRAs to apply (maximum 3).
loras[].pathstringYes-Path to the LoRA model
loras[].scalefloatYes-0.0 ~ 4.0Scale of the LoRA model
seedintegerNo-1-1 ~ 2147483647The random seed to use for the generation. -1 means a random seed will be used.
output_formatstringNojpegjpeg, png, webpThe format of the output image.
enable_sync_modebooleanNofalse-If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
enable_base64_outputbooleanNofalse-If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.