WaveSpeed.ai

Z-Image-Base LoRA

wavespeed-ai/z-image/base-lora

Z-Image-Base LoRA (6B) enables high-quality text-to-image generation with full CFG support and external LoRA support. It supports negative prompting and applies up to 3 LoRAs for custom styles. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input

width, height: 1024 × 1024 px by default. Range: 256 - 1536.
enable_sync_mode: If set to true, the request waits until the result has been generated and uploaded before returning, so the result arrives directly in the response. This property is only available through the API.
Base64 output: If enabled, the output is encoded as a Base64 string instead of a URL. This property is only available through the API.
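
When Base64 output is enabled, the caller has to decode the string before saving it. The following is a minimal sketch, assuming the string may or may not carry a `data:image/...;base64,` prefix (this page does not say which). The helper name `decode_image_b64` is hypothetical and not part of the API.

```python
import base64


def decode_image_b64(encoded: str) -> bytes:
    """Decode a Base64 image string into raw bytes.

    Assumption: the string may optionally start with a data-URI prefix
    such as "data:image/png;base64,". The page does not specify this.
    """
    if encoded.startswith("data:") and "," in encoded:
        # Drop everything up to and including the first comma.
        encoded = encoded.split(",", 1)[1]
    return base64.b64decode(encoded)


# Usage with a stand-in payload (not a real image returned by the API).
sample = base64.b64encode(b"\x89PNG fake bytes").decode("ascii")
raw = decode_image_b64("data:image/png;base64," + sample)
```

The decoded bytes can then be written to disk with a file opened in binary mode.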

Your request will cost $0.012 per run.

For $1 you can run this model approximately 83 times.

Examples

A vintage-style 35mm film photograph of a smiling couple sitting in a retro diner. Warm, golden indoor lighting. They are laughing, not looking at the camera. Flash photography aesthetic, slightly harsh shadow behind them, but the skin looks glowing and warm. Grainy, imperfect, nostalgic vibe. Details of the retro leather seats and milkshake on the table. Candid moment, pure joy.
Close-up of a fantasy queen wearing an elaborate, intricate gold headpiece encrusted with rubies and sapphires. The jewelry has filigree details and hangs over her forehead. Her makeup is gold leaf avant-garde style. Intense gaze, purple irises. Macro shot showing the facets of the gemstones and the texture of the gold metal. Royal atmosphere, symmetrical composition, sharp depth of field, opulence, photorealistic.
Surreal infrared portrait photography, Kodak Aerochrome film style. A young woman stands in a landscape where all foliage (trees, grass) is rendered in deep crimson and pink tones. Her skin appears pale, almost porcelain white and smooth, contrasting with dark, intense eyes. Grainy analog film texture, ethereal atmosphere, dreamlike colors, color shift, unique aesthetic.
A cinematic photograph of identical adult twin sisters interacting. They are sitting on a vintage sofa. Twin A on the left is laughing joyfully, head thrown back. Twin B on the right is looking at her sister with a serious, contemplative expression. They share the exact same facial features but different emotions. Warm afternoon light fills the bohemian room. The challenge is maintaining perfect facial likeness consistency. 35mm film photograph.
A moody cinematic street portrait at night in a rainy city. A handsome young man stands under a transparent umbrella. The background is a blur of vibrant city traffic lights and neon signs (beautiful bokeh). Raindrops are illuminated by the streetlights. He looks to the side with a thoughtful expression. Shot on Kodak Portra 800, high contrast, grainy texture, wet asphalt reflection, emotional storytelling, 85mm lens.

README

Z-Image Base LoRA

Z-Image Base LoRA is a 6-billion-parameter text-to-image model from Tongyi-MAI with full LoRA support. Apply up to 3 custom LoRA adapters simultaneously to generate images with personalized styles, characters, or brand aesthetics, all while maintaining fast generation speeds.

Why Choose This?

  • Triple LoRA support: Apply up to 3 custom LoRA adapters at once for layered style control, combining character, style, and aesthetic LoRAs in a single generation.

  • Flexible output sizing: Set width and height independently, from 256 to 1536 px, for the aspect ratio you need.

  • Prompt Enhancer: Built-in tool to automatically improve your prompts for better results.

  • LoRA ecosystem compatibility: Load LoRA weights from popular sources like Civitai and Hugging Face, or train your own custom LoRAs.

  • Affordable pricing: Just $0.012 per image, suited to high-volume generation with custom styles.

Parameters

Parameter         Required  Description
prompt            Yes       Text description of the image you want to generate
negative_prompt   No        Elements to avoid in the output
loras             No        Up to 3 LoRA adapters to apply (click "+ Add Item")
size              No        Preset size options
width             No        Output width in pixels (default: 1024)
height            No        Output height in pixels (default: 1024)
seed              No        Random seed for reproducibility (default: -1 for random)
output_format     No        Output format: jpeg, png (default: jpeg)
enable_sync_mode  No        API only: wait for result before returning response
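
The parameters above can be checked on the client before a request is sent. The following is a rough sketch rather than the service's own validation: the function name `build_payload` is hypothetical, the `{"path": ..., "scale": ...}` shape of each LoRA entry is an assumption (this page does not show the item schema), and `size` is omitted because its preset values are not listed here. The 256 to 1536 limit comes from the range shown in the input form.

```python
from typing import Optional

MAX_LORAS = 3                      # "Up to 3 LoRA adapters"
SIZE_MIN, SIZE_MAX = 256, 1536     # range shown in the input form
OUTPUT_FORMATS = {"jpeg", "png"}


def build_payload(
    prompt: str,
    negative_prompt: str = "",
    loras: Optional[list] = None,
    width: int = 1024,
    height: int = 1024,
    seed: int = -1,
    output_format: str = "jpeg",
    enable_sync_mode: bool = False,
) -> dict:
    """Validate the documented parameters and return a request body."""
    loras = loras or []
    if not prompt.strip():
        raise ValueError("prompt is required")
    if len(loras) > MAX_LORAS:
        raise ValueError(f"at most {MAX_LORAS} LoRAs can be applied")
    for name, side in (("width", width), ("height", height)):
        if not SIZE_MIN <= side <= SIZE_MAX:
            raise ValueError(f"{name} must be within {SIZE_MIN}-{SIZE_MAX}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError("output_format must be 'jpeg' or 'png'")
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "loras": loras,  # assumed item shape: {"path": str, "scale": float}
        "width": width,
        "height": height,
        "seed": seed,
        "output_format": output_format,
        "enable_sync_mode": enable_sync_mode,
    }


payload = build_payload(
    "a retro diner, 35mm film photograph",
    loras=[{"path": "example/style-lora", "scale": 0.8}],  # hypothetical LoRA
)
```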

How to Use

  1. Write your prompt — describe the image you want to create, including your LoRA trigger words.
  2. Add LoRAs — click "+ Add Item" to add up to 3 LoRA adapters with their weights.
  3. Add negative prompt (optional) — specify what to avoid.
  4. Set dimensions — adjust width and height for your needs.
  5. Run — submit and download your image.
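
The same steps can be carried out against the REST API instead of the web form. This page does not give the endpoint URL or the authentication scheme, so the sketch below uses a placeholder host and assumes a Bearer token header; both need to be replaced with the values from the official API documentation. The LoRA entry shape is likewise an assumption. The request is only constructed here, not sent.

```python
import json
import urllib.request

# Placeholder, NOT the real endpoint: only the model path comes from this page.
API_URL = "https://api.example.com/wavespeed-ai/z-image/base-lora"


def make_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one generation."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


req = make_request(
    {
        "prompt": "mychar sitting in a retro diner",  # "mychar": hypothetical trigger word
        "negative_prompt": "blurry, distorted, low quality",
        "loras": [{"path": "example/char-lora", "scale": 0.8}],  # assumed shape
        "width": 1024,
        "height": 1024,
    },
    api_key="test-key",
)
```

Passing `req` to `urllib.request.urlopen` would submit it once the placeholder URL and key are replaced.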

Pricing

Output      Cost
Per image   $0.012
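
For budgeting, the per-image price translates directly into a run count. A small sketch using `Decimal` to avoid floating-point rounding; the function names are illustrative only.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.012")  # USD, from the pricing table


def images_for_budget(budget_usd: str) -> int:
    """Whole number of images a budget covers."""
    return int(Decimal(budget_usd) // PRICE_PER_IMAGE)


def cost_for_images(count: int) -> Decimal:
    """Total cost in USD for a given number of images."""
    return PRICE_PER_IMAGE * count


print(images_for_budget("1"))   # 83
print(cost_for_images(1000))    # 12.000
```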

Best Use Cases

  • Character Consistency — Use character LoRAs to maintain identity across multiple generations.
  • Brand Aesthetics — Apply brand-specific style LoRAs for consistent marketing visuals.
  • Art Style Transfer — Generate images in specific artistic styles trained into LoRAs.
  • Combined Styles — Layer multiple LoRAs for unique style combinations.
  • Rapid Iteration — Test different LoRA combinations quickly at low cost.

Pro Tips

  • Include your LoRA trigger words in the prompt for best activation.
  • Start with LoRA weight around 0.7-1.0, then adjust based on results.
  • Combine complementary LoRAs (e.g., character + style + lighting) for richer outputs.
  • Use the Prompt Enhancer to automatically improve your descriptions.
  • Keep the same seed when comparing different LoRA combinations.
  • Use negative_prompt to avoid common issues like "blurry, distorted, low quality".
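
The tips on fixed seeds and starting weights can be combined into a small sweep: hold the prompt and seed constant and vary only the LoRA weights, so that differences between outputs come from the weights alone. This is a sketch under assumptions: `lora_sweep` is a hypothetical helper, the LoRA names and the trigger word `mychar` are made up, and the `{"path", "scale"}` entry shape is not confirmed by this page.

```python
from itertools import product


def lora_sweep(prompt, lora_paths, weights, seed):
    """Yield one payload per weight combination, all sharing the same seed."""
    for combo in product(weights, repeat=len(lora_paths)):
        yield {
            "prompt": prompt,
            "negative_prompt": "blurry, distorted, low quality",
            "seed": seed,  # fixed so runs are comparable
            "loras": [
                {"path": path, "scale": weight}  # assumed entry shape
                for path, weight in zip(lora_paths, combo)
            ],
        }


payloads = list(
    lora_sweep(
        "portrait of mychar, oil painting",
        ["example/char-lora", "example/style-lora"],
        [0.7, 0.85, 1.0],
        seed=42,
    )
)
print(len(payloads))  # 9
```

Three weights across two LoRAs give nine runs, so a full sweep at the listed price stays inexpensive, but the count grows quickly with a third LoRA.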

Train Your Own LoRA

Want to create custom LoRAs for Z-Image? Use the Z-Image LoRA Trainer.

Related Models

  • Z-Image Base — Base model without LoRA support at $0.01 per image.
  • Z-Image Turbo — Faster generation optimized for sub-second inference.

Notes

  • Maximum of 3 LoRAs can be applied per generation.
  • LoRA weights typically range from 0.5 to 1.0 for best results.
  • enable_sync_mode is only available through the API, not in the web interface.