Z-Image Base
Z-Image Base is a 6-billion-parameter text-to-image model from Tongyi-MAI that generates photorealistic images, with optional reference image guidance. Provide a text prompt alone, or add a reference image to guide the composition, style, or subject. Generation costs $0.01 per image.
Why Choose This?
- Reference image guidance: Optionally provide a reference image to influence the generated output's composition, style, or subject matter.
- Flexible output sizing: Customize width and height up to 1024px for any aspect ratio you need.
- Strength control: Fine-tune how much the reference image influences the output with the strength parameter.
- Prompt Enhancer: Built-in tool to automatically improve your prompts for better results.
- Ultra-affordable: Just $0.01 per image, ideal for high-volume generation and experimentation.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the image you want to generate |
| negative_prompt | No | Elements to avoid in the output |
| image | No | Reference image to guide generation (upload or URL) |
| size | No | Preset size options |
| width | No | Output width in pixels (default: 1024) |
| height | No | Output height in pixels (default: 1024) |
| strength | No | How far the output may depart from the reference image, 0-1; lower values follow the reference more closely (default: 0.6) |
| seed | No | Random seed for reproducibility (default: -1 for random) |
| output_format | No | Output format: jpeg, png (default: jpeg) |
| enable_sync_mode | No | API only: wait for result before returning response |
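
The defaults and limits in the table can be captured in a small helper. The following is a minimal sketch, not the service's client library: `build_payload`, `DEFAULTS`, `MAX_SIDE`, and `ALLOWED_FORMATS` are hypothetical names, the lower bound of 1 pixel is an assumption (the source states only the 1024px maximum), and the function only builds a dictionary without sending any request.

```python
from typing import Any, Dict

# Defaults copied from the parameter table above.
DEFAULTS: Dict[str, Any] = {
    "width": 1024,
    "height": 1024,
    "strength": 0.6,
    "seed": -1,
    "output_format": "jpeg",
}
MAX_SIDE = 1024  # "up to 1024px" per the feature list
ALLOWED_FORMATS = ("jpeg", "png")


def build_payload(prompt: str, **overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge overrides into the defaults and validate."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")

    # Later keys win, so overrides replace the documented defaults.
    payload: Dict[str, Any] = {"prompt": prompt, **DEFAULTS, **overrides}

    for side in ("width", "height"):
        # The lower bound of 1 is an assumption; only the maximum is documented.
        if not 1 <= payload[side] <= MAX_SIDE:
            raise ValueError(f"{side} must be between 1 and {MAX_SIDE}")
    if not 0.0 <= payload["strength"] <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    if payload["output_format"] not in ALLOWED_FORMATS:
        raise ValueError("output_format must be 'jpeg' or 'png'")
    return payload
```

For example, `build_payload("a red fox in snow", width=768, negative_prompt="blurry")` keeps the default height, strength, seed, and output format while overriding the width.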
Strength Guide (with Reference Image)
- Lower values (0.2-0.4): Strong reference influence, output closely follows the reference image
- Medium values (0.5-0.7): Balanced blend of reference and prompt
- Higher values (0.8-1.0): Prompt dominates, reference serves as loose inspiration
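
The three bands above can be written as a lookup, which may help when logging or labelling experiments. This is an illustrative sketch only: `describe_strength` and `STRENGTH_BANDS` are hypothetical names, and because the guide does not say how values between bands (for example 0.45) or below 0.2 behave, the function returns `None` for them rather than guessing.

```python
from typing import Optional

# Bands copied from the strength guide above: (low, high, description).
STRENGTH_BANDS = (
    (0.2, 0.4, "strong reference influence"),
    (0.5, 0.7, "balanced blend of reference and prompt"),
    (0.8, 1.0, "prompt dominates"),
)


def describe_strength(strength: float) -> Optional[str]:
    """Return the guide's description, or None if the value falls outside the bands."""
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    for low, high, description in STRENGTH_BANDS:
        if low <= strength <= high:
            return description
    return None  # the guide does not describe this value
```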
How to Use
Text-to-Image (No Reference)
- Write your prompt — describe the image you want to create.
- Add negative prompt (optional) — specify what to avoid.
- Set dimensions — adjust width and height for your needs.
- Run — submit and download your image.
With Reference Image
- Upload a reference image — to guide the generation's composition or style.
- Write your prompt — describe the desired output.
- Adjust strength — control how much the reference influences the result.
- Run — submit and download your generated image.
Pricing
$0.01 per image.
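
At the $0.01 per-image price quoted in the feature list, the cost of a batch is a simple multiplication. The sketch below assumes a flat rate with no volume tiers or additional charges, which the source does not confirm; `batch_cost_usd` is a hypothetical helper, and it works in integer cents to avoid floating-point rounding.

```python
PRICE_CENTS_PER_IMAGE = 1  # $0.01 per image, as stated in the feature list


def batch_cost_usd(num_images: int) -> str:
    """Return the cost of a batch as a dollar string, computed in integer cents."""
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    total_cents = num_images * PRICE_CENTS_PER_IMAGE
    dollars, cents = divmod(total_cents, 100)
    return f"${dollars}.{cents:02d}"
```

Under that flat-rate assumption, 250 images come to `batch_cost_usd(250)`, which is `"$2.50"`.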
Best Use Cases
- Rapid Prototyping — Generate multiple concepts quickly at minimal cost.
- Style-guided Generation — Use reference images to maintain consistent aesthetics.
- Content Creation — Produce visuals for social media, blogs, and marketing.
- Creative Exploration — Experiment freely with different prompts and settings.
- Batch Generation — Create large volumes of images affordably.
Pro Tips
- Use the Prompt Enhancer to automatically improve your descriptions.
- For pure text-to-image, be specific about style, lighting, and composition.
- When using a reference image, start with strength around 0.6 and adjust based on results.
- Use negative_prompt to avoid common issues like "blurry, distorted, low quality".
- Keep the same seed to iterate on a specific composition while tweaking the prompt.
- Lower strength values make output follow the reference more closely; higher values give the prompt more creative freedom.
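
The fixed-seed tip can be sketched as a helper that produces one request body per prompt variant while holding everything else constant. `prompt_variants` is a hypothetical name, not part of any published client, and rejecting every negative seed is an assumption: the parameter table documents only -1 as meaning random.

```python
from typing import Any, Dict, List


def prompt_variants(
    base: Dict[str, Any], prompts: List[str], seed: int
) -> List[Dict[str, Any]]:
    """Hypothetical helper: one request body per prompt, all sharing one fixed seed."""
    if seed < 0:
        # -1 means "random" in the parameter table, which would defeat the purpose.
        raise ValueError("use a fixed non-negative seed to keep runs comparable")
    # Copy the base settings so the caller's dictionary is left untouched.
    return [{**base, "prompt": p, "seed": seed} for p in prompts]
```

Each returned dictionary differs only in its `prompt`, so differences between the resulting images can be attributed to the wording rather than to a new random seed.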
Notes
- When no image is provided, the model runs in pure text-to-image mode.
- The strength parameter only applies when a reference image is provided.
- enable_sync_mode is only available through the API, not in the web interface.
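
These rules can be applied to a request body before it is submitted. The following is a rough sketch under stated assumptions: `resolve_mode` and the `via_api` flag are hypothetical, the mode labels are illustrative strings rather than values the service returns, and dropping unused fields is a client-side tidy-up that the source does not require.

```python
from typing import Any, Dict, Tuple


def resolve_mode(
    payload: Dict[str, Any], via_api: bool = True
) -> Tuple[str, Dict[str, Any]]:
    """Hypothetical helper: apply the notes above to a request body."""
    cleaned = dict(payload)  # work on a copy
    has_image = bool(cleaned.get("image"))
    if not has_image:
        cleaned.pop("image", None)
        # strength only applies when a reference image is provided.
        cleaned.pop("strength", None)
    if not via_api:
        # enable_sync_mode is an API-only option.
        cleaned.pop("enable_sync_mode", None)
    mode = "reference-guided" if has_image else "text-to-image"
    return mode, cleaned
```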