Z Image Base

Playground

Z-Image-Base is a 6 billion-parameter text-to-image model with full CFG support. Supports negative prompting and fine-tuning capabilities for maximum control over image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Z-Image Base

Z-Image Base is a 6-billion parameter text-to-image model from Tongyi-MAI that generates photorealistic images with optional reference image guidance. Provide a text prompt alone, or add a reference image to guide the composition, style, or subject — all at an incredibly affordable price.

Why Choose This?

Reference image guidance Optionally provide a reference image to influence the generated output’s composition, style, or subject matter.
Flexible output sizing Customize width and height up to 1024px for any aspect ratio you need.
Strength control Fine-tune how much the reference image influences the output with the strength parameter.
Prompt Enhancer Built-in tool to automatically improve your prompts for better results.
Ultra-affordable Just $0.01 per image — perfect for high-volume generation and experimentation.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the image you want to generate
negative_prompt	No	Elements to avoid in the output
image	No	Reference image to guide generation (upload or URL)
size	No	Preset size options
width	No	Output width in pixels (default: 1024)
height	No	Output height in pixels (default: 1024)
strength	No	How much the reference image influences output, 0-1 (default: 0.6)
seed	No	Random seed for reproducibility (default: -1 for random)
output_format	No	Output format: jpeg, png (default: jpeg)
enable_sync_mode	No	API only: wait for result before returning response

Strength Guide (with Reference Image)

Lower values (0.2-0.4): Strong reference influence, output closely follows the reference image
Medium values (0.5-0.7): Balanced blend of reference and prompt
Higher values (0.8-1.0): Prompt dominates, reference serves as loose inspiration

How to Use

Text-to-Image (No Reference)

Write your prompt — describe the image you want to create.
Add negative prompt (optional) — specify what to avoid.
Set dimensions — adjust width and height for your needs.
Run — submit and download your image.

With Reference Image

Upload a reference image — to guide the generation’s composition or style.
Write your prompt — describe the desired output.
Adjust strength — control how much the reference influences the result.
Run — submit and download your generated image.

Pricing

Output	Cost
Per image	$0.01

Best Use Cases

Rapid Prototyping — Generate multiple concepts quickly at minimal cost.
Style-guided Generation — Use reference images to maintain consistent aesthetics.
Content Creation — Produce visuals for social media, blogs, and marketing.
Creative Exploration — Experiment freely with different prompts and settings.
Batch Generation — Create large volumes of images affordably.

Pro Tips

Use the Prompt Enhancer to automatically improve your descriptions.
For pure text-to-image, be specific about style, lighting, and composition.
When using a reference image, start with strength around 0.6 and adjust based on results.
Use negative_prompt to avoid common issues like “blurry, distorted, low quality”.
Keep the same seed to iterate on a specific composition while tweaking the prompt.
Lower strength values make output follow the reference more closely; higher values give the prompt more creative freedom.

Notes

When no image is provided, the model runs in pure text-to-image mode.
The strength parameter only applies when a reference image is provided.
enable_sync_mode is only available through the API, not in the web interface.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/z-image/base" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "negative_prompt": "",
    "size": "1024*1024",
    "strength": 0.6,
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
negative_prompt	string	No		-	The negative prompt for the generation. Describes what you don't want in the image.
image	string	No		-	URL of the reference image to guide the generation.
size	string	No	1024*1024	256 ~ 1536 per dimension	The size of the generated media in pixels (width*height).
strength	number	No	0.6	0.00 ~ 1.00	Controls the strength of the transformation. Higher values produce outputs more different from the input image.
seed	integer	No	-1	-1 ~ 2147483647	The random seed to use for the generation. -1 means a random seed will be used.
output_format	string	No	jpeg	jpeg, png, webp	The format of the output image.
enable_sync_mode	boolean	No	false	-	If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
enable_base64_output	boolean	No	false	-	If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Wan Flf2v Z Image Base LoRA