Z Image Base | High-Quality Text-to-Image API

Z-Image Base

Z-Image Base is a 6-billion parameter text-to-image model from Tongyi-MAI that generates photorealistic images with optional reference image guidance. Provide a text prompt alone, or add a reference image to guide the composition, style, or subject — all at an incredibly affordable price.

Why Choose This?

Reference image guidance Optionally provide a reference image to influence the generated output's composition, style, or subject matter.
Flexible output sizing Customize width and height up to 1024px for any aspect ratio you need.
Strength control Fine-tune how much the reference image influences the output with the strength parameter.
Prompt Enhancer Built-in tool to automatically improve your prompts for better results.
Ultra-affordable Just $0.01 per image — perfect for high-volume generation and experimentation.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the image you want to generate
image	No	Reference image to guide generation (upload or URL)
size	No	Preset size options
width	No	Output width in pixels (default: 1024)
height	No	Output height in pixels (default: 1024)
strength	No	How much the reference image influences output, 0-1 (default: 0.6)
seed	No	Random seed for reproducibility (default: -1 for random)
output_format	No	Output format: jpeg, png (default: jpeg)
enable_sync_mode	No	API only: wait for result before returning response

Strength Guide (with Reference Image)

Lower values (0.2-0.4): Strong reference influence, output closely follows the reference image
Medium values (0.5-0.7): Balanced blend of reference and prompt
Higher values (0.8-1.0): Prompt dominates, reference serves as loose inspiration

How to Use

Text-to-Image (No Reference)

Write your prompt — describe the image you want to create.
Set dimensions — adjust width and height for your needs.
Run — submit and download your image.

With Reference Image

Upload a reference image — to guide the generation's composition or style.
Write your prompt — describe the desired output.
Adjust strength — control how much the reference influences the result.
Run — submit and download your generated image.

Pricing

Output	Cost
Per image	$0.01

Best Use Cases

Rapid Prototyping — Generate multiple concepts quickly at minimal cost.
Style-guided Generation — Use reference images to maintain consistent aesthetics.
Content Creation — Produce visuals for social media, blogs, and marketing.
Creative Exploration — Experiment freely with different prompts and settings.
Batch Generation — Create large volumes of images affordably.

Pro Tips

Use the Prompt Enhancer to automatically improve your descriptions.
For pure text-to-image, be specific about style, lighting, and composition.
When using a reference image, start with strength around 0.6 and adjust based on results.
Keep the same seed to iterate on a specific composition while tweaking the prompt.
Lower strength values make output follow the reference more closely; higher values give the prompt more creative freedom.

Notes

When no image is provided, the model runs in pure text-to-image mode.
The strength parameter only applies when a reference image is provided.
enable_sync_mode is only available through the API, not in the web interface.

Z Image Base API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/z-image/base with your input as JSON. The endpoint returns a prediction id. Start polling the result endpoint around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. On completed, read output values from data.outputs. Examples for Z Image Base below.

HTTP example

set -euo pipefail

: "${WAVESPEED_API_KEY:?Set WAVESPEED_API_KEY}"

REQUEST_BODY=$(cat <<'JSON'
{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "strength": 0.6,
    "seed": -1,
    "output_format": "jpeg"
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/z-image/base" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d "$REQUEST_BODY")

TASK=$(printf '%s' "$SUBMIT_RESPONSE" | jq 'if has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "$TASK" | jq -r '.id')
if [ -z "$PREDICTION_ID" ] || [ "$PREDICTION_ID" = "null" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "$TASK" | jq -r '.urls.get // empty')
if [ -z "$RESULT_URL" ]; then
  RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/$PREDICTION_ID/result"
fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body "$RESULT_URL" \
    -H "Authorization: Bearer $WAVESPEED_API_KEY")
  RESULT=$(printf '%s' "$RESPONSE" | jq 'if has("data") then .data else . end')
  STATUS=$(printf '%s' "$RESULT" | jq -r '.status')
  case "$STATUS" in
    completed) printf '%s\n' "$RESULT" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "$RESULT" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "$STATUS" >&2; exit 1 ;;
  esac
done

Node.js example

const submitUrl = "https://api.wavespeed.ai/api/v3/wavespeed-ai/z-image/base";
const apiKey = process.env.WAVESPEED_API_KEY;
if (!apiKey) throw new Error('Set WAVESPEED_API_KEY');

async function requestJson(url, options = {}) {
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(await response.text());
  return response.json();
}

// 1. Submit the prediction.
const body = await requestJson(submitUrl, {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "strength": 0.6,
        "seed": -1,
        "output_format": "jpeg"
}),
});
const task = body.data ?? body;
if (!task.id) throw new Error("Submission response did not contain a prediction id");
const resultUrl = task.urls?.get ||
  `https://api.wavespeed.ai/api/v3/predictions/${task.id}/result`;

// 2. Poll until the prediction finishes.
while (true) {
  const resultBody = await requestJson(resultUrl, {
    headers: { "Authorization": `Bearer ${apiKey}` },
  });
  const result = resultBody.data ?? resultBody;
  if (result.status === "completed") {
    console.log(result.outputs);
    break;
  }
  if (["failed", "cancelled", "timeout"].includes(result.status)) throw new Error(JSON.stringify(result));
  if (!["created", "processing"].includes(result.status)) throw new Error("Unexpected status: " + result.status);
  await new Promise(resolve => setTimeout(resolve, 2000));
}

Python example

import json
import os
import time
from urllib.request import Request, urlopen

api_key = os.environ["WAVESPEED_API_KEY"]
headers = {"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"}
payload = {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "strength": 0.6,
    "seed": -1,
    "output_format": "jpeg"
}

def request_json(url, data=None):
    request = Request(url, data=data, headers=headers, method="POST" if data else "GET")
    with urlopen(request) as response:
        return json.load(response)

# 1. Submit the prediction.
body = request_json("https://api.wavespeed.ai/api/v3/wavespeed-ai/z-image/base", json.dumps(payload).encode())
task = body.get("data", body)
if not task.get("id"):
    raise RuntimeError("Submission response did not contain a prediction id")
result_url = task.get("urls", {}).get("get") or f"https://api.wavespeed.ai/api/v3/predictions/{task['id']}/result"

# 2. Poll until the prediction finishes.
while True:
    result_body = request_json(result_url)
    result = result_body.get("data", result_body)
    status = result.get("status")
    if status == "completed":
        print(result.get("outputs", []))
        break
    if status in {"failed", "cancelled", "timeout"}:
        raise RuntimeError(result)
    if status not in {"created", "processing"}:
        raise RuntimeError(f"Unexpected status: {status}")
    time.sleep(2)

Z Image Base API — Frequently asked questions

What is the Z Image Base API?

Z Image Base is a WaveSpeedAI model for image generation, exposed as a REST API on WaveSpeedAI. Z-Image-Base is a 6 billion-parameter text-to-image model with full CFG support. Supports fine-tuning capabilities for maximum control over image generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Z Image Base API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID. Poll the result endpoint starting around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. The playground generates production-oriented Python, JavaScript, and cURL examples with timeouts, transient-error handling, and safe GET retries. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/z-image-base.

How much does Z Image Base cost per run?

Z Image Base starts at $0.010 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Z Image Base accept?

Key inputs: `prompt`, `image`, `size`, `seed`, `enable_base64_output`, `enable_sync_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/z-image-base.

How long does Z Image Base take to generate?

Median end-to-end generation time on WaveSpeedAI is around 11 seconds per request, based on recent successful runs. Queue time varies with global demand; live status is visible in the prediction record.

Can I use Z Image Base outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

ExemplosVer todos

Modelos relacionados

README