Imagen 4 (Text-to-Image) — wavespeed-ai/imagen4
Imagen 4 is a high-quality text-to-image model for generating photorealistic and stylized images from prompts, with optional negative prompts for extra control. It’s well-suited for concept art, marketing creatives, portraits, and detailed scenes where you want clean composition and strong realism.
Key capabilities
- Text-to-image generation with strong realism and detail
- Optional negative_prompt to reduce unwanted artifacts or styles
- Reliable composition across common aspect ratios (e.g., 1:1, 16:9, 9:16)
- Supports multi-image output in a single run via num_images
Pricing
$0.04 per image.
Total cost = num_images × $0.04
Example: num_images = 4 → $0.16
How to use
- Write a prompt describing subject, setting, lighting, and style.
- Optionally add a negative_prompt listing what you want to avoid.
- Select aspect_ratio for your target layout.
- Choose num_images for how many variations you want.
- Set seed for reproducible results (optional), then generate.
Parameters
- prompt (required): The text description of what to generate
- negative_prompt (optional): What to avoid (artifacts, styles, objects, text, etc.)
- aspect_ratio: Output aspect ratio (e.g., 1:1, 16:9, 9:16)
- num_images: Number of images to generate per run
- seed: Fixed value for reproducibility; leave empty/random for variation
Prompting guide
A stable structure:
- Subject: who/what is in frame
- Scene: where + time + atmosphere
- Details: clothing, materials, environment cues
- Lighting: softbox, sunset, harsh noon, rim light
- Camera: close-up, wide shot, depth of field
- Style: photorealistic, cinematic, illustration, etc.
Example pattern:
A [shot type] of [subject] in [scene]. [Lighting + mood]. [Camera cues]. [Style cues].
Example prompts
- A raw, unflinching photograph of a weathered soldier in a desert trench, dust blowing across his helmet and gear, harsh sunlight, shallow depth of field, cinematic realism.
- Studio product photo of a minimalist watch on a matte surface, softbox lighting, crisp shadow, premium advertising look.
- Cozy café interior at night, warm tungsten lighting, rain on windows, film still composition, subtle grain.
Negative prompt examples
- blurry, low resolution, deformed hands, extra fingers, bad anatomy
- watermark, logo, text, caption, jpeg artifacts
- oversaturated, cartoonish, plastic skin, uncanny face
Best practices
- Use negative_prompt sparingly; focus your main prompt on what you want.
- Keep the first sentence concrete, then add camera/lighting constraints.
- Fix seed when iterating on prompt wording for controlled comparisons.