
text-to-image
Idle

Your request will cost $0.01 per run.
For $1 you can run this model approximately 100 times.
One more thing::



CogView-4 is Z.AI's high-quality text-to-image generation model designed to transform natural-language descriptions into precise, personalized visuals. It excels at interpreting user intent — producing images that accurately reflect your creative vision with strong compositional clarity and visual appeal.
Precise prompt understanding Accurately interprets detailed prompts to generate images that match your description — balancing subject, context, and style with strong fidelity.
Flexible quality modes Choose standard for fast results (5-10 seconds) or hd for richer detail and visual depth (~20 seconds).
Wide aspect ratio support Multiple presets from square to portrait, landscape, and ultra-wide formats for social, web, or print use.
Prompt Enhancer Built-in tool to automatically improve your prompts for better generation results.
Fast, reliable generation Optimized for quick turnaround with stable output quality — ideal for rapid ideation and creative iteration.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the image you want to generate |
| size | No | Output dimensions (default: 1024*1024) |
| quality | No | Rendering quality: standard or hd |
| Size | Orientation | Best For |
|---|---|---|
| 1024*1024 | Square | Social posts, avatars, album art |
| 768*1344 | Portrait | Mobile screens, stories, vertical banners |
| 864*1152 | Portrait | Mobile displays, vertical content |
| 1344*768 | Landscape | Web headers, presentations |
| 1152*864 | Landscape | Widescreen designs, banners |
| 1440*720 | Ultra-wide | Cinematic layouts, panoramic visuals |
| 720*1440 | Ultra-tall | Immersive vertical content |
| Item | Cost |
|---|---|
| Per image | $0.01 |
Simple flat-rate pricing regardless of size or quality settings.
Please ensure your prompts comply with content guidelines. If an error occurs, review your prompt and try again.