Vidu Text To Image Q2
Playground
Try it on WavespeedAI!Vidu Text-to-Image Q2 converts text prompts into high-quality images with exceptional visual detail and creative flexibility. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Features
vidu/text-to-image-q2 — High-resolution cinematic text-to-image
vidu/text-to-image-q2 is a high-end text-to-image model that focuses on clean composition, cinematic lighting, and high resolutions up to 4K. It’s built for scenarios where a single image has to carry a lot of visual weight: posters, key visuals, thumbnails, or product hero shots.
Why it’s useful
- Cinematic aspect ratios – choose from 1:1, 16:9, 9:16, 4:3, 3:4, 21:9, 2:3, 3:2 for social posts, banners, and vertical feeds.
- High resolutions (1080p → 4K) – generate images ready for large displays, detailed crops, or light print use.
- Prompt-driven style control – supports rich, descriptive prompts for mood, camera angle, lens type, lighting, and composition.
- Consistent structure and detail – strong global coherence makes it suitable for scenes with multiple elements and clear storytelling.
How to use
-
prompt* – describe the scene, subject, mood, and style you want (for example: “cinematic nighttime city street, shallow depth of field, dramatic lighting, 35mm film look”).
-
aspect_ratio – pick the framing:
- 1:1 for avatars, album covers, square posts
- 16:9 / 21:9 for cinematic or banner shots
- 9:16 for vertical / mobile content
- 4:3, 3:4, 2:3, 3:2 for more traditional photography ratios
-
resolution – choose the output quality:
- 1080p – fast preview and web-ready images
- 2K – higher detail for close-ups and cropping
- 4K – maximum sharpness and fidelity
-
Run the job, preview the result, and iterate on your prompt if needed.
Pricing
| Resolution | Price per image |
|---|---|
| 1080p | $0.90 |
| 2K | $1.20 |
| 4K | $1.50 |
Tips for best results
- Use specific, photography-style language (lens type, lighting, time of day, camera angle) to get more cinematic images.
- Pair aspect_ratio with prompt hints like “widescreen establishing shot” or “vertical social ad portrait” to guide composition.
- For 4K images, write slightly richer prompts (background, textures, materials) so the extra resolution is filled with meaningful detail.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/vidu/text-to-image-q2" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"aspect_ratio": "1:1",
"resolution": "1080p",
"seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The text prompt for generating the image. | |
| aspect_ratio | string | No | 1:1 | 1:1, 16:9, 9:16, 4:3, 3:4, 21:9, 2:3, 3:2 | The aspect ratio for the generated image. |
| resolution | string | No | 1080p | 1080p, 2K, 4K | The output resolution quality: 1080p (1920x1080), 2K (2560x1440), or 4K (3840x2160). |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |