Vidu Text To Image Q2

Playground

Vidu Text-to-Image Q2 converts text prompts into high-quality images with exceptional visual detail and creative flexibility. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

vidu/text-to-image-q2 — High-resolution cinematic text-to-image

vidu/text-to-image-q2 is a high-end text-to-image model that focuses on clean composition, cinematic lighting, and high resolutions up to 4K. It’s built for scenarios where a single image has to carry a lot of visual weight: posters, key visuals, thumbnails, or product hero shots.

Why it’s useful

Cinematic aspect ratios – choose from 1:1, 16:9, 9:16, 4:3, 3:4, 21:9, 2:3, 3:2 for social posts, banners, and vertical feeds.
High resolutions (1080p → 4K) – generate images ready for large displays, detailed crops, or light print use.
Prompt-driven style control – supports rich, descriptive prompts for mood, camera angle, lens type, lighting, and composition.
Consistent structure and detail – strong global coherence makes it suitable for scenes with multiple elements and clear storytelling.

How to use

prompt* – describe the scene, subject, mood, and style you want (for example: “cinematic nighttime city street, shallow depth of field, dramatic lighting, 35mm film look”).
aspect_ratio – pick the framing:
- 1:1 for avatars, album covers, square posts
- 16:9 / 21:9 for cinematic or banner shots
- 9:16 for vertical / mobile content
- 4:3, 3:4, 2:3, 3:2 for more traditional photography ratios
resolution – choose the output quality:
- 1080p – fast preview and web-ready images
- 2K – higher detail for close-ups and cropping
- 4K – maximum sharpness and fidelity
Run the job, preview the result, and iterate on your prompt if needed.

Pricing

Resolution	Price per image
1080p	$0.03
2K	$0.04
4K	$0.05

Tips for best results

Use specific, photography-style language (lens type, lighting, time of day, camera angle) to get more cinematic images.
Pair aspect_ratio with prompt hints like “widescreen establishing shot” or “vertical social ad portrait” to guide composition.
For 4K images, write slightly richer prompts (background, textures, materials) so the extra resolution is filled with meaningful detail.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/vidu/text-to-image-q2" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "1:1",
    "resolution": "1080p",
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The text prompt for generating the image.
aspect_ratio	string	No	1:1	1:1, 16:9, 9:16, 4:3, 3:4, 21:9, 2:3, 3:2	The aspect ratio for the generated image.
resolution	string	No	1080p	1080p, 2K, 4K	The output resolution quality: 1080p (1920x1080), 2K (2560x1440), or 4K (3840x2160).
seed	integer	No	-1	-1 ~ 2147483647	The random seed to use for the generation. -1 means a random seed will be used.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Vidu Template Halloween Vidu Text To Video