Google Veo3 Fast Image To Video

Playground

Google Veo3 Fast provides faster, more cost-effective Image-to-Video generation vs Veo 3, with commercial use allowed and $0.25/sec pricing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Google Veo 3 Fast — Image-to-Video (I2V) Model

Veo 3 Fast (I2V) is the high-speed image-to-video variant of Google’s Veo 3 generative suite. It transforms static images into cinematic 1080p motion clips with synchronized native audio — all in a fraction of the time and cost of standard Veo 3.

⚡ Why it stands out

From Still Image to Story Turn a single reference image into smooth, realistic motion sequences with natural lighting and perspective continuity.
High Speed, Low Cost Generates videos up to 30 % faster while using 80 % fewer credits, ideal for rapid creative iterations.
Cinematic Realism Produces expressive camera motion, atmospheric lighting, and lifelike character animation.
Native Audio Sync Automatically adds ambient sound, subtle effects, and music that match the visual rhythm — no post-production required.
Style & Identity Consistency Keeps subjects, color tone, and camera direction faithful to the uploaded image for coherent storytelling.

⚙️ Limits and Performance

Property	Description
Input	Single image + text prompt
Max duration	8 seconds
Resolution	Up to 1080p
Audio	Native synchronized dialogue, ambient, and music
Output format	MP4 with stereo audio

💰 Pricing

Every run needs $1.2 (both 720p and 1080p)

Without audio needs $0.8

✅ Commercial use allowed

🚀 How to Use

Upload an Image Provide a clear, well-lit source image — this defines your main subject and composition.
Write a Prompt Describe the desired motion, mood, and camera behavior.

Example: “Slow cinematic zoom out from the character as wind moves through the trees.”
Adjust Settings Choose the duration (up to 8 s) and resolution (up to 1080p).
Generate the Video Submit your job — Veo 3 Fast I2V automatically creates the motion and synchronized soundscape.
Preview & Download Review your result, refine your prompt if needed, and download the final MP4 file.

💡 Pro Tips

Use bright, high-contrast source images for clearer motion definition.
Keep prompts focused on one subject or action to ensure stability.
Add cinematic cues like “soft daylight,” “slow pan,” or “dramatic backlight” for stylistic control.
Avoid extreme or conflicting directions (e.g., “zoom in and out simultaneously”).
For multiple related clips, reuse the same source image for consistent appearance.

📝 Notes

Actual processing time varies depending on queue load and resolution.
The model is optimized for short, cinematic sequences and social-media content.
Ensure your uploaded image is clear, accessible, and properly licensed.
Please make sure your prompts comply with Google’s Safety Guidelines — if an error appears, revise your prompt and try again.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/google/veo3-fast/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "duration": 8,
    "resolution": "720p",
    "generate_audio": true
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	Text prompt for generation; Positive text prompt.
image	string	Yes		-	The image to use for the generation.
aspect_ratio	string	No	16:9	16:9, 9:16	Aspect ratio of the video.
duration	integer	No	8	8, 4, 6	The duration of the generated media in seconds.
resolution	string	No	720p	720p, 1080p	Video resolution.
generate_audio	boolean	No	true	-	Whether to generate audio.
negative_prompt	string	No		-	Negative prompt for the generation.
seed	integer	No	-	-1 ~ 2147483647	The random seed to use for the generation.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Google Veo3 Fast Google Veo3 Image To Video