Google Veo2 Image-to-Video creates high-quality videos with realistic motion, varied styles, and precise camera controls for cinematic results. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
$2.20 per run
Four friends drive through the desert in an old van, windows down, hair blowing in the wind. The camera switches between wide shots of the landscape and close-ups of their faces filled with joy. Warm, dusty tones, road-trip music vibe, lens flares, feeling of freedom.
A young woman stands on a rooftop at sunset, overlooking the sprawling cityscape of Los Angeles. She wears a vintage leather jacket and has headphones around her neck. The camera slowly pushes in as she closes her eyes, feeling the wind in her hair. Soft golden light, cinematic color grading, shallow depth of field, lens flares, 4K film grain.
A female traveler explores a colorful street market in Marrakech. She smiles as she interacts with local vendors, vibrant fabrics and spices in the background. Natural daylight, handheld camera movement, subtle film look, realistic ambient sounds, documentary tone.
A slow-motion, low-angle shot of a male basketball player dribbling a ball on an urban outdoor court at sunset. Sweat drips from his face as he prepares for a powerful dunk. The dramatic backlighting creates a strong silhouette. Energetic, highly detailed, commercial aesthetic.
A man with a beard sits quietly by a forest lake at dawn, sipping coffee from a tin mug. Mist rises from the water, birds chirp in the background. Warm tones, minimal movement, shallow focus on his face, natural lighting, peaceful and contemplative mood.
A teenage girl stands outside a high school building in the late afternoon sun, holding a backpack and laughing with her friends. The camera captures her from behind, lens flare streaking through the frame. Autumn leaves fall slowly in the breeze. Soft focus, warm colors, light film grain, nostalgic 90s teen movie feel.
A young woman sits alone at a small café terrace in Paris, writing in a notebook. A croissant and coffee rest on the table. Morning sun filters through the trees onto her face. The camera slowly zooms in from across the street. Natural light, muted pastel tones, minimal background noise, elegant and quiet mood.
A man in a red coat stands perfectly still in the center of a vast salt flat, with only the sky and reflection below him. The camera rises slowly into a top-down view. Stark geometry, high contrast, minimalist color palette, surreal and contemplative, experimental art film tone.
A wide shot of a young couple enjoying a meal on the balcony of their apartment in a futuristic megacity. Below them, streams of flying vehicles zip between towering, bio-luminescent skyscrapers. The camera is static, contrasting the quiet, intimate moment with the bustling city below. Photorealistic, 8K.
A teenager in a graphic hoodie sits in his bedroom filled with colorful designer toys and figurines. He picks up a glossy collectible bearbrick and places it on a rotating display. Bright LED lighting, sharp colors, product-focused slow zoom-ins, trendy music-driven rhythm, urban visual style.
Google Veo2 on WaveSpeedAI empowers you to create high-quality videos from both text prompts and static images, leveraging Google's advanced AI for next-generation content creation.
Veo2 on WaveSpeedAI is designed for seamless text-to-video and image-to-video generation. It understands real-world physics, human movement, and cinematic techniques, making it ideal for creators and developers who want to produce visually stunning, dynamic videos at scale.
To get the best results with Veo2 on WaveSpeedAI, try these strategies:
Shot framing: close-up, two shot, over-the-shoulder
Lens and focus: macro lens, shallow focus, wide-angle lens
Genre: sci-fi, romantic comedy, action movie
Camera movement: zoom shot, dolly shot, tracking shot, pan shot
Example prompt: A close-up shot of melting icicles on a frozen rock wall, with cool blue tones and a zoom-in camera movement, capturing the detailed motion of water drips.
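The strategies above amount to combining a subject with a framing, a lens or focus term, a genre, and a camera movement. A minimal sketch of that idea follows; `compose_prompt` and its parameter names are hypothetical helpers for illustration, not part of any WaveSpeedAI or Google SDK, and the sentence template is only one possible phrasing.

```python
def compose_prompt(subject, framing="", lens="", genre="", camera_motion="", extras=()):
    """Join a subject with optional cinematic descriptors into one prompt string.

    All parameter names are illustrative; the model only ever sees the final text.
    """
    # Lead with the framing so the shot type is stated first.
    opening = f"A {framing} shot of {subject}" if framing else subject

    # Keep only the descriptors that were actually supplied.
    candidates = (
        lens,
        f"{genre} style" if genre else "",
        f"{camera_motion} camera movement" if camera_motion else "",
        *extras,
    )
    descriptors = [d for d in candidates if d]

    if not descriptors:
        return opening + "."
    return f"{opening}, with " + ", ".join(descriptors) + "."


if __name__ == "__main__":
    print(
        compose_prompt(
            "melting icicles on a frozen rock wall",
            framing="close-up",
            camera_motion="zoom-in",
            extras=("cool blue tones",),
        )
    )
```

Keeping the descriptors as separate arguments makes it easy to vary one dimension (say, the camera movement) while holding the rest of the prompt fixed.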
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/google/veo2/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Veo2 Image To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/google/veo2/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "aspect_ratio": "16:9",
    "duration": 5,
    "resolution": "720p",
    "enable_prompt_expansion": true,
    "negative_prompt": "blurry, low quality, distorted",
    "seed": 0
  }'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].
// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// await is not allowed at the top level of a CommonJS script, so wrap the call.
async function main() {
  const result = await client.run("google/veo2/image-to-video", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "aspect_ratio": "16:9",
    "duration": 5,
    "resolution": "720p",
    "enable_prompt_expansion": true,
    "negative_prompt": "blurry, low quality, distorted",
    "seed": 0
  });
  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "google/veo2/image-to-video",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "aspect_ratio": "16:9",
        "duration": 5,
        "resolution": "720p",
        "enable_prompt_expansion": True,
        "negative_prompt": "blurry, low quality, distorted",
        "seed": 0,
    },
)
print(output["outputs"][0])  # → URL of the generated output
Veo2 Image To Video is a Google model for video generation from images, exposed as a REST API on WaveSpeedAI. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/google/google-veo2-image-to-video.
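For readers not using an SDK, the submit-then-poll flow described above can be sketched with the Python standard library. This is a hedged illustration, not the official client: the result URL and the `data.outputs[0]` path come from this page, while the `data.status` location, the `"failed"` status value, the `error` field, and the default interval and timeout are assumptions to check against the API reference. `fetch_result` and `wait_for_output` are hypothetical helper names.

```python
import json
import time
import urllib.request

# Result endpoint as shown in the cURL example on this page.
RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id, api_key):
    """GET the prediction record and return the decoded JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def wait_for_output(request_id, api_key, fetch=fetch_result,
                    interval=2.0, timeout=300.0, sleep=time.sleep):
    """Poll until the prediction completes, then return the first output URL.

    `fetch` and `sleep` are injectable so the loop can be exercised offline.
    """
    deadline = time.monotonic() + timeout
    while True:
        # Assumption: status and outputs live under the top-level "data" key.
        data = fetch(request_id, api_key).get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed failure status name
            raise RuntimeError(f"prediction {request_id} failed: {data.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction {request_id} not finished after {timeout}s")
        sleep(interval)
```

A typical call would be `wait_for_output(prediction_id, os.environ["WAVESPEED_API_KEY"])`, where `prediction_id` is the id returned by the submission request.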
Veo2 Image To Video starts at $2.20 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `aspect_ratio`, `resolution`, `duration`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/google/google-veo2-image-to-video.
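A small helper can catch misspelled field names before a paid request is sent. The sketch below only knows the field names that appear on this page (the key inputs plus `enable_prompt_expansion` and `negative_prompt` from the request example). Treating `prompt` and `image` as required is an assumption, and allowed values are deliberately not checked because they are defined by the JSON schema, not by this page. `build_payload` is a hypothetical name.

```python
import json

# Optional field names seen on this page; the authoritative list is the JSON schema.
OPTIONAL_FIELDS = (
    "aspect_ratio",
    "resolution",
    "duration",
    "seed",
    "enable_prompt_expansion",
    "negative_prompt",
)


def build_payload(prompt, image, **options):
    """Return a request body dict, rejecting field names this sketch does not know."""
    unknown = sorted(set(options) - set(OPTIONAL_FIELDS))
    if unknown:
        raise ValueError(f"unknown input field(s): {', '.join(unknown)}")
    if not prompt.strip():
        raise ValueError("prompt must not be empty")

    # Assumption: prompt and image are the required inputs.
    payload = {"prompt": prompt, "image": image}
    # Omit options left as None so the server-side defaults apply.
    payload.update({k: v for k, v in options.items() if v is not None})
    return payload


if __name__ == "__main__":
    body = build_payload(
        "A cinematic shot of a city at sunset, soft golden light",
        "https://example.com/your-input.jpg",
        aspect_ratio="16:9",
        duration=5,
        resolution="720p",
        seed=0,
    )
    print(json.dumps(body, indent=2))
```

The resulting dict can be serialized with `json.dumps` and sent as the body of the POST request.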
Average end-to-end generation time on WaveSpeedAI is around 45 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Google). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.