WaveSpeedAI APIGoogleGoogle Veo3 Fast Image To Video

Google Veo3 Fast Image To Video

Google Veo3 Fast Image To Video

Playground

Try it on WavespeedAI!

Google Veo3 Fast provides faster, more cost-effective Image-to-Video generation vs Veo 3, with commercial use allowed and $0.25/sec pricing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Google Veo 3 Fast — Image-to-Video (I2V) Model

Veo 3 Fast (I2V) is the high-speed image-to-video variant of Google’s Veo 3 generative suite. It transforms static images into cinematic 1080p motion clips with synchronized native audio — all in a fraction of the time and cost of standard Veo 3.


⚡ Why it stands out

  • From Still Image to Story Turn a single reference image into smooth, realistic motion sequences with natural lighting and perspective continuity.

  • High Speed, Low Cost Generates videos up to 30 % faster while using 80 % fewer credits, ideal for rapid creative iterations.

  • Cinematic Realism Produces expressive camera motion, atmospheric lighting, and lifelike character animation.

  • Native Audio Sync Automatically adds ambient sound, subtle effects, and music that match the visual rhythm — no post-production required.

  • Style & Identity Consistency Keeps subjects, color tone, and camera direction faithful to the uploaded image for coherent storytelling.


⚙️ Limits and Performance

PropertyDescription
InputSingle image + text prompt
Max duration8 seconds
ResolutionUp to 1080p
AudioNative synchronized dialogue, ambient, and music
Output formatMP4 with stereo audio

💰 Pricing

Every run needs $1.2 (both 720p and 1080p)

Without audio needs $0.8

✅ Commercial use allowed


🚀 How to Use

  1. Upload an Image Provide a clear, well-lit source image — this defines your main subject and composition.

  2. Write a Prompt Describe the desired motion, mood, and camera behavior.

    Example: “Slow cinematic zoom out from the character as wind moves through the trees.”

  3. Adjust Settings Choose the duration (up to 8 s) and resolution (up to 1080p).

  4. Generate the Video Submit your job — Veo 3 Fast I2V automatically creates the motion and synchronized soundscape.

  5. Preview & Download Review your result, refine your prompt if needed, and download the final MP4 file.


💡 Pro Tips

  • Use bright, high-contrast source images for clearer motion definition.
  • Keep prompts focused on one subject or action to ensure stability.
  • Add cinematic cues like “soft daylight,” “slow pan,” or “dramatic backlight” for stylistic control.
  • Avoid extreme or conflicting directions (e.g., “zoom in and out simultaneously”).
  • For multiple related clips, reuse the same source image for consistent appearance.

📝 Notes

  • Actual processing time varies depending on queue load and resolution.
  • The model is optimized for short, cinematic sequences and social-media content.
  • Ensure your uploaded image is clear, accessible, and properly licensed.
  • Please make sure your prompts comply with Google’s Safety Guidelines — if an error appears, revise your prompt and try again.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/google/veo3-fast/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "duration": 8,
    "resolution": "720p",
    "generate_audio": true
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-Text prompt for generation; Positive text prompt.
imagestringYes-The image to use for the generation.
aspect_ratiostringNo16:916:9, 9:16Aspect ratio of the video.
durationintegerNo88, 4, 6The duration of the generated media in seconds.
resolutionstringNo720p720p, 1080pVideo resolution.
generate_audiobooleanNotrue-Whether to generate audio.
negative_promptstringNo-Negative prompt for the generation.
seedintegerNo--1 ~ 2147483647The random seed to use for the generation.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.