Google Veo3.1 Fast Reference To Video
Playground
Try it on WavespeedAI!Google Veo 3.1 Fast Reference to Video is a fast AI reference-to-video generation model that creates 8-second videos from up to three reference images using the official Veo predictLongRunning endpoint with referenceImages assets. Ready-to-use REST inference API for product videos, character consistency, branded visual storytelling, social media clips, advertising creatives, and professional reference-based video generation workflows with simple integration, no coldstarts, and affordable pricing.
Features
Google Veo 3.1 Fast Reference-to-Video
Google Veo 3.1 Fast Reference-to-Video generates an 8-second video guided by up to three reference images and a text prompt. It is designed for subject, object, and product consistency, making it useful for character-led shots, product motion, style-guided generation, and other reference-driven video workflows.
Why Choose This?
-
Reference-guided generation Use up to three reference images to preserve subject, object, or product identity in the generated video.
-
Fast Veo workflow Built on Google Veo 3.1 Fast for quicker turnaround and efficient iteration.
-
Consistent 8-second output Generates a fixed-length
8sMP4, making duration predictable for planning and pricing. -
Flexible aspect ratio Supports both
16:9and9:16for landscape and vertical video use cases. -
Optional audio generation Enable
generate_audiowhen you want the output to include generated sound. -
Simple pricing Pricing depends only on
resolutionand whether audio generation is enabled.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Motion, scene, and camera instructions. |
| images | Yes | 1–3 reference images. These are sent as asset reference images. |
| aspect_ratio | No | 16:9 or 9:16. Default: 16:9. |
| resolution | No | 720p or 1080p. Default: 720p. |
| generate_audio | No | Whether to generate audio. Default: false. |
| negative_prompt | No | Things to avoid in the video. |
| seed | No | Random seed for reproducibility. |
How to Use
- Upload your reference images — provide
1–3images for subject, style, or product guidance. - Write your prompt — describe the motion, scene progression, camera movement, and overall visual intent.
- Set aspect ratio — choose
16:9for landscape or9:16for vertical output. - Choose resolution — use
720pfor lower cost or1080pfor higher quality. - Enable audio (optional) — turn on
generate_audioif you want generated sound in the result. - Add a negative prompt (optional) — describe elements or artifacts you want to avoid.
- Set a seed (optional) — use a fixed seed for more reproducible outputs.
- Submit — run the model and download the generated 8-second video.
Example Prompt
A cinematic product reveal of the same luxury watch from the reference images, rotating slowly on a reflective black surface, dramatic studio lighting, soft camera push-in, premium commercial style
Pricing
This model generates a fixed 8-second video.
| Mode | Cost |
|---|---|
| 720p without audio | $0.64 |
| 720p with audio | $0.80 |
| 1080p without audio | $0.80 |
| 1080p with audio | $0.96 |
Billing Rules
- Output length is fixed at 8 seconds
720pwithout audio costs $0.64720pwith audio costs $0.801080pwithout audio costs $0.801080pwith audio costs $0.96- Pricing depends only on
resolutionandgenerate_audio aspect_ratio,negative_prompt,seed, and the number of reference images do not affect pricing
Best Use Cases
- Product motion videos — Generate controlled product shots from reference images.
- Character consistency — Keep the same subject identity across a short generated clip.
- Style-guided generation — Use references to anchor visual style, mood, or composition.
- Marketing creatives — Produce short polished clips for ads, social media, and promotional assets.
- Vertical content — Generate
9:16outputs for short-form mobile platforms.
Pro Tips
- Use clear, high-quality reference images for stronger identity preservation.
- Keep the reference images visually consistent when you want the subject or product to remain stable.
- Be specific in your prompt about motion, camera movement, and scene intent.
- Use
negative_promptto reduce unwanted style drift or artifacts. - Enable audio only when you actually need it, since it changes pricing.
- Reuse the same
seedwhen you want more reproducible generations.
Notes
- Both
promptandimagesare required. - This workflow supports up to 3 reference images.
- The model uses the official Veo 3.1 Fast long-running generation flow with
referenceImagesadded to the request payload. - Output duration is fixed at 8 seconds.
generate_audiodefaults tofalse.
Related Models
- Other Google Veo 3.1 Fast video generation variants may be useful when you need text-to-video or non-reference workflows.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/google/veo3.1-fast/reference-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"aspect_ratio": "16:9",
"resolution": "720p",
"generate_audio": false,
"seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | Positive prompt describing the desired video motion, scene, and camera behavior. | |
| images | array | Yes | [] | 1 ~ 3 items | Reference images used as asset references. Upload 1 to 3 PNG, JPEG, JPG, or WebP images. |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16 | Aspect ratio of the generated video. |
| resolution | string | No | 720p | 720p, 1080p | Video resolution. |
| generate_audio | boolean | No | false | - | Whether to generate audio. |
| negative_prompt | string | No | - | Negative prompt for the generation. | |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |