Google Veo3.1 Lite Text To Video
Playground
Try it on WavespeedAI!Google Veo 3.1 Lite Text-to-Video generates high-fidelity 720p or 1080p videos with natively generated audio from text prompts. Lightweight variant optimized for cost efficiency. Supports landscape and portrait aspect ratios, dialogue with lip-sync, and customizable duration. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Features
Veo 3.1 Lite Text-to-Video
Veo 3.1 Lite is Google’s efficient text-to-video model, generating high-quality cinematic video from natural language prompts at accessible pricing. With optional synchronized audio generation, negative prompt control, multiple aspect ratios, and resolution up to 1080p, it delivers strong results for a wide range of creative and production workflows.
- Need maximum quality? Try Veo 3 Text-to-Video
Why Choose This?
-
High-quality video generation Produces detailed, visually coherent video with accurate motion, lighting, and scene composition from text descriptions.
-
Optional audio generation Enable generate_audio to produce synchronized ambient sound and atmosphere alongside the video.
-
Negative prompt support Specify what you don’t want in the video for more precise control over the output.
-
Resolution options Generate at 720p or 1080p to match your quality and budget requirements.
-
Flexible aspect ratios Supports multiple orientations for social, cinematic, and broadcast formats.
-
Reproducible results Use the seed parameter to lock in a specific output for exact reproduction.
-
Prompt Enhancer Built-in tool to automatically improve your scene descriptions for richer output.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | No | Text description of the scene, motion, camera style, and atmosphere. |
| aspect_ratio | No | Output aspect ratio. Default: 16:9. |
| duration | No | Clip length in seconds. Default: 6. |
| resolution | No | Output resolution: 720p or 1080p. Default: 1080p. |
| generate_audio | No | Whether to generate synchronized audio for the video. Default: enabled. |
| negative_prompt | No | Elements to exclude from the generated video. |
| seed | No | Random seed for reproducible results. |
How to Use
- Write your prompt — describe the scene, characters, camera movement, lighting, and atmosphere. Use the Prompt Enhancer for better results.
- Select aspect ratio — choose the format that fits your target platform.
- Set duration — choose your desired clip length in seconds.
- Select resolution — 720p for standard output, 1080p for higher-quality results.
- Enable audio (optional) — generate_audio is enabled by default; disable it if you only need video.
- Add negative prompt (optional) — specify elements you want to exclude from the output.
- Set seed (optional) — fix the seed to reproduce a specific result in future runs.
- Submit — generate, preview, and download your video.
Pricing
| Resolution | Cost per 6s |
|---|---|
| 720p | $0.30 |
| 1080p | $0.48 |
Billing Rules
- 720p: $0.05 per second
- 1080p: $0.08 per second (1.6× base rate)
Best Use Cases
- Cinematic Storytelling — Render atmospheric, narrative-driven scenes from detailed text descriptions.
- Social Media Content — Generate platform-optimized video clips with synchronized audio.
- Marketing & Advertising — Produce high-quality promotional video content without a film crew.
- Concept Visualization — Bring creative ideas and moods to life quickly for pitching and review.
- Audio-Visual Content — Use audio generation for fully immersive ambient scenes.
Pro Tips
- Be specific in your prompt — include camera angle, lighting style, subject behavior, and atmosphere.
- Use negative_prompt to avoid common artifacts like unwanted motion or specific visual elements.
- Disable generate_audio if you plan to add your own soundtrack in post-production.
- Use 720p to rapidly test and iterate on prompts before committing to a 1080p final render.
- Fix the seed once you find a result you like to iterate consistently across resolution changes.
Notes
- All parameters are optional — the model can generate video from an empty prompt.
- 4K output is not supported on this model.
- You will only be charged if your video is successfully generated.
- Please follow Google’s content usage policies when crafting prompts.
Related Models
- Veo 3 Text-to-Video — Google’s flagship Veo 3 model for maximum video quality.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/google/veo3.1-lite/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"aspect_ratio": "16:9",
"duration": 6,
"resolution": "720p"
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | Text prompt for generation; Positive text prompt. | |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16 | Aspect ratio of the video. |
| duration | integer | No | 6 | 8, 6, 4 | The duration of the generated media in seconds. |
| resolution | string | No | 720p | 720p, 1080p | Video resolution. |
| negative_prompt | string | No | - | Negative prompt for the generation. | |
| seed | integer | No | - | -1 ~ 2147483647 | The random seed to use for the generation. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |