Vidu One-Click V2 MV
Vidu One-Click V2 MV is an AI video generation model that creates videos from images and audio. Upload your reference images and audio track, and the model generates a synchronized video with smooth motion and cinematic transitions — with optional subtitle support.
Why Choose This?
-
Image + audio driven
Combine images and audio to generate videos with synchronized visuals and sound.
-
Multi-image support
Add multiple images to guide video generation across different scenes or perspectives.
-
Audio-synced duration
Video length is automatically determined by your audio track.
-
Subtitle generation
Optionally add synchronized subtitles to your video.
-
Flexible output
Support for multiple aspect ratios (16:9, 9:16, etc.) and resolutions up to 1080p.
-
Prompt Enhancer
Built-in tool to automatically improve your prompts for better results.
Parameters
| Parameter | Required | Description |
|---|
| images | Yes | Reference images (click "+ Add Item" for multiple) |
| audio | Yes | Audio track (determines video length) |
| prompt | No | Text description to guide visual style and motion |
| aspect_ratio | No | Output aspect ratio: 16:9, 9:16, etc. |
| resolution | No | Output quality: 720p (default), 1080p |
| add_subtitle | No | Enable subtitle generation |
How to Use
- Upload your images — add one or more reference images by clicking "+ Add Item".
- Upload your audio — the audio track that determines video duration.
- Write your prompt (optional) — describe the visual style, mood, or motion.
- Set aspect ratio — choose based on your target platform.
- Select resolution — 720p for faster generation, 1080p for higher quality.
- Enable subtitles (optional) — check if you need text overlay.
- Run — submit and download your video.
Pricing
| Resolution | Cost per 5 seconds |
|---|
| 540p | $0.15 |
| 720p | $0.20 |
| 1080p | $0.25 |
Billing Rules
- Base rate: $0.25 per 5 seconds (at 1080p)
- Resolution multiplier: 540p = 0.6×, 720p = 0.8×, 1080p = 1×
- Duration: Determined by audio length
Best Use Cases
- Talking Head Videos — Generate presenter-style videos with audio narration.
- Social Media Content — Create engaging video content for various platforms.
- Promotional Videos — Produce video clips with voiceover or background audio.
- Storytelling — Combine multiple images into narrative video sequences.
- Content Localization — Generate videos with different audio tracks and subtitles.
Pro Tips
- Use high-quality images that match the style you want in the video.
- Add multiple images to create visual variety throughout the video.
- Match aspect ratio to your target platform: 16:9 for YouTube, 9:16 for TikTok/Reels.
- Enable subtitles when your audio contains speech.
- Start with 720p for drafts, upgrade to 1080p for final production.
Notes
- Video duration is determined by the length of your audio track.
- Multiple images help create more dynamic videos.
- Ensure uploaded image and audio URLs are publicly accessible.
Related Models
- Vidu Reference-to-Video Q2 — Transform reference images into expressive cinematic videos.
- Vidu Image-to-Video Q2 Pro — High-quality image to video generation.