Seedance 2.0 15% OFF | Create in Video Generator →
Vidu Models

Vidu Models

Shengshu's Vidu offers comprehensive AI video generation solutions with multiple specialized models and precise creative control.

Shengshu's Vidu offers comprehensive AI video generation solutions with multiple specialized models and precise creative control.

All models

38 models
vidu/q3/drama-clip
image-to-video

vidu/q3/drama-clip

Vidu Q3 Drama Clip generates 8-12 second script-driven drama videos from structured assets, including characters, scenes, and tools. It is ideal for compact story scenes, storyboard shots, and focused narrative moments. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-ad
image-to-video

vidu/q3-ad

Vidu Q3 Ad Video generates commercial ad videos from 1 to 7 reference images with prompt guidance, supporting 720P / 1080P output and synchronized audio for product ads, brand campaigns, marketing creatives, and promotional videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/drama
image-to-video

vidu/q3/drama

Vidu Q3 Drama generates complete script-driven drama videos from scripts and structured assets, including characters, scenes, tools, and references. It plans the narrative structure, scene pacing, and transitions to create a story-driven drama in one request, supporting up to 180 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/image-to-video
image-to-video

vidu/q3/image-to-video

Vidu Q3 Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/text-to-video
text-to-video

vidu/q3/text-to-video

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-turbo/image-to-video
image-to-video

vidu/q3-turbo/image-to-video

Vidu Q3 Turbo Image-to-Video animates static images with high-quality motion and faster processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-pro/image-to-video
image-to-video

vidu/q3-pro/image-to-video

Vidu Q3 Pro Image-to-Video animates still images with high-quality motion via viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3/image-to-video-pro
image-to-video

vidu/q3/image-to-video-pro

Vidu Q3 Image-to-Video Pro generates high-resolution videos (720p/1080p/2K/4K) from images with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/reference-to-video
image-to-video

vidu/q3/reference-to-video

Vidu Q3 Reference-to-Video Mix generates multi-entity consistent videos from 1-4 reference images with text prompt guidance. Supports 360p to 1080p resolutions, up to 16 seconds duration, multiple aspect ratios, and optional audio generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/start-end-to-video
image-to-video

vidu/q3/start-end-to-video

Vidu Q3 Start End Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-turbo/start-end-to-video
image-to-video

vidu/q3-turbo/start-end-to-video

Vidu Q3 Turbo Start-End-to-Video creates smooth transitions between two images with faster processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-pro/start-end-to-video
image-to-video

vidu/q3-pro/start-end-to-video

Vidu Q3 Pro Start-End-to-Video creates smooth transitions between two keyframes with viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3-pro/text-to-video
text-to-video

vidu/q3-pro/text-to-video

Vidu Q3 Pro Text to Video is a fast AI video generation model that creates high-quality, audio-capable videos from text prompts with support for 1–16 second outputs. Ready-to-use REST inference API for cinematic clips, advertising creatives, social media videos, product visuals, storytelling, and professional text-to-video workflows with simple integration, no coldstarts, and affordable pricing.

vidu/image-to-video-2.0
image-to-video

vidu/image-to-video-2.0

Vidu Image to Video 2.0 converts images into smooth-transition videos with exceptional visual quality and diverse, natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-2.0
image-to-video

vidu/reference-to-video-2.0

Vidu Reference-to-Video 2.0 turns references into videos that preserve characters, objects, and environments with Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-2.0
image-to-video

vidu/start-end-to-video-2.0

Vidu Start-End to Video 2.0 generates smooth transition videos interpolating between given start and end images for natural morphing effects. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video
image-to-video

vidu/image-to-video

Vidu Image-to-Video converts images into smooth-transition videos with high visual quality and diverse motion for cinematic results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video
text-to-video

vidu/text-to-video

Vidu Text to Video converts text prompts into high-quality 720p videos with exceptional visual fidelity and diverse motion dynamics. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video
image-to-video

vidu/start-end-to-video

Vidu Start-End to Video converts a start and end image into a smooth transition Image-to-Video clip that morphs scenes seamlessly. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q2-pro
image-to-video

vidu/image-to-video-q2-pro

Vidu Q2 Pro turns a single still image into smooth, cinematic image-to-video with stable motion, clean edges, and consistent lighting. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q2-turbo
image-to-video

vidu/image-to-video-q2-turbo

Vidu Q2 Turbo Image-to-Video turns a single image into smooth, cinematic motion with fast, high-quality output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-2.0
text-to-video

vidu/text-to-video-2.0

Vidu Text-to-Video 2.0 converts text prompts into high-quality 720p videos with exceptional visual detail and diverse motion dynamics. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/one-click-v2/mv
audio-to-video

vidu/one-click-v2/mv

Vidu One-Click V2 MV transforms images and audio into videos with camera movements and subtitle support. Create professional video content with dynamic shots and text overlays in one click. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-pro/image-to-video-fast
image-to-video

vidu/q2-pro/image-to-video-fast

Vidu Q2 Pro Fast Image to Video generates high-quality videos from a single image with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-pro/start-end-to-video-fast
image-to-video

vidu/q2-pro/start-end-to-video-fast

Vidu Q2 Pro Fast Start-End to Video generates smooth video transitions between start and end images with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/reference-to-image-q2
image-to-image

vidu/reference-to-image-q2

Vidu Reference-to-Image Q2 generates high-quality images from 1–7 reference images plus a text prompt, preserving style and composition while allowing controlled changes to subjects, backgrounds, and fine details. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-image-q2
text-to-image

vidu/text-to-image-q2

Vidu Text-to-Image Q2 converts text prompts into high-quality images with exceptional visual detail and creative flexibility. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-q2
text-to-video

vidu/text-to-video-q2

Vidu Q2 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/template/halloween
video-effects

vidu/template/halloween

Vidu Halloween Templates delivers ready-made image and video templates for spooky promos and event invites with overlays. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-q2
image-to-video

vidu/reference-to-video-q2

Vidu Q2 is an Image-to-Video and Reference-to-Video model that emphasizes subtle facial expressions and smooth push-pull camera moves for natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-turbo/extend-video
video-extend

vidu/q2-turbo/extend-video

Vidu Q2 Turbo Extend Video seamlessly extends existing videos by 1-7 seconds with consistent motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-pro/extend-video
video-extend

vidu/q2-pro/extend-video

Vidu Q2 Pro Extend Video seamlessly extends existing videos by 1-7 seconds with high-quality motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q2-turbo
image-to-video

vidu/start-end-to-video-q2-turbo

Vidu Q2 Turbo Start-End to Video creates smooth Image-to-Video transitions between start and end images with fast high-quality results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q2-pro
image-to-video

vidu/start-end-to-video-q2-pro

Vidu Q2 Pro Start-End to Video produces smooth image-to-video transitions between start and end images for seamless morphs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-q1
text-to-video

vidu/text-to-video-q1

Vidu Text-to-Video Q1 converts text prompts into high-quality videos with exceptional visual fidelity and motion diversity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q1
image-to-video

vidu/image-to-video-q1

Vidu Image-to-Video creates smooth transition videos from specified start and end images, producing seamless image-to-video outputs for presentations and storytelling. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q1
image-to-video

vidu/start-end-to-video-q1

Vidu Q1 Start-End To Video turns specified start and end images into smooth image-to-video transitions for morphs and scene fades. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-q1
image-to-video

vidu/reference-to-video-q1

Generate videos from reference images while keeping characters, objects, and scene identity consistent using Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Vidu Models

Special Models

• vidu/one-click-v2/mv  

Transforms images and audio into dynamic music videos with camera movement and subtitle support. Designed for fast creation of professional video content in a single workflow.

• vidu/drama  

A specialized model designed for short-form drama and narrative content. It is optimized for emotionally expressive scenes, story-driven pacing, character-focused performance, and dramatic visual continuity, making it ideal for mini-series, episodic content, and serialized storytelling.

• vidu/ad  

A specialized commercial video generation model built for advertising and promotional content. It is designed for product showcases, brand storytelling, marketing campaigns, and conversion-focused creatives, with an emphasis on polished visuals, product clarity, and production-ready commercial output.

Image-to-Video Models

• vidu/q3/image-to-video  

The newest-generation image-to-video model in the Vidu lineup, delivering best-in-class motion quality, strong structural fidelity, and cinematic realism. Ideal for complex scenes, expressive motion, and fine-grained detail preservation.

• vidu/q2-pro/image-to-video-fast  

A professional-grade image-to-video model optimized for speed. It combines Q2 Pro's sharp details, stable identity preservation, and polished motion with significantly lower latency for high-volume production workflows.

• vidu/image-to-video-q2-pro  

A premium image-to-video model offering sharper visual detail, more stable character identity, and refined cinematic motion. Well suited for polished production assets, hero shots, and client-facing deliverables.

• vidu/image-to-video-q2-turbo  

A high-speed image-to-video model built for complex scenes and multi-character compositions. It delivers smooth, coherent motion and strong structure preservation while supporting rapid preview and iteration.

• vidu/image-to-video-q1  

A high-fidelity image-to-video model with enhanced texture detail and strong portrait performance. It maintains lighting and identity consistency while generating cinematic motion and expressive character behavior.

• vidu/image-to-video-2.0  

Transforms a single image into a smooth, coherent video while preserving composition, structure, and layout. Offers strong temporal stability and natural camera movement for professional editing and post-production pipelines.

• vidu/image-to-video  

A lightweight and efficient baseline I2V model for rapid ideation, early-stage drafts, and social media content. It balances speed, clean motion, and structural consistency.

Text-to-Video Models

• vidu/q3/text-to-video  

The most advanced text-to-video model in the Vidu family, delivering superior prompt adherence, richer scene composition, and more natural multi-character interactions for premium storytelling and commercial production.

• vidu/text-to-video-q2  

A flagship text-to-video model with stronger temporal coherence, richer scene detail, and more precise camera and motion control. Designed for complex narratives, branded content, and high-end creative use cases.

• vidu/text-to-video-q1  

A high-fidelity text-to-video model with richer color, sharper details, and stronger narrative continuity. Ideal for cinematic storytelling, visual branding, and polished marketing content.

• vidu/text-to-video-2.0  

Generates videos directly from prompts with reliable prompt adherence, coherent multi-object scenes, and controllable camera movement. A strong choice for conceptual, narrative, and creative video generation.

• vidu/text-to-video  

A streamlined baseline text-to-video model optimized for efficiency and turnaround speed. Suitable for ads, explainers, and fast iteration on text-driven concepts.

Reference-to-Video Models

• vidu/reference-to-video-q2  

A reference-guided video generation model that supports multiple objects or characters within a single scene, enabling more complex compositions and richer interactions.

• vidu/reference-to-video-q1  

An upgraded reference-based generator with sharper details and more faithful style and identity transfer. It reduces drift and artifacts, especially in close-ups and longer shots.

• vidu/reference-to-video-2.0  

Creates videos from a reference image while preserving character likeness, visual style, wardrobe consistency, and overall scene coherence across frames.

Start-End Frame Video Models

• vidu/q2-pro/start-end-to-video-fast  

A professional-grade start-end interpolation model optimized for speed. It combines reinforced temporal coherence with fast generation for efficient production workflows.

• vidu/start-end-to-video-q2-pro  

A high-end model focused on precise motion control and reinforced temporal coherence. It generates stable intermediate frames while closely following user-defined start and end constraints.

• vidu/start-end-to-video-q2-turbo  

A fast start-end video model built for rapid iteration and preview. It preserves subject integrity and core visual coherence while reducing generation latency.

• vidu/start-end-to-video-q1  

Improves motion smoothness and narrative continuity, producing more natural transitions between poses, camera positions, and scene states.

• vidu/start-end-to-video-2.0  

Synthesizes smooth motion between user-defined start and end frames while respecting scene geometry and composition. Ideal for transitions, reveals, and structured motion design.

• vidu/start-end-to-video  

A compact baseline model for simple start-end interpolation and quick previews. Suitable for basic transitions, animatics, and fast storyboard development.

Image Models

• vidu/text-to-image-q2  

A high-resolution cinematic text-to-image model for generating polished hero images, thumbnails, posters, and key visuals directly from prompts.

• vidu/reference-to-image-q2  

A reference-guided image generation model that supports up to seven input images plus a text prompt to create new high-resolution visuals while preserving subject identity and composition.

Vidu Models API — pricing & performance

Run any model in the Vidu Models collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.

Why run Vidu Models on WaveSpeedAI

Transparent pricing

Per-call pricing for every Vidu Models model. The price is listed on each model page — no platform fees on top.

Optimized for low latency

Most Vidu Models image models complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Vidu Models API cost?+

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.

How fast are Vidu Models models on WaveSpeedAI?+

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?+

Yes — every account gets $1 in free credits on signup, enough to try most Vidu Models models without a credit card.

Are there rate limits?+

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.