Seedance 2.0 立省 15% | 在 Video Generator 中創作 →
Vidu Models

Vidu Models

Shengshu's Vidu offers comprehensive AI video generation solutions with multiple specialized models and precise creative control.

Shengshu's Vidu offers comprehensive AI video generation solutions with multiple specialized models and precise creative control.

所有模型

35 個模型
vidu/q3/image-to-video
image-to-video

vidu/q3/image-to-video

Vidu Q3 Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/text-to-video
text-to-video

vidu/q3/text-to-video

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-pro/image-to-video
image-to-video

vidu/q3-pro/image-to-video

Vidu Q3 Pro Image-to-Video animates still images with high-quality motion via viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3-turbo/image-to-video
image-to-video

vidu/q3-turbo/image-to-video

Vidu Q3 Turbo Image-to-Video animates static images with high-quality motion and faster processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/image-to-video-pro
image-to-video

vidu/q3/image-to-video-pro

Vidu Q3 Image-to-Video Pro generates high-resolution videos (720p/1080p/2K/4K) from images with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/reference-to-video
image-to-video

vidu/q3/reference-to-video

Vidu Q3 Reference-to-Video Mix generates multi-entity consistent videos from 1-4 reference images with text prompt guidance. Supports 360p to 1080p resolutions, up to 16 seconds duration, multiple aspect ratios, and optional audio generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3/start-end-to-video
image-to-video

vidu/q3/start-end-to-video

Vidu Q3 Start End Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-turbo/start-end-to-video
image-to-video

vidu/q3-turbo/start-end-to-video

Vidu Q3 Turbo Start-End-to-Video creates smooth transitions between two images with faster processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q3-pro/start-end-to-video
image-to-video

vidu/q3-pro/start-end-to-video

Vidu Q3 Pro Start-End-to-Video creates smooth transitions between two keyframes with viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3-pro/text-to-video
text-to-video

vidu/q3-pro/text-to-video

Vidu Q3 Pro Text to Video is a fast AI video generation model that creates high-quality, audio-capable videos from text prompts with support for 1–16 second outputs. Ready-to-use REST inference API for cinematic clips, advertising creatives, social media videos, product visuals, storytelling, and professional text-to-video workflows with simple integration, no coldstarts, and affordable pricing.

vidu/image-to-video-2.0
image-to-video

vidu/image-to-video-2.0

Vidu Image to Video 2.0 converts images into smooth-transition videos with exceptional visual quality and diverse, natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-2.0
image-to-video

vidu/reference-to-video-2.0

Vidu Reference-to-Video 2.0 turns references into videos that preserve characters, objects, and environments with Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-2.0
image-to-video

vidu/start-end-to-video-2.0

Vidu Start-End to Video 2.0 generates smooth transition videos interpolating between given start and end images for natural morphing effects. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video
image-to-video

vidu/image-to-video

Vidu Image-to-Video converts images into smooth-transition videos with high visual quality and diverse motion for cinematic results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video
text-to-video

vidu/text-to-video

Vidu Text to Video converts text prompts into high-quality 720p videos with exceptional visual fidelity and diverse motion dynamics. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video
image-to-video

vidu/start-end-to-video

Vidu Start-End to Video converts a start and end image into a smooth transition Image-to-Video clip that morphs scenes seamlessly. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q2-pro
image-to-video

vidu/image-to-video-q2-pro

Vidu Q2 Pro turns a single still image into smooth, cinematic image-to-video with stable motion, clean edges, and consistent lighting. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q2-turbo
image-to-video

vidu/image-to-video-q2-turbo

Vidu Q2 Turbo Image-to-Video turns a single image into smooth, cinematic motion with fast, high-quality output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-q1
text-to-video

vidu/text-to-video-q1

Vidu Text-to-Video Q1 converts text prompts into high-quality videos with exceptional visual fidelity and motion diversity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-image-q2
image-to-image

vidu/reference-to-image-q2

Vidu Reference-to-Image Q2 generates high-quality images from 1–7 reference images plus a text prompt, preserving style and composition while allowing controlled changes to subjects, backgrounds, and fine details. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/one-click-v2/mv
audio-to-video

vidu/one-click-v2/mv

Vidu One-Click V2 MV transforms images and audio into videos with camera movements and subtitle support. Create professional video content with dynamic shots and text overlays in one click. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-image-q2
text-to-image

vidu/text-to-image-q2

Vidu Text-to-Image Q2 converts text prompts into high-quality images with exceptional visual detail and creative flexibility. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-q2
text-to-video

vidu/text-to-video-q2

Vidu Q2 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-pro/image-to-video-fast
image-to-video

vidu/q2-pro/image-to-video-fast

Vidu Q2 Pro Fast Image to Video generates high-quality videos from a single image with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-pro/start-end-to-video-fast
image-to-video

vidu/q2-pro/start-end-to-video-fast

Vidu Q2 Pro Fast Start-End to Video generates smooth video transitions between start and end images with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/template/halloween
video-effects

vidu/template/halloween

Vidu Halloween Templates delivers ready-made image and video templates for spooky promos and event invites with overlays. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-q2
image-to-video

vidu/reference-to-video-q2

Vidu Q2 is an Image-to-Video and Reference-to-Video model that emphasizes subtle facial expressions and smooth push-pull camera moves for natural motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q2-turbo
image-to-video

vidu/start-end-to-video-q2-turbo

Vidu Q2 Turbo Start-End to Video creates smooth Image-to-Video transitions between start and end images with fast high-quality results. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q2-pro
image-to-video

vidu/start-end-to-video-q2-pro

Vidu Q2 Pro Start-End to Video produces smooth image-to-video transitions between start and end images for seamless morphs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/text-to-video-2.0
text-to-video

vidu/text-to-video-2.0

Vidu Text-to-Video 2.0 converts text prompts into high-quality 720p videos with exceptional visual detail and diverse motion dynamics. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-turbo/extend-video
video-extend

vidu/q2-turbo/extend-video

Vidu Q2 Turbo Extend Video seamlessly extends existing videos by 1-7 seconds with consistent motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/q2-pro/extend-video
video-extend

vidu/q2-pro/extend-video

Vidu Q2 Pro Extend Video seamlessly extends existing videos by 1-7 seconds with high-quality motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/image-to-video-q1
image-to-video

vidu/image-to-video-q1

Vidu Image-to-Video creates smooth transition videos from specified start and end images, producing seamless image-to-video outputs for presentations and storytelling. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/start-end-to-video-q1
image-to-video

vidu/start-end-to-video-q1

Vidu Q1 Start-End To Video turns specified start and end images into smooth image-to-video transitions for morphs and scene fades. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

vidu/reference-to-video-q1
image-to-video

vidu/reference-to-video-q1

Generate videos from reference images while keeping characters, objects, and scene identity consistent using Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Vidu Models

Vidu is Shengshu Technology's advanced video generation suite, combining the Q3, Q2, Q1, and 2.0 series models. Built on open-source diffusion backbones and trained on large-scale, high-quality datasets, Vidu delivers strong performance across a wide range of video creation tasks. Its models offer precise control, consistent visual quality, and robust temporal stability, making Vidu suitable for professional, production-grade workflows.

Image-to-Video Models

vidu/q3/image-to-video The newest-generation image-to-video model with best-in-class motion quality, structural fidelity, and cinematic realism. Sets a new benchmark for I2V across complex scenes and fine-grained detail preservation.

vidu/q2-pro/image-to-video-fast Professional-grade image-to-video generation at turbo speed. Combines Q2-Pro's sharp detail and stable identity with significantly reduced latency for high-volume production pipelines.

vidu/image-to-video-q2-pro A professional-grade image-to-video model offering sharper detail, more stable character identity, and refined cinematic motion. Suited for polished production assets, hero shots, and client-facing deliverables.

vidu/image-to-video-q2-turbo A high-speed image-to-video model for complex scenes and multi-character shots. Delivers smooth, coherent motion and solid structure preservation while enabling near real-time preview and refinement.

vidu/image-to-video-q1 A premium image-to-video model with enhanced texture detail and superior portrait handling. Maintains lighting and identity consistency while generating cinematic motion and expressive character performance.

vidu/image-to-video-2.0 Transforms a single image into a smooth, coherent video while preserving structure, composition, and layout. Provides strong temporal stability and natural camera motion for professional post-production and editing pipelines.

vidu/image-to-video A lightweight, fast I2V model for rapid drafts, ideation, and social media content. Balances speed and structural preservation, producing clean clips with minimal artifacts.

Text-to-Video Models

vidu/q3/text-to-video The most advanced text-to-video model in the Vidu lineup. Delivers superior prompt adherence, richer scene composition, and more natural multi-character interactions for high-end creative and commercial storytelling.

vidu/text-to-video-q2 A flagship text-to-video model with stronger temporal coherence, richer scene detail, and more precise camera and motion control. Designed for complex, multi-character narratives and high-end commercial storytelling.

vidu/text-to-video-q1 A high-fidelity T2V model offering richer color, sharper detail, and stronger narrative continuity. Ideal for cinematic storytelling, branding, and visually polished marketing assets.

vidu/text-to-video-2.0 Generates videos directly from text prompts with reliable prompt adherence, coherent multi-object scenes, and controllable camera motion. Well suited for high-quality conceptual and narrative video generation.

vidu/text-to-video A baseline T2V option optimized for efficiency and turnaround speed. Designed for ads, explainers, and straightforward text-driven concepts where fast iteration is key.

Reference-to-Video Models

vidu/reference-to-video-q2 Supports multiple distinct objects or characters interacting within a single video, enabling complex, reference-guided scene compositions.

vidu/reference-to-video-q1 An upgraded reference-based generator with sharper details and more faithful style and identity transfer. Reduces drift and artifacts, especially in close-ups and longer shots.

vidu/reference-to-video-2.0 Creates videos guided by a reference image, ensuring accurate character likeness, stable style control, and consistent wardrobe and appearance across frames.

Start-End Frame Video Models

vidu/q2-pro/start-end-to-video-fast Professional-grade start-end interpolation at turbo speed. Combines Q2-Pro's reinforced temporal coherence with drastically reduced generation time for rapid production workflows.

vidu/start-end-to-video-q2-pro A professional-grade model focused on reinforced temporal coherence and precise motion control. Generates stable intermediate frames while closely aligning with user-specified start and end constraints.

vidu/start-end-to-video-q2-turbo A high-speed variant optimized for rapid iteration and preview. Preserves core coherence and subject integrity while significantly reducing generation latency.

vidu/start-end-to-video-q1 Enhances narrative continuity and motion smoothness, producing more natural easing between poses, camera positions, and scene states.

vidu/start-end-to-video-2.0 Synthesizes smooth motion between user-defined start and end frames while respecting overall scene geometry and layout. Ideal for transitions, reveals, and structured motion design.

vidu/start-end-to-video A compact baseline model for simple start–end interpolation and quick previews. Suitable for basic transitions, animatics, and fast storyboard development.

Image Models

vidu/text-to-image-q2 High-resolution cinematic text-to-image model for generating polished hero shots, thumbnails, and key visuals directly from prompts.

vidu/reference-to-image-q2 Reference-guided image generator that uses up to seven input images plus a prompt to create new, high-res shots that preserve subject identity and composition.

Special Models

vidu/one-click-v2/mv Vidu One-Click V2 MV transforms images and audio into videos with camera movements and subtitle support. Create professional video content with dynamic shots and text overlays in one click.

vidu/template/halloween A themed template model for stylized seasonal video content. Apply pre-designed creative templates to quickly generate themed videos with minimal effort.

Vidu Models API — 價格與效能

透過單一 REST API 執行 Vidu Models 系列中的任何模型。按生成計費 — 無訂閱、無最低消費 — 在可用率 99.9% 的基礎架構上提供業界領先的延遲。

為什麼在 WaveSpeedAI 上執行 Vidu Models

透明的價格

每個 Vidu Models 模型都採按呼叫計費。價格列在每個模型的頁面上 — 不會額外加收平台費。

為低延遲最佳化

大多數 Vidu Models 影像模型在 2 秒內完成。影片與 3D 模型比自架方案快數倍。

99.9% 可用率

多區域故障轉移與自動重試可在供應商故障期間 — 仍將您的生產流量保持線上。

常見問題

Vidu Models API 多少錢?+

每個模型在其模型頁面上都列有自己的按呼叫價格。我們按每次成功生成計費,沒有訂閱費或最低消費。

Vidu Models 模型在 WaveSpeedAI 上有多快?+

本系列中的影像模型通常在 2 秒內完成。影片與 3D 模型取決於長度與解析度,但通常比自架執行快數倍。

不用信用卡可以試用 API 嗎?+

可以 — 每個帳戶註冊時即可獲得 $1 的免費額度,足以在不使用信用卡的情況下試用大多數 Vidu Models 模型。

有速率限制嗎?+

標準帳戶具有充足的並行任務限制。Enterprise 方案提供自訂 RPM、更高並行性和專屬容量 — 詳情請聯繫業務。