Vidu Models

Shengshu's Vidu offers comprehensive AI video generation solutions with multiple specialized models and precise creative control.

All Models

35 models
vidu/q3/image-to-video
image-to-video

Vidu Q3 Image-to-Video turns still images into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3/text-to-video
text-to-video

Vidu Q3 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3-pro/image-to-video
image-to-video

Vidu Q3 Pro Image-to-Video animates still images with high-quality motion via viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3-turbo/image-to-video
image-to-video

Vidu Q3 Turbo Image-to-Video animates static images with high-quality motion and faster processing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3/image-to-video-pro
image-to-video

Vidu Q3 Image-to-Video Pro generates high-resolution videos (720p/1080p/2K/4K) from images with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3/reference-to-video
image-to-video

Vidu Q3 Reference-to-Video Mix generates multi-entity consistent videos from 1–4 reference images with text prompt guidance. Supports 360p to 1080p resolutions, durations up to 16 seconds, multiple aspect ratios, and optional audio generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3/start-end-to-video
image-to-video

Vidu Q3 Start-End-to-Video generates high-quality videos that transition between a start image and an end image, with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3-turbo/start-end-to-video
image-to-video

Vidu Q3 Turbo Start-End-to-Video creates smooth transitions between two images with faster processing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q3-pro/start-end-to-video
image-to-video

Vidu Q3 Pro Start-End-to-Video creates smooth transitions between two keyframes with viduq3-pro (1–16s). Billing follows Vidu's published Q3-pro per-second rates by resolution. Ready-to-use REST inference API on WaveSpeed.

vidu/q3-pro/text-to-video
text-to-video

Vidu Q3 Pro Text-to-Video is a fast AI video generation model that creates high-quality, audio-capable videos from text prompts, with support for 1–16 second outputs. Ready-to-use REST inference API for cinematic clips, advertising creatives, social media videos, product visuals, storytelling, and professional text-to-video workflows, with simple integration, no cold starts, and affordable pricing.

vidu/image-to-video-2.0
image-to-video

Vidu Image-to-Video 2.0 converts images into smooth-transition videos with exceptional visual quality and diverse, natural motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/reference-to-video-2.0
image-to-video

Vidu Reference-to-Video 2.0 turns references into videos that preserve characters, objects, and environments with Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/start-end-to-video-2.0
image-to-video

Vidu Start-End to Video 2.0 generates smooth transition videos that interpolate between given start and end images for natural morphing effects. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/image-to-video
image-to-video

Vidu Image-to-Video converts images into smooth-transition videos with high visual quality and diverse motion for cinematic results. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-video
text-to-video

Vidu Text-to-Video converts text prompts into high-quality 720p videos with exceptional visual fidelity and diverse motion dynamics. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/start-end-to-video
image-to-video

Vidu Start-End to Video converts a start image and an end image into a smooth transition clip that morphs between scenes seamlessly. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/image-to-video-q2-pro
image-to-video

Vidu Q2 Pro Image-to-Video turns a single still image into smooth, cinematic video with stable motion, clean edges, and consistent lighting. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/image-to-video-q2-turbo
image-to-video

Vidu Q2 Turbo Image-to-Video turns a single image into smooth, cinematic motion with fast, high-quality output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-video-q1
text-to-video

Vidu Text-to-Video Q1 converts text prompts into high-quality videos with exceptional visual fidelity and motion diversity. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/reference-to-image-q2
image-to-image

Vidu Reference-to-Image Q2 generates high-quality images from 1–7 reference images plus a text prompt, preserving style and composition while allowing controlled changes to subjects, backgrounds, and fine details. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/one-click-v2/mv
audio-to-video

Vidu One-Click V2 MV transforms images and audio into videos with camera movements and subtitle support. Create professional video content with dynamic shots and text overlays in one click. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-image-q2
text-to-image

Vidu Text-to-Image Q2 converts text prompts into high-quality images with exceptional visual detail and creative flexibility. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-video-q2
text-to-video

Vidu Q2 Text-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-pro/image-to-video-fast
image-to-video

Vidu Q2 Pro Fast Image to Video generates high-quality videos from a single image with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-pro/start-end-to-video-fast
image-to-video

Vidu Q2 Pro Fast Start-End to Video generates smooth video transitions between start and end images with faster generation speed. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/template/halloween
video-effects

Vidu Halloween Templates delivers ready-made image and video templates, with overlays, for spooky promos and event invites. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/reference-to-video-q2
image-to-video

Vidu Q2 is an Image-to-Video and Reference-to-Video model that emphasizes subtle facial expressions and smooth push-pull camera moves for natural motion. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/start-end-to-video-q2-turbo
image-to-video

Vidu Q2 Turbo Start-End to Video creates smooth transitions between start and end images with fast, high-quality results. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/start-end-to-video-q2-pro
image-to-video

Vidu Q2 Pro Start-End to Video produces smooth transitions between start and end images for seamless morphs. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/text-to-video-2.0
text-to-video

Vidu Text-to-Video 2.0 converts text prompts into high-quality 720p videos with exceptional visual detail and diverse motion dynamics. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-turbo/extend-video
video-extend

Vidu Q2 Turbo Extend Video seamlessly extends existing videos by 1–7 seconds with consistent motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/q2-pro/extend-video
video-extend

Vidu Q2 Pro Extend Video seamlessly extends existing videos by 1–7 seconds with high-quality motion and scene continuity. Supports optional end-frame image guidance for precise control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/image-to-video-q1
image-to-video

Vidu Q1 Image-to-Video turns an input image into a smooth video for presentations and storytelling. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/start-end-to-video-q1
image-to-video

Vidu Q1 Start-End to Video turns specified start and end images into smooth transitions for morphs and scene fades. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

vidu/reference-to-video-q1
image-to-video

Vidu Reference-to-Video Q1 generates videos from reference images while keeping characters, objects, and scene identity consistent using Multi-Entity Consistency. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
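
Several cards above state input limits: 1–16 second outputs for the Q3 Pro models, 1–7 second extensions for the Q2 extend-video models, 1–4 reference images for vidu/q3/reference-to-video, and 1–7 for vidu/reference-to-image-q2. A client could check these before sending a job. The sketch below is illustrative only: the limits are copied from the cards, but the function name, argument names, and table layout are assumptions and do not reflect the actual request schema.

```python
# Limits copied from the model cards above. The table layout and the
# argument names used below are illustrative assumptions, not the real
# API schema.
DURATION_LIMITS_S = {
    "vidu/q3-pro/image-to-video": (1, 16),
    "vidu/q3-pro/start-end-to-video": (1, 16),
    "vidu/q3-pro/text-to-video": (1, 16),
    "vidu/q2-turbo/extend-video": (1, 7),  # seconds added to the clip
    "vidu/q2-pro/extend-video": (1, 7),    # seconds added to the clip
}
REFERENCE_IMAGE_LIMITS = {
    "vidu/q3/reference-to-video": (1, 4),
    "vidu/reference-to-image-q2": (1, 7),
}


def check_request(model_id, duration_s=None, reference_images=()):
    """Return a list of problems; an empty list means the inputs fit."""
    problems = []
    if model_id in DURATION_LIMITS_S and duration_s is not None:
        lo, hi = DURATION_LIMITS_S[model_id]
        if not lo <= duration_s <= hi:
            problems.append(f"duration {duration_s}s outside {lo}-{hi}s")
    if model_id in REFERENCE_IMAGE_LIMITS:
        lo, hi = REFERENCE_IMAGE_LIMITS[model_id]
        n = len(reference_images)
        if not lo <= n <= hi:
            problems.append(f"{n} reference images, expected {lo}-{hi}")
    return problems


print(check_request("vidu/q2-pro/extend-video", duration_s=10))
```

Models that are not in either table pass unchecked, so the helper only catches the limits that the cards state explicitly.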

Vidu Models

Vidu is Shengshu Technology's advanced video generation suite, combining the Q3, Q2, Q1, and 2.0 series models. Built on open-source diffusion backbones and trained on large-scale, high-quality datasets, Vidu delivers strong performance across a wide range of video creation tasks. Its models offer precise control, consistent visual quality, and robust temporal stability, making Vidu suitable for professional, production-grade workflows.

Image-to-Video Models

vidu/q3/image-to-video: The newest-generation image-to-video model with best-in-class motion quality, structural fidelity, and cinematic realism. Sets a new benchmark for I2V across complex scenes and fine-grained detail preservation.

vidu/q2-pro/image-to-video-fast: Professional-grade image-to-video generation at turbo speed. Combines Q2-Pro's sharp detail and stable identity with significantly reduced latency for high-volume production pipelines.

vidu/image-to-video-q2-pro: A professional-grade image-to-video model offering sharper detail, more stable character identity, and refined cinematic motion. Suited for polished production assets, hero shots, and client-facing deliverables.

vidu/image-to-video-q2-turbo: A high-speed image-to-video model for complex scenes and multi-character shots. Delivers smooth, coherent motion and solid structure preservation while enabling near real-time preview and refinement.

vidu/image-to-video-q1: A premium image-to-video model with enhanced texture detail and superior portrait handling. Maintains lighting and identity consistency while generating cinematic motion and expressive character performance.

vidu/image-to-video-2.0: Transforms a single image into a smooth, coherent video while preserving structure, composition, and layout. Provides strong temporal stability and natural camera motion for professional post-production and editing pipelines.

vidu/image-to-video: A lightweight, fast I2V model for rapid drafts, ideation, and social media content. Balances speed and structural preservation, producing clean clips with minimal artifacts.

Text-to-Video Models

vidu/q3/text-to-video: The most advanced text-to-video model in the Vidu lineup. Delivers superior prompt adherence, richer scene composition, and more natural multi-character interactions for high-end creative and commercial storytelling.

vidu/text-to-video-q2: A flagship text-to-video model with stronger temporal coherence, richer scene detail, and more precise camera and motion control. Designed for complex, multi-character narratives and high-end commercial storytelling.

vidu/text-to-video-q1: A high-fidelity T2V model offering richer color, sharper detail, and stronger narrative continuity. Ideal for cinematic storytelling, branding, and visually polished marketing assets.

vidu/text-to-video-2.0: Generates videos directly from text prompts with reliable prompt adherence, coherent multi-object scenes, and controllable camera motion. Well suited for high-quality conceptual and narrative video generation.

vidu/text-to-video: A baseline T2V option optimized for efficiency and turnaround speed. Designed for ads, explainers, and straightforward text-driven concepts where fast iteration is key.

Reference-to-Video Models

vidu/reference-to-video-q2: Supports multiple distinct objects or characters interacting within a single video, enabling complex, reference-guided scene compositions.

vidu/reference-to-video-q1: An upgraded reference-based generator with sharper details and more faithful style and identity transfer. Reduces drift and artifacts, especially in close-ups and longer shots.

vidu/reference-to-video-2.0: Creates videos guided by a reference image, ensuring accurate character likeness, stable style control, and consistent wardrobe and appearance across frames.

Start-End Frame Video Models

vidu/q2-pro/start-end-to-video-fast: Professional-grade start-end interpolation at turbo speed. Combines Q2-Pro's reinforced temporal coherence with drastically reduced generation time for rapid production workflows.

vidu/start-end-to-video-q2-pro: A professional-grade model focused on reinforced temporal coherence and precise motion control. Generates stable intermediate frames while closely aligning with user-specified start and end constraints.

vidu/start-end-to-video-q2-turbo: A high-speed variant optimized for rapid iteration and preview. Preserves core coherence and subject integrity while significantly reducing generation latency.

vidu/start-end-to-video-q1: Enhances narrative continuity and motion smoothness, producing more natural easing between poses, camera positions, and scene states.

vidu/start-end-to-video-2.0: Synthesizes smooth motion between user-defined start and end frames while respecting overall scene geometry and layout. Ideal for transitions, reveals, and structured motion design.

vidu/start-end-to-video: A compact baseline model for simple start–end interpolation and quick previews. Suitable for basic transitions, animatics, and fast storyboard development.

Image Models

vidu/text-to-image-q2: High-resolution cinematic text-to-image model for generating polished hero shots, thumbnails, and key visuals directly from prompts.

vidu/reference-to-image-q2: Reference-guided image generator that uses up to seven input images plus a prompt to create new, high-res shots that preserve subject identity and composition.

Special Models

vidu/one-click-v2/mv: Transforms images and audio into videos with camera movements and subtitle support. Create professional video content with dynamic shots and text overlays in one click.

vidu/template/halloween: A themed template model for stylized seasonal video content. Apply pre-designed creative templates to quickly generate themed videos with minimal effort.

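Vidu Models API: Pricing and Performance

Run any model in the Vidu Models family through a single REST API. Billing is per generation, with no subscription and no minimum spend, on infrastructure with 99.9% availability and industry-leading latency.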
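
Since every model is reached through one REST API, a client mostly needs a model ID and a JSON payload. The sketch below only builds a request object and sends nothing. The base URL is a deliberate placeholder, and the URL layout, Bearer-token header, and `prompt` field are all assumptions made for illustration. The real values belong to the API documentation on each model's page.

```python
import json
import urllib.request

# ASSUMPTION: placeholder host. Replace with the documented endpoint.
BASE_URL = "https://api.example.invalid/v1"


def build_request(model_id, payload, api_key, base_url=BASE_URL):
    """Build (but do not send) a POST request for one generation job.

    The "<base_url>/<model_id>" layout and Bearer auth are assumptions.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/{model_id}",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


req = build_request(
    "vidu/q3/text-to-video",
    {"prompt": "a paper boat drifting down a rainy street"},  # hypothetical field
    api_key="YOUR_API_KEY",
)
print(req.get_method(), req.full_url)
```

Keeping request construction separate from sending makes it easy to swap model IDs from the list above without touching the transport code.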

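Why Run Vidu Models on WaveSpeedAI

Transparent pricing

Every model in the Vidu Models family is priced per call. Prices are listed on each model's page, and there are no additional platform fees.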
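
The Q3 Pro cards above note that billing follows per-second rates that vary by resolution, so a rough pre-flight estimate is just duration times rate. The sketch below shows that arithmetic. The rates in `PLACEHOLDER_RATES` are made-up numbers for demonstration only and are not actual prices; real rates are listed on each model's page.

```python
from decimal import Decimal

# MADE-UP example rates (price per second of output), NOT real prices.
PLACEHOLDER_RATES = {
    "720p": Decimal("0.10"),
    "1080p": Decimal("0.20"),
}


def estimate_cost(duration_s, resolution, rate_per_second):
    """Estimate the cost of one clip billed per second by resolution."""
    if resolution not in rate_per_second:
        raise KeyError(f"no rate known for resolution {resolution!r}")
    return Decimal(duration_s) * rate_per_second[resolution]


# With the placeholder rates, an 8-second 1080p clip is 8 * 0.20.
print(estimate_cost(8, "1080p", PLACEHOLDER_RATES))
```

`Decimal` is used instead of floats so that sums of many small per-second charges do not pick up rounding noise.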

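Optimized for low latency

Most image models in the Vidu Models family finish in under 2 seconds. Video and 3D models run several times faster than self-hosted setups.

99.9% availability

Multi-region failover and automatic retries keep your production traffic online, even during provider outages.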

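FAQ

How much does the Vidu Models API cost?

Each model lists its own per-call price on its model page. We bill per successful generation, with no subscription fee or minimum spend.

How fast are Vidu Models on WaveSpeedAI?

Image models in this family typically finish in under 2 seconds. Video and 3D models depend on duration and resolution, but are typically several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account receives $1 in free credits at sign-up, enough to try most Vidu Models without a credit card.

Are there rate limits?

Standard accounts have generous concurrent-task limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity; contact sales for details.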
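
Because accounts have a concurrent-task limit, a batch client may want to cap how many jobs it keeps in flight. The following is a minimal sketch using a thread pool. `fake_submit` is a hypothetical stand-in for a real API call, and the cap of 3 is an arbitrary example value, not a documented limit.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def run_with_cap(jobs, submit_fn, max_in_flight):
    """Run submit_fn(job) for every job, at most max_in_flight at a time."""
    with ThreadPoolExecutor(max_workers=max_in_flight) as pool:
        # pool.map returns results in the same order as the input jobs.
        return list(pool.map(submit_fn, jobs))


# Hypothetical stand-in for a network call; it also records peak concurrency.
_lock = threading.Lock()
_active = 0
peak = 0


def fake_submit(prompt):
    global _active, peak
    with _lock:
        _active += 1
        peak = max(peak, _active)
    time.sleep(0.01)  # simulate waiting on the service
    with _lock:
        _active -= 1
    return f"done: {prompt}"


results = run_with_cap([f"prompt {i}" for i in range(8)], fake_submit, max_in_flight=3)
print(len(results), "jobs finished; peak concurrency", peak)
```

The pool size acts as the cap, so lowering `max_in_flight` is enough to stay under whatever limit applies to the account.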