WaveSpeed.ai
Startseite/Entdecken/wavespeed-ai/short-video-generator
text-to-video

text-to-video

Short Video Generator

wavespeed-ai/short-video-generator

WaveSpeed Short Video Generator creates professional short-form videos from text prompts and optional reference images with native audio, smooth motion, and versatile aspect ratios. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input

Idle

Ihre Anfrage kostet $0.8 pro Durchlauf.

Für $10 können Sie dieses Modell ungefähr 12 Mal ausführen.

Noch etwas:

BeispieleAlle anzeigen

README

Short Video Generator

Short Video Generator produces professional short-form videos with native audio from text prompts and optional reference images. A versatile all-purpose model for creating polished clips across any format — landscape, vertical, or classic — with cinematic motion, realistic physics, and expressive character performance.

Why Choose This?

  • All-Purpose Short Video One model for every short-form need: ads, social posts, product demos, explainers, teasers, and more. No specialized pipeline required.

  • Native Audio Every video includes synchronized sound — ambient effects, background music, and dialogue with accurate lip-sync. Ready to publish without audio post-production.

  • Cinematic Motion Smooth, natural movement with professional camera work — tracking shots, dolly moves, rack focus, and handheld feel that elevate short-form content to broadcast quality.

  • Real-World Physics Objects interact naturally with proper weight, momentum, and collision. Fluid dynamics and realistic inertia make every scene believable.

  • Reference Image Support Upload up to 4 images to guide character appearance, product details, brand aesthetics, or scene composition for consistent results.

  • Every Aspect Ratio 16:9 for YouTube and web, 9:16 for TikTok and Reels, 4:3 and 3:4 for classic and editorial formats.

Parameters

ParameterRequiredDescription
promptYesDescribe the scene, action, and mood for the video
imagesNoUp to 4 reference images for style, character, or brand guide
aspect_ratioNo16:9 (default), 9:16, 4:3, or 3:4
durationNo5 (default), 10, or 15 seconds

How to Use

  1. Write your prompt — describe the scene with detail: environment, characters, action, lighting, and mood.
  2. Add reference images (optional) — upload product photos, character headshots, or style references.
  3. Select aspect ratio — pick the format for your target platform or use case.
  4. Set duration — 5s for bumpers and hooks, 10s for standard clips, 15s for mini-narratives.
  5. Generate — submit and receive a polished video with synchronized audio.

Pricing

DurationCost
5 s$0.80
10 s$1.60
15 s$2.40

Billing Rules

  • Rate: $0.80 per 5 seconds
  • All aspect ratios are the same price
  • Reference images do not affect pricing

Best Use Cases

  • Marketing & Ads — Generate high-converting video ads with product reference images for brand consistency.
  • Social Media — Produce platform-ready content for any social channel in the right aspect ratio.
  • Product Demos — Showcase features and benefits with professional motion and lighting.
  • Explainer Clips — Turn concepts into visual demonstrations with clear, engaging motion.
  • Event Teasers — Create anticipation with dramatic, cinematic short clips.
  • E-Commerce — Generate lifestyle product videos at scale from product photos.

Prompt Tips

  • Be descriptive about the scene: Include environment, lighting, time of day, and atmosphere.
  • Specify camera movement: "slow pan across the table", "tracking shot following the subject", "static close-up" all work well.
  • Describe the mood: "energetic and upbeat", "calm and serene", "dramatic and tense" guide the overall feel.
  • Use reference images strategically: Product shots maintain brand identity; character photos ensure face consistency.
  • Match duration to content: 5s for punchy hooks, 10s for standard content, 15s for story-driven clips.

Notes

  • Prompt is the only required field. Reference images are optional but improve consistency.
  • Native audio is generated automatically with every video.
  • Default aspect ratio is 16:9 (landscape).
  • Duration options: 5, 10, or 15 seconds.

Related Models