Vidu Contest
WaveSpeed.ai
image-to-video

image-to-video

Hailuo 02 MiniMax Video Generation

minimax/video-02

Hailuo 02 is an AI video generation model fine-tuned for ultra-clear 1080P output and handling complex physics-driven scenes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

The model automatically optimizes incoming prompts to enhance output quality. This also activates the safety checker, which ensures content safety by detecting and filtering potential risks.

Idle

Votre requête coûtera $0.25 par exécution.

Pour $10 vous pouvez exécuter ce modèle environ 40 fois.

Encore une chose :

ExemplesTout voir

README

MiniMax Video-02 — minimax/video-02

MiniMax Video-02 generates short video clips from a text prompt, with optional image guidance. Describe the subject, action, scene, and camera movement, and the model produces a coherent, motion-rich clip suitable for story beats, ads, and creative prototyping.

Key capabilities

  • Text-to-video generation with strong motion and scene coherence
  • Optional image input to anchor composition and style
  • Camera-direction friendly prompting (follow, pull back, track, tilt, orbit)
  • Prompt expansion option to automatically enhance prompts and enable the safety checker

Use cases

  • Cinematic shot generation (camera moves, blocking, atmosphere)
  • Storyboarding and pre-visualization for short scenes
  • Marketing creatives and social clips with clear action cues
  • Image-guided variants (keep a reference look while changing motion/camera)

Pricing

ResolutionPrice per video
720p$0.0625
1080p$0.11

Parameters

  • prompt (required): What happens in the video (subject, action, scene, camera, style)
  • image (optional): Reference image to guide composition/style
  • resolution: Output resolution (e.g., 720p, 1080p)
  • duration: Video length (seconds)
  • enable_prompt_expansion: Enhances the prompt automatically and enables the safety checker

Prompting tips

  • Lead with action verbs (runs, spins, juggles, turns, laughs), then add camera language (pull back, track left, tilt up).
  • Keep one “main event” per clip; add atmosphere as a second layer (dust, fog, rim light, crowd).
  • If using an image, state what must stay consistent (character identity, outfit, composition) and what should change (motion, camera path, mood).