Pixverse AI Models

Pixverse v5 delivers versatile AI video generation with both standard and fast processing options.

All Models

19 models
image-to-video

pixverse/pixverse-v6/transition

PixVerse V6 Transition creates smooth AI-generated video transitions between a start image and an optional end image. Supports 360p to 1080p resolutions, 1-15 second duration, multiple aspect ratios, optional audio generation, and multi-clip mode. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-extend

pixverse/pixverse-v6/extend

PixVerse V6 Extend continues and enhances existing video content by analyzing the ending segment and generating new frames forward. Supports 360p to 1080p resolutions, 1-15 second extension duration, optional audio generation, and multiple styles. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v6/image-to-video

PixVerse V6 generates high-quality videos from images with flexible duration (1-15s), multiple resolutions up to 1080p, and optional audio generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v6/text-to-video

PixVerse V6 generates high-quality videos from text prompts with flexible duration (1-15s), multiple resolutions up to 1080p, and optional audio generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
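The V6 listings above state a 1-15 second duration range, resolutions up to 1080p, and optional audio. A minimal sketch of checking those limits before submitting a job might look like the following. The field names (`prompt`, `duration`, `resolution`, `generate_audio`, `image`) and the discrete resolution steps are assumptions made for illustration; the listing does not give the actual request schema, so consult the API documentation for the real parameter names.

```python
from dataclasses import dataclass, asdict
from typing import Optional

# Assumed discrete steps; the listing only says "360p to 1080p".
ASSUMED_RESOLUTIONS = ("360p", "540p", "720p", "1080p")
MIN_DURATION_S, MAX_DURATION_S = 1, 15  # range stated in the V6 listings


@dataclass
class V6Request:
    """Hypothetical request body; field names are illustrative only."""
    model: str
    prompt: str
    duration: int = 5
    resolution: str = "720p"
    generate_audio: bool = False
    image: Optional[str] = None  # image URL, used by image-to-video

    def validate(self) -> None:
        if not MIN_DURATION_S <= self.duration <= MAX_DURATION_S:
            raise ValueError(
                f"duration must be {MIN_DURATION_S}-{MAX_DURATION_S}s, "
                f"got {self.duration}"
            )
        if self.resolution not in ASSUMED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
        if self.model.endswith("image-to-video") and not self.image:
            raise ValueError("image-to-video requires an image")

    def to_payload(self) -> dict:
        """Validate, then return a dict without unset optional fields."""
        self.validate()
        return {k: v for k, v in asdict(self).items() if v is not None}


request = V6Request(
    model="pixverse/pixverse-v6/text-to-video",
    prompt="a paper boat drifting down a rainy street",
    duration=8,
)
print(request.to_payload())
```

Validating locally keeps obviously invalid jobs (for example a 16-second duration) from being sent at all; the resulting dict could then be serialized as the JSON body of a REST call.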

image-to-video

pixverse/pixverse-v5.5/image-to-video

PixVerse V5.5 Image-to-Video turns a single image into cinematic clips with smooth motion, clean detail, and strong subject fidelity—ideal for logo stingers, character motion, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v5.5/text-to-video

PixVerse V5.5 transforms text prompts into realistic videos with smooth motion and natural detail in seconds; ideal for stories, ads, and social clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5.5-transition

PixVerse V5.5 Transition creates smooth morph transitions between two images, output as 5s, 8s, or 10s videos at 360p, 540p, 720p, or 1080p; well suited to logo reveals, before-and-after shots, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5.6/image-to-video

PixVerse V5.6 Image-to-Video turns a single image into cinematic clips with smooth motion, clean detail, and strong subject fidelity—ideal for logo stingers, character motion, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v5.6/text-to-video

PixVerse V5.6 transforms text prompts into realistic videos with smooth motion and natural detail in seconds; ideal for stories, ads, and social clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5-effects

PixVerse V5 Effects converts images into smooth, natural short videos with lifelike motion; supports 5s/8s and 720p/1080p outputs. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5-i2v

PixVerse V5 converts images to short, smooth, natural-looking videos. A 5s video costs $0.15 at 360p/540p, $0.20 at 720p, and $0.40 at 1080p. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
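The per-clip prices quoted for pixverse-v5-i2v make it easy to estimate the cost of a batch. The sketch below hard-codes the four 5-second prices from the listing; the function name `batch_cost` is illustrative, and prices for other durations are not stated on this page, so they are deliberately left out.

```python
from decimal import Decimal

# Price per 5-second clip, copied from the pixverse-v5-i2v listing.
PRICE_PER_5S_CLIP = {
    "360p": Decimal("0.15"),
    "540p": Decimal("0.15"),
    "720p": Decimal("0.20"),
    "1080p": Decimal("0.40"),
}


def batch_cost(resolution: str, clips: int) -> Decimal:
    """Estimated USD cost of `clips` five-second videos at one resolution."""
    if clips < 0:
        raise ValueError("clips must be non-negative")
    if resolution not in PRICE_PER_5S_CLIP:
        raise ValueError(f"no listed price for resolution: {resolution}")
    return PRICE_PER_5S_CLIP[resolution] * clips


print(batch_cost("1080p", 25))  # 25 clips at $0.40 each -> 10.00
print(batch_cost("720p", 25))   # 25 clips at $0.20 each -> 5.00
```

`Decimal` is used instead of `float` so that currency totals do not pick up binary rounding errors.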

image-to-video

pixverse/pixverse-v4.5-i2v

PixVerse V4.5 I2V creates high-quality videos from text or image prompts, offering multiple resolutions, aspect ratios, and motion modes for versatile cinematic output. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v4.5-t2v

PixVerse V4.5 turns text prompts into high-quality videos with multiple resolutions, aspect ratios, and motion modes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

digital-human

pixverse/lipsync

PixVerse LipSync converts audio into realistic lip-sync animation for video avatars, with precise mouth movements and timing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5.5-effects

PixVerse V5.5 Effects is an AI image-to-video model that converts still images into smooth, natural short videos with lifelike motion, supporting 5s/8s/10s clips at 360p to 1080p for social posts, ads, and previews. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v4.5-i2v-fast

PixVerse V4.5 I2V Fast converts images or text into high-quality videos with multi-resolution, aspect-ratio, and motion-mode control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v4.5-t2v-fast

PixVerse V4.5 Fast turns text prompts into high-quality videos with multiple resolutions, aspect ratios, and motion modes. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-video

pixverse/pixverse-v5-t2v

PixVerse V5 Text-to-Video generates smooth, natural 5s videos from text prompts in seconds, with 720p output available ($0.20 per 5s). Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-video

pixverse/pixverse-v5-transition

PixVerse V5 Transition creates smooth morph transitions between two static images, output as 5s or 8s videos at 360p, 540p, 720p, or 1080p. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
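The two transition listings differ in which clip lengths they accept: pixverse-v5-transition lists 5s and 8s, while pixverse-v5.5-transition also lists 10s. A small lookup table, sketched here with the values taken from those two listings, can tell you which model fits a given duration and resolution. The helper name `models_supporting` is hypothetical.

```python
from typing import List

# Options as stated in the v5 and v5.5 transition listings.
TRANSITION_OPTIONS = {
    "pixverse/pixverse-v5-transition": {
        "durations": {5, 8},
        "resolutions": {"360p", "540p", "720p", "1080p"},
    },
    "pixverse/pixverse-v5.5-transition": {
        "durations": {5, 8, 10},
        "resolutions": {"360p", "540p", "720p", "1080p"},
    },
}


def models_supporting(duration_s: int, resolution: str) -> List[str]:
    """Return the transition model IDs that list this duration and resolution."""
    return sorted(
        model_id
        for model_id, opts in TRANSITION_OPTIONS.items()
        if duration_s in opts["durations"] and resolution in opts["resolutions"]
    )


print(models_supporting(10, "720p"))  # only the v5.5 model lists 10s clips
print(models_supporting(5, "540p"))   # both models list this combination
```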

Pixverse AI Models

The PixVerse lineup, from v4.5 through v6, is a complete AI video suite for storyboards, ads, trailers, and social clips. It covers both Text-to-Video (T2V) and Image-to-Video (I2V), and the v4.5 models come in standard and fast variants, so you can choose between maximum cinematic quality and rapid turnaround for bulk production. Supported output resolutions vary by model, ranging from 360p to 1080p, and the models are tuned for stable motion, clean details, and consistent characters.

Model Lineup

  1. pixverse-v5.5-image-to-video – Flagship I2V for cinematic character shots, complex scenes and fine detail.
  2. pixverse-v5.5-text-to-video – High-end T2V for carefully directed camera moves, lighting and story prompts.
  3. pixverse-v5.5-transition – Premium transitions for smooth scene changes, morphs and between-shot motion.
  4. pixverse-v5.5-effects – Turns still images into short clips with stylized motion, lighting and atmosphere effects.
  5. pixverse-v5-i2v – Lightweight everyday I2V for simple animations and social content.
  6. pixverse-v5-effects – Fast effects model that turns images into short 5s/8s clips with quick visual polish.
  7. pixverse-v4.5-i2v – Stable general-purpose I2V with natural motion and strong subject consistency.
  8. pixverse-v4.5-t2v – Versatile T2V for product demos, explainers and narrative videos.
  9. pixverse-v4.5-i2v-fast – Speed-optimized I2V for batch jobs and rapid iteration.
  10. pixverse-v4.5-t2v-fast – Fast T2V for drafts, variations and A/B testing.
  11. pixverse-v5-transition – Cost-effective transition model for basic scene links and camera moves.
  12. pixverse-v5.6 – Cinematic realism and an advanced physics engine, built for high-end commercial video and immersive storytelling.
  13. lipsync – Turns a still portrait into a talking character, matching lip motion to your audio.
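One way to use the lineup above in code is a small registry that maps a task and a speed preference to a model ID. The sketch below uses IDs that appear in the catalog listing; the specific pairing (v5.5 for standard quality, the v4.5 fast variants for speed) is an illustrative choice rather than an official recommendation, and `choose_model` is a hypothetical helper.

```python
# (task, tier) -> model ID. IDs are taken from the catalog listing;
# which model fills each slot is an assumption made for this example.
MODEL_REGISTRY = {
    ("text-to-video", "standard"): "pixverse/pixverse-v5.5/text-to-video",
    ("text-to-video", "fast"): "pixverse/pixverse-v4.5-t2v-fast",
    ("image-to-video", "standard"): "pixverse/pixverse-v5.5/image-to-video",
    ("image-to-video", "fast"): "pixverse/pixverse-v4.5-i2v-fast",
}


def choose_model(task: str, fast: bool = False) -> str:
    """Pick a model ID for a task, preferring a fast variant when asked."""
    tier = "fast" if fast else "standard"
    try:
        return MODEL_REGISTRY[(task, tier)]
    except KeyError:
        raise ValueError(f"no model registered for {task!r} ({tier})") from None


print(choose_model("text-to-video"))              # standard-quality T2V
print(choose_model("image-to-video", fast=True))  # speed-optimized I2V
```

Keeping the mapping in one table means a batch pipeline can switch between draft-quality and final-quality runs by flipping a single flag.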

Why Pixverse on WaveSpeedAI?

  1. Quality vs. speed choices – Standard models for premium shots, fast variants for large-scale production.
  2. Consistent characters – Strong identity preservation across frames and shots.
  3. Flexible resolutions & lengths – Great for short hooks, mid-length promos and looping clips.
  4. Built-in effects & transitions – Finish your cuts with motion FX and scene links in the same toolkit.
  5. Voice-ready – Combine with lipsync to create talking characters, hosts and product spokespeople.

Great for

  1. Shorts & social hooks – 3–10s clips for TikTok, Reels and YouTube Shorts.
  2. Ads & e-commerce – Product hero shots, fashion walks, beauty close-ups and CTAs.
  3. Trailers & teasers – Dynamic openings, transitions and atmospheric B-roll.
  4. Talking avatars – Reviews, announcements and tutorials using the lipsync model.