Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only
ホーム/探索/Pixverse AI Models/pixverse/pixverse-v6/text-to-video
text-to-video

text-to-video

PixVerse V6

pixverse/pixverse-v6/text-to-video

PixVerse V6 generates high-quality videos from text prompts with flexible duration (1-15s), multiple resolutions up to 1080p, and optional audio generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input
Whether to generate audio for the video.

Idle

このリクエストには1回あたりで$0.1の費用がかかります。

$1でおよそ10回実行できます。

もうひとつお知らせ:

サンプルすべて表示

README

PixVerse V6 Text-to-Video

PixVerse V6 is PixVerse's latest text-to-video model, delivering high-fidelity cinematic video from natural language prompts. With resolution options from 360p to 1080p, flexible aspect ratios, optional synchronized audio generation, and a thinking mode for complex scenes, it supports a wide range of creative and production workflows.

Why Choose This?

  • High-fidelity video generation Produces detailed, visually coherent video with accurate motion, lighting, and scene composition from text descriptions.

  • Four resolution tiers Generate from 360p up to 1080p — balance quality and cost based on your delivery needs.

  • Optional audio generation Enable generate_audio_switch to produce synchronized ambient sound and atmosphere alongside the video.

  • Thinking mode The thinking_type parameter lets the model apply extended reasoning for complex or nuanced scene descriptions.

  • Flexible aspect ratios Supports multiple orientations to fit social, cinematic, and broadcast formats.

  • Prompt Enhancer Built-in tool to automatically improve your scene descriptions for richer output.

Parameters

ParameterRequiredDescription
promptYesText description of the scene, motion, camera style, and atmosphere.
aspect_ratioNoOutput aspect ratio. Default: 16:9.
resolutionNoOutput resolution: 360p, 540p, 720p (default), or 1080p.
durationNoClip length in seconds. Default: 5.
generate_audio_switchNoWhether to generate synchronized audio for the video. Default: off.
thinking_typeNoReasoning mode for scene generation. Default: auto.

How to Use

  1. Write your prompt — describe the scene, characters, camera movement, lighting, and atmosphere. Use the Prompt Enhancer for better results.
  2. Select aspect ratio — choose the format that fits your target platform.
  3. Select resolution — 360p for drafts, 720p for standard output, 1080p for final production.
  4. Set duration — choose your desired clip length in seconds.
  5. Enable audio (optional) — check generate_audio_switch to generate synchronized sound alongside the video.
  6. Set thinking_type (optional) — use auto or adjust for complex scene descriptions.
  7. Submit — generate, preview, and download your video.

Pricing

ResolutionWithout AudioWith Audio
360p$0.025/s$0.035/s
540p$0.035/s$0.045/s
720p$0.045/s$0.060/s
1080p$0.090/s$0.115/s

Billing Rules

  • Billing is calculated per second of video generated
  • For 360p: $0.025/s without audio, $0.035/s with audio
  • For 540p: $0.035/s without audio, $0.045/s with audio
  • For 720p: $0.045/s without audio, $0.060/s with audio
  • For 1080p: $0.090/s without audio, $0.115/s with audio
  • For $1 you can run this model for approximately 40 seconds at 360p (no audio) or about 11 seconds at 1080p (no audio)

Best Use Cases

  • Cinematic Storytelling — Render rich, narrative-driven scenes from detailed text descriptions.
  • Social Media Content — Generate short-form video clips at the right resolution and aspect ratio for any platform.
  • Marketing & Advertising — Produce promotional video content quickly without a film crew.
  • Concept Visualization — Bring creative directions and moods to life for pitching and storyboarding.
  • Audio-Visual Content — Use audio generation for fully immersive ambient scenes.

Pro Tips

  • Be specific in your prompt — include camera angle, lighting quality, motion style, and subject behavior.
  • Use 360p or 540p to rapidly test and iterate on prompts before committing to a 1080p final render.
  • Enable audio for scenes with strong environmental elements like crowds, nature, or music performances.
  • Use the Prompt Enhancer to expand a simple concept into a richly detailed generation prompt.

Notes

  • Only prompt is required; all other parameters are optional.
  • Please follow PixVerse's content usage policies when crafting prompts.

Related Models