Seedance 2.0 | Special Offer ✦ 10% OFF NOW | Ends May 13 (UTC+0)

Pruna AI P-Video Avatar API

pruna-ai /

Pruna AI P-Video Avatar is a fast AI avatar video generation model that creates high-quality avatar videos for digital humans, talking characters, social media content, marketing creatives, virtual presenters, and AI video workflows. Ready-to-use REST inference API with simple integration, no coldstarts, and affordable pricing.

digital-human
Giriş

Sürükleyip bırakın veya yüklemek için tıklayın

preview

Sürükleyip bırakın veya yüklemek için tıklayın

Boşta

$0.025çalıştırma başına·~40 / $1

ÖrneklerTümünü görüntüle

İlgili Modeller

README

Pruna AI P-Video Avatar

Pruna AI P-Video Avatar generates a talking or performing avatar video from a reference image and an audio clip, with optional prompt guidance for motion and expression. It is designed for character-driven video generation where the image defines the avatar and the audio drives the timing and delivery.

Why Choose This?

  • Image + audio avatar generation Combine a reference image with an audio track to generate a video avatar performance.

  • Prompt-guided motion control Use video_prompt to nudge expression, movement, or overall performance style.

  • Simple output settings Choose resolution and seed without a heavy configuration workflow.

  • Audio-driven timing Video length follows the uploaded audio duration, making it easier to generate synced outputs.

  • Production-ready workflow Useful for avatar clips, talking portraits, character presentations, and short-form content generation.

Parameters

ParameterRequiredDescription
imageYesReference image used as the avatar source.
audioYesAudio file used to drive the avatar video.
video_promptNoOptional prompt describing expression, motion, or overall video style. Keep it simple for better stability.
resolutionNoOutput resolution: 720p or 1080p.
seedNoRandom seed for reproducibility. Use the same seed for more consistent results.

How to Use

  1. Upload your image — provide the reference image you want to animate.
  2. Upload your audio — use a clear audio clip to drive the avatar performance.
  3. Add a simple video prompt (optional) — describe only the key motion or mood you want.
  4. Choose resolution — use 720p for lower cost or 1080p for higher quality.
  5. Set a seed (optional) — use a fixed seed for more reproducible outputs.
  6. Submit — run the model and download the generated avatar video.

Example Prompt

Natural head movement, subtle facial expression, stable identity, clean speaking performance, realistic motion

Pricing

Pricing is based on the audio duration and resolution.

720p

Audio DurationCost
5s$0.125
10s$0.25
30s$0.75
60s$1.50
600s$15.00

1080p

Audio DurationCost
5s$0.225
10s$0.45
30s$1.35
60s$2.70
600s$27.00

Billing Rules

  • Pricing is based on the uploaded audio duration
  • Minimum billed duration is 5 seconds
  • Maximum billed duration is 600 seconds
  • 720p uses a base rate of $0.025 per second
  • 1080p costs 1.8× the 720p rate
  • video_prompt and seed do not affect pricing

Best Use Cases

  • Talking avatar videos — Generate speaking portraits from a single image and audio track.
  • Character presentation clips — Create short performance-based videos for storytelling or demos.
  • Social media avatar content — Produce short avatar-driven clips for lightweight content workflows.
  • Narration-driven character scenes — Pair a static character image with voice content for expressive video output.
  • Prototype virtual presenters — Quickly test avatar-based presentation ideas without full animation workflows.

Pro Tips

  • Keep the audio reasonably short for better reliability and easier iteration.
  • Use a clear, front-facing image for better avatar stability.
  • Keep video_prompt simple and direct — overly detailed prompts are more likely to fail.
  • Focus the prompt on a few essentials, such as natural motion, subtle expression, or stable identity.
  • Start with 720p for testing, then switch to 1080p for final-quality outputs.
  • Reuse the same seed when you want more consistent variations.

Notes

  • Both image and audio are required.
  • Very long audio is not recommended; shorter clips are easier to generate successfully.
  • The model works best when video_prompt is simple rather than highly detailed.
  • Billing uses the audio duration, with a minimum of 5 seconds and a cap of 600 seconds.
  • save_audio is not exposed in the current input settings shown here.

Related Models

Erişilebilirlik:Bu web sitesi, üçüncü taraflarca sağlanan yapay zeka modellerini kullanmaktadır.