P Video Avatar | AI Digital Human API

Pruna AI P-Video Avatar

Pruna AI P-Video Avatar generates a talking or performing avatar video from a reference image and an audio clip, with optional prompt guidance for motion and expression. It is designed for character-driven video generation where the image defines the avatar and the audio drives the timing and delivery.

Why Choose This?

Image + audio avatar generation Combine a reference image with an audio track to generate a video avatar performance.
Prompt-guided motion control Use video_prompt to nudge expression, movement, or overall performance style.
Simple output settings Choose resolution and seed without a heavy configuration workflow.
Audio-driven timing Video length follows the uploaded audio duration, making it easier to generate synced outputs.
Production-ready workflow Useful for avatar clips, talking portraits, character presentations, and short-form content generation.

Parameters

Parameter	Required	Description
image	Yes	Reference image used as the avatar source.
audio	Yes	Audio file used to drive the avatar video.
video_prompt	No	Optional prompt describing expression, motion, or overall video style. Keep it simple for better stability.
resolution	No	Output resolution: `720p` or `1080p`.
seed	No	Random seed for reproducibility. Use the same seed for more consistent results.

How to Use

Upload your image — provide the reference image you want to animate.
Upload your audio — use a clear audio clip to drive the avatar performance.
Add a simple video prompt (optional) — describe only the key motion or mood you want.
Choose resolution — use 720p for lower cost or 1080p for higher quality.
Set a seed (optional) — use a fixed seed for more reproducible outputs.
Submit — run the model and download the generated avatar video.

Example Prompt

Natural head movement, subtle facial expression, stable identity, clean speaking performance, realistic motion

Pricing

Pricing is based on the audio duration and resolution.

720p

Audio Duration	Cost
5s	$0.125
10s	$0.25
30s	$0.75
60s	$1.50
600s	$15.00

1080p

Audio Duration	Cost
5s	$0.225
10s	$0.45
30s	$1.35
60s	$2.70
600s	$27.00

Billing Rules

Pricing is based on the uploaded audio duration
Minimum billed duration is 5 seconds
Maximum billed duration is 600 seconds
720p uses a base rate of $0.025 per second
1080p costs 1.8× the 720p rate
video_prompt and seed do not affect pricing

Best Use Cases

Talking avatar videos — Generate speaking portraits from a single image and audio track.
Character presentation clips — Create short performance-based videos for storytelling or demos.
Social media avatar content — Produce short avatar-driven clips for lightweight content workflows.
Narration-driven character scenes — Pair a static character image with voice content for expressive video output.
Prototype virtual presenters — Quickly test avatar-based presentation ideas without full animation workflows.

Pro Tips

Keep the audio reasonably short for better reliability and easier iteration.
Use a clear, front-facing image for better avatar stability.
Keep video_prompt simple and direct — overly detailed prompts are more likely to fail.
Focus the prompt on a few essentials, such as natural motion, subtle expression, or stable identity.
Start with 720p for testing, then switch to 1080p for final-quality outputs.
Reuse the same seed when you want more consistent variations.

Notes

Both image and audio are required.
Very long audio is not recommended; shorter clips are easier to generate successfully.
The model works best when video_prompt is simple rather than highly detailed.
Billing uses the audio duration, with a minimum of 5 seconds and a cap of 600 seconds.
save_audio is not exposed in the current input settings shown here.

Related Models

Pruna AI P-Video Text-to-Video — Generate videos directly from natural-language prompts.
Pruna AI P-Video Image-to-Video — Animate a reference image into a video clip with prompt guidance.
Pruna AI P-Image Text-to-Image — Generate still images for image-first creative workflows.

P Video Avatar API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/pruna-ai/p-video/avatar with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for P Video Avatar below.

HTTP example

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/pruna-ai/p-video/avatar" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "image": "https://example.com/your-input.jpg",
    "audio": "https://example.com/your-audio.mp3",
    "video_prompt": "The person is talking.",
    "resolution": "720p",
    "seed": -1
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("pruna-ai/p-video/avatar", {
        "image": "https://example.com/your-input.jpg",
        "audio": "https://example.com/your-audio.mp3",
        "video_prompt": "The person is talking.",
        "resolution": "720p",
        "seed": -1
});

console.log(result.outputs[0]); // → URL of the generated output

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "pruna-ai/p-video/avatar",
    {
    "image": "https://example.com/your-input.jpg",
    "audio": "https://example.com/your-audio.mp3",
    "video_prompt": "The person is talking.",
    "resolution": "720p",
    "seed": -1
}
)

print(output["outputs"][0])  # → URL of the generated output

P Video Avatar API — Frequently asked questions

What is the P Video Avatar API?

P Video Avatar is a Pruna Ai model for talking-avatar generation, exposed as a REST API on WaveSpeedAI. Pruna AI P-Video Avatar is a fast AI avatar video generation model that creates high-quality avatar videos for digital humans, talking characters, social media content, marketing creatives, virtual presenters, and AI video workflows. Ready-to-use REST inference API with simple integration, no coldstarts, and affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the P Video Avatar API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/pruna-ai/pruna-ai-p-video-avatar.

How much does P Video Avatar cost per run?

P Video Avatar starts at $0.025 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does P Video Avatar accept?

Key inputs: `image`, `audio`, `resolution`, `seed`, `video_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/pruna-ai/pruna-ai-p-video-avatar.

How do I get started with the P Video Avatar API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use P Video Avatar outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Pruna Ai). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

ÖrneklerTümünü görüntüle

İlgili Modeller

README