
SkyReels V3 Standard Single Avatar API


SkyReels V3 Standard Single Avatar is a fast AI talking-avatar video generation model that creates audio-driven avatar videos from one image, one audio file, and a motion prompt. It is offered as a ready-to-use REST inference API for digital humans, virtual presenters, product explainers, social media videos, education content, marketing creatives, and professional avatar video workflows, with simple integration, no cold starts, and affordable pricing.


$0.04 per run · ~25 runs per $1


README

Skywork AI SkyReels V3 Standard Single Avatar

Skywork AI SkyReels V3 Standard Single Avatar generates a talking avatar video from a single reference image and an audio clip. It is designed for character-driven video generation where the image defines the avatar and the audio drives the speaking performance, making it suitable for digital presenters, short-form avatar content, narration clips, and virtual spokesperson workflows.

Why Choose This?

  • Single-image avatar generation: Turn one portrait image into a speaking avatar video.

  • Audio-driven lip-sync: Use an uploaded audio track to drive speech timing and performance.

  • Prompt-guided behavior: Add a short prompt to influence expression, motion, or presentation style.

  • Simple avatar workflow: Upload an image, upload audio, write a prompt, and generate the final clip with minimal setup.

  • Production-ready API: Suitable for avatar presenters, talking portraits, announcement videos, and other character-led media workflows.

Parameters

| Parameter | Required | Description |
|-----------|----------|-------------|
| prompt | Yes | Text instruction describing the desired avatar behavior, style, or delivery. |
| image | Yes | Reference image used as the avatar source. |
| audio | Yes | Audio track used to drive the avatar’s speaking performance. |
| duration | No | Output video duration in seconds. |
| seed | No | Random seed for reproducibility, if supported in the workflow. |
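
The parameters above can be assembled into a request body before calling the API. The following is a minimal sketch, not the official client: the field names come from the table, while the helper name `build_payload`, the use of URLs for `image` and `audio`, and the validation rules are assumptions made for illustration.

```python
from typing import Any, Dict, Optional

# Required fields, as listed in the parameter table.
REQUIRED_FIELDS = ("prompt", "image", "audio")


def build_payload(
    prompt: str,
    image: str,
    audio: str,
    duration: Optional[int] = None,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Build a request body (hypothetical helper, not part of any SDK).

    Assumes `image` and `audio` are passed as strings such as hosted URLs.
    """
    payload: Dict[str, Any] = {"prompt": prompt, "image": image, "audio": audio}

    # Reject empty values for the three required fields.
    missing = [name for name in REQUIRED_FIELDS if not payload[name]]
    if missing:
        raise ValueError(f"Missing required field(s): {', '.join(missing)}")

    # Optional fields are only included when the caller sets them.
    if duration is not None:
        if duration <= 0:
            raise ValueError("duration must be a positive number of seconds")
        payload["duration"] = duration
    if seed is not None:
        payload["seed"] = seed

    return payload
```

Leaving `duration` and `seed` out of the body when they are unset lets the service apply its own defaults, whatever those may be.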

How to Use

  1. Upload your image — provide a clear portrait image of the person you want to animate.
  2. Upload your audio — use a clean audio clip to drive the avatar’s speech.
  3. Write your prompt — describe the desired speaking style, facial behavior, or overall presentation.
  4. Set duration (optional) — choose the desired output length if needed.
  5. Submit — run the model and download the generated avatar video.
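
The same steps can be sketched as a REST call using only the Python standard library. This is an illustration under stated assumptions: `API_URL` is a placeholder rather than the real endpoint, Bearer-token authentication is assumed, and the response format is not documented in this README, so `submit` simply returns the decoded JSON. Check the official API reference for the actual endpoint, authentication scheme, and result-retrieval flow.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

# Placeholder endpoint (assumption): replace with the URL from the official API docs.
API_URL = "https://api.example.com/skyreels-v3/standard/single-avatar"


def make_request(
    api_key: str,
    prompt: str,
    image_url: str,
    audio_url: str,
    duration: Optional[int] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for one avatar generation job."""
    body: Dict[str, Any] = {"prompt": prompt, "image": image_url, "audio": audio_url}
    if duration is not None:
        body["duration"] = duration  # optional, in seconds
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Bearer authentication is an assumption, not confirmed by this README.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def submit(request: urllib.request.Request, timeout: float = 60.0) -> Dict[str, Any]:
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Steps 1-4: image, audio, prompt, and optional duration.
request = make_request(
    api_key="YOUR_API_KEY",
    prompt="Let the woman speak naturally with subtle head motion, "
    "calm facial expression, and realistic lip-sync.",
    image_url="https://example.com/portrait.png",
    audio_url="https://example.com/voice.mp3",
    duration=5,
)
# Step 5: call submit(request) once API_URL and the API key are real.
```

Separating request construction from sending makes the payload easy to inspect or log before any credits are spent.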

Example Prompt

Let the woman speak naturally with subtle head motion, calm facial expression, and realistic lip-sync.

Pricing

Pricing is based on output video duration. The listed prices work out to $0.04 per second.

| Duration | Cost |
|----------|------|
| 5s | $0.20 |
| 10s | $0.40 |
| 15s | $0.60 |
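
The listed prices are consistent with a flat rate of $0.04 per second. A small sketch for estimating cost before submitting a job follows. The function name `estimate_cost` is hypothetical, and applying the same rate to durations other than 5, 10, and 15 seconds is an assumption, since the table only lists those three.

```python
from decimal import Decimal

# Rate inferred from the pricing table: $0.20 / 5s = $0.40 / 10s = $0.60 / 15s.
PRICE_PER_SECOND = Decimal("0.04")


def estimate_cost(duration_seconds: int) -> Decimal:
    """Estimate the cost in USD for a clip of the given length.

    Assumes linear per-second pricing; only 5s, 10s, and 15s are
    confirmed by the table.
    """
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return PRICE_PER_SECOND * duration_seconds


for seconds in (5, 10, 15):
    print(f"{seconds}s -> ${estimate_cost(seconds)}")
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.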

Best Use Cases

  • Talking portrait videos — Turn a still portrait into a speaking clip.
  • Digital spokesperson content — Create short avatar-based communications or announcements.
  • Virtual presenters — Generate presenter videos for demos, explainers, or onboarding content.
  • Social media avatar clips — Produce short-form talking-head content from a single image.
  • Narration-driven character media — Pair a portrait with recorded audio for expressive delivery.

Pro Tips

  • Use a clean, front-facing portrait for better avatar stability.
  • Upload clear audio for stronger lip-sync and more natural speaking motion.
  • Keep the prompt simple and focused on expression or delivery style.
  • Start with shorter durations to validate quality before generating longer clips.
  • Use a consistent portrait and audio setup when iterating on the same avatar.

Notes

  • prompt, image, and audio are required.
  • Pricing depends on duration.
  • A clear portrait image and clean audio generally improve output quality.
  • This workflow is intended for single-avatar speaking video generation.
