InfiniteTalk is an audio-driven conversational AI video generation model. Create talking or singing videos from a single image and audio input. Our endpoint starts with $0.15 per 5 seconds (480p) or $0.3 per 5 seconds (720p) video generation and supports a maximum generation length of 10 minutes.
$0.15
infinitetalk
$0.15
infinitetalk/multi
$0.15
infinitetalk/video-to-video
$0.2
wan-2.2/animate
$1.8
veo3-fast/image-to-video
$3.2
veo3/image-to-video
$0.15
lipsync
$0.025
lipsync-1.9.0-beta
$0.05
lipsync-2
$0.08
lipsync-2-pro
$0.15
lipsync
$0.15
multitalk
$0.15
wan-2.1/multitalk
$0.15
wan-2.2/speech-to-video
$0.05
song-generation
$0.12
avatar-omni-human
$0.25
avatar-omni-human-1.5
$0.15
lipsync/audio-to-video
$0.15
kling-lipsync/audio-to-video
$0.14
kling-lipsync/text-to-video
$0.15
latentsync
$0.0375
video-translate