
InfiniteTalk is an audio-driven conversational AI video generation model. Create talking or singing videos from a single image and audio input. Our endpoint starts with $0.15 per 5 seconds (480p) or $0.3 per 5 seconds (720p) video generation and supports a maximum generation length of 10 minutes.
$0.15
infinitetalk
$0.2
wan-2.1/mocha
$0.2
wan-2.2/animate
$0.15
infinitetalk/multi
$0.15
infinitetalk/video-to-video
$0.05
speech-02-hd
$0.03
speech-02-turbo
$0.2
lipsync
$0.025
lipsync-1.9.0-beta
$0.05
lipsync-2
$0.08
lipsync-2-pro
$0.15
lipsync
$0.15
multitalk
$0.15
wan-2.1/multitalk
$0.15
wan-2.2/speech-to-video
$0.05
song-generation
$0.5
voice-clone
$0.12
avatar-omni-human
$0.25
avatar-omni-human-1.5
$0.5
voice-design
$0.15
lipsync/audio-to-video
$0.15
kling-lipsync/audio-to-video
$0.14
kling-lipsync/text-to-video
$0.06
speech-2.5-hd-preview
$0.04
speech-2.5-turbo-preview
$0.15
latentsync
$0.03
music-v1.5
$0.0375
video-translate