
LongCat Avatar produces super-realistic, lip-synchronized long video generation with natural dynamics and consistent identity. Converts one photo + audio into audio-driven talking or singing avatar videos (Image-to-Video), up to 2 minutes, 720p tier $0.40/5s. Ready-to-use REST API, no coldstarts, affordable pricing.
longcat-avatar
infinitetalk
wan-2.1/mocha
wan-2.2/animate
ltx-2-19b/lipsync
latentsync
kling-v1-ai-avatar-standard
wan-2.1/multitalk
infinitetalk/multi
infinitetalk-fast/multi
infinitetalk-fast
infinitetalk-fast/video-to-video
infinitetalk/video-to-video
lipsync
lipsync-1.9.0-beta
lipsync-2
lipsync-2-pro
lipsync
hunyuan-avatar
multitalk
wan-2.2/speech-to-video
avatar-omni-human
avatar-omni-human-1.5
lipsync/audio-to-video
kling-lipsync/audio-to-video
kling-lipsync/text-to-video
steady-dancer
latentsync
kling-v1-ai-avatar-pro
kling-v2-ai-avatar-standard
kling-v2-ai-avatar-pro
video-translate
fabric-1.0