Latest news on AI image and video generation models
Bytedance OmniHuman turns a single portrait photo into avatar video with lifelike motion and expressions ($0.12/sec). Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
OmniHuman 1.5 converts audio and visual cues into lifelike avatar animations for virtual humans, storytelling, and interactive agents. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
ByteDance Dreamina 3.0 Edit is an image-to-image model that enhances aesthetics, style and detail and accepts text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 converts text or image prompts into 1080P videos with natural expression, diverse styles, and multi-scene narratives. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 converts text or images into pro 720P videos with natural dynamic expression, diverse styles and multi-scene narratives. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 Pro creates 1080P videos from text or image prompts with natural dynamic expression and multi-scene narratives. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 Pro turns text or image prompts into 1080P professional videos with natural dynamics, diverse styles, and multi-scene narratives. Ready REST API; best performance, no coldstarts, affordable pricing.
ByteDance Dreamina V3.0 is a text-to-image model emphasizing upgraded visual effects, richer detail, and improved style accuracy to generate more aesthetic, faithful images from text prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 turns text or image prompts into 1080P videos with natural dynamic expression, diverse styles, and multiple scenes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Dreamina V3.0 creates 720P videos from text or image prompts with natural dynamic expression, diverse styles, and multi-scene narratives. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
ByteDance Dreamina V3.1 is a text-to-image model with enhanced aesthetics and style accuracy, producing richer, more polished images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Bytedance LipSync turns audio into lifelike talking videos by generating precise lip movements fully synced to input audio. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.