Browse 1,000+ AI Models

image-to-image

bytedance / seedream-v5.0-pro / edit

Seedream V5.0 Pro Edit by ByteDance edits and generates images from single-image or multi-reference inputs, supporting up to 10 reference images, aspect ratio selection, and 1K / 2K output tiers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

text-to-image

$0.0450 → $0.0405

bytedance / seedream-v5.0-pro

Seedream V5.0 Pro Text to Image by ByteDance generates high-quality images from text prompts, with aspect ratio selection, strong prompt following, and 1K / 2K output tiers for flexible image creation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

image-to-video

$0.6000 → $0.5400

bytedance / seedance-2.0 / image-to-video

Seedance 2.0 (Image-to-Video) generates Hollywood-grade cinematic videos from reference images and text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture, it preserves the input image's subject and composition while adding expressive, physically accurate motion.

5% OFF

image-to-image

$0.0700 → $0.0665

openai / gpt-image-2 / edit

OpenAI's GPT Image 2 Edit enables image editing from natural-language instructions with one or more reference images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

5% OFF

text-to-image

$0.0600 → $0.0570

openai / gpt-image-2 / text-to-image

OpenAI's GPT Image 2 Text-to-Image generates high-quality images from natural-language prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

video-to-video

$0.7500 → $0.6750

bytedance / seedance-2.0 / video-edit

Seedance 2.0 (Video-Edit) edits an input video from a natural-language prompt. The reference video drives subject identity, composition, and motion while the model rewrites lighting, style, weather, environment, or specific elements as instructed. Built on ByteDance Seed's unified multimodal architecture for cinematic, motion-stable output. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

video-to-video

$0.9500 → $0.8550

bytedance / seedance-2.0 / video-edit-turbo

Seedance 2.0 (Video-Edit Turbo) is the turbo tier for editing an input video from a natural-language prompt — faster, more affordable high-resolution output while preserving subject identity, composition, and motion. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

video-to-video

$0.8500 → $0.7650

bytedance / seedance-2.0-fast / video-edit-turbo

Seedance 2.0 Fast (Video-Edit Turbo) is the fastest, cheapest turbo tier for editing an input video from a natural-language prompt — high-resolution output with optimized cost and speed. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

video-to-video

$0.6500 → $0.5850

bytedance / seedance-2.0-fast / video-edit

Seedance 2.0 Fast (Video-Edit) edits an input video from a natural-language prompt at a faster, cheaper tier. Built on ByteDance Seed's unified multimodal architecture, it preserves subject identity, composition, and motion while rewriting lighting, style, weather, environment, or specific elements as instructed. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

video-extend

$0.6000 → $0.5400

bytedance / seedance-2.0 / video-extend

Seedance 2.0 (Video-Extend) extends an input video with a new cinematic continuation generated from its last frame and a natural-language prompt. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

video-extend

$0.5000 → $0.4500

bytedance / seedance-2.0-fast / video-extend

Seedance 2.0 Fast (Video-Extend) extends an input video with a new cinematic continuation generated from its last frame and a natural-language prompt — at the faster, cheaper Seedance 2.0 Fast tier. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

10% OFF

text-to-video

$0.6000 → $0.5400

bytedance / seedance-2.0 / text-to-video

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

10% OFF

image-to-video

$0.6000 → $0.5400

bytedance / seedance-2.0 / image-to-video-spicy

Seedance 2.0 Spicy Image to Video is a fast AI image-to-video generation model that creates high-quality cinematic clips from images, optimized for scalable content generation with smooth animations and stable aesthetics. Ready-to-use REST inference API for animating images, social media clips, product videos, advertising creatives, visual storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

10% OFF

image-to-video

$0.5000 → $0.4500

bytedance / seedance-2.0-fast / image-to-video

Seedance 2.0 Fast (Image-to-Video) generates cinematic videos from reference images and text prompts with native audio-visual synchronization, director-level control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

10% OFF

text-to-video

$0.5000 → $0.4500

bytedance / seedance-2.0-fast / text-to-video

Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

10% OFF

image-to-video

$0.5000 → $0.4500

bytedance / seedance-2.0-fast / image-to-video-spicy

Seedance 2.0 Fast Spicy Image to Video is a fast AI image-to-video generation model that creates high-quality cinematic clips from images at faster speed and lower cost, optimized for scalable content generation with smooth animations and stable aesthetics. Ready-to-use REST inference API for animating images, social media clips, product videos, advertising creatives, visual storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

10% OFF

image-to-image

$0.1400 → $0.1260

google / nano-banana-pro / edit

Google Nano Banana Pro (Gemini 3.0 Pro Image) Edit enables image editing with 4K-capable output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

image-to-image

$0.0700 → $0.0630

google / nano-banana-2 / edit

Google Nano Banana 2 Edit (Gemini 3.1 Flash Image) enables advanced image editing with 4K-capable output, fast iteration, and precise instruction following. Supports text translation, localization within images, and maintains subject consistency during edits. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

image-to-video

$0.7000 → $0.6300

bytedance / seedance-2.0 / image-to-video-turbo

Seedance 2.0 (Image-to-Video Turbo) generates cinematic 720p/1080p videos from reference images, delivering high-resolution output at near-480p speed with native audio-visual synchronization, director-level control, and exceptional motion stability.

10% OFF

text-to-video

$0.7000 → $0.6300

bytedance / seedance-2.0 / text-to-video-turbo

Seedance 2.0 (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts, delivering high-resolution output at near-480p speed with native audio-visual synchronization, director-level control, and exceptional motion stability.

10% OFF

image-to-video

$0.6000 → $0.5400

bytedance / seedance-2.0-fast / image-to-video-turbo

Seedance 2.0 Fast (Image-to-Video Turbo) generates cinematic 720p/1080p videos from reference images using speed-optimized inference. It is the fastest and most affordable Seedance image-to-video option, with native audio-visual synchronization and director-level control.

10% OFF

text-to-video

$0.6000 → $0.5400

bytedance / seedance-2.0-fast / text-to-video-turbo

Seedance 2.0 Fast (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts using speed-optimized inference. It is the fastest and most affordable Seedance option, with native audio-visual synchronization and director-level control.

10% OFF

text-to-image

$0.1400 → $0.1260

google / nano-banana-pro / text-to-image

Google's Nano Banana Pro (Gemini 3.0 Pro Image) is a cutting-edge text-to-image model enabling high-res 4K image generation optimized for phones. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

10% OFF

text-to-image

$0.0700 → $0.0630

google / nano-banana-2 / text-to-image

Google Nano Banana 2 (Gemini 3.1 Flash Image) delivers Pro-quality image generation at Flash speed with 512px to 4K resolution support. Features include improved text rendering, character consistency for up to 5 characters, and real-world knowledge integration. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.1300

minimax / h3 / image-to-video

MiniMax H3 Image to Video animates a first-frame image into a coherent 2K video, with natural-language motion instructions and optional last-frame control for consistent motion, scene continuity, and cinematic video generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.1300

minimax / h3 / reference-to-video

MiniMax H3 Reference to Video generates coherent 2K videos from natural-language prompts and multimodal references, including images, videos, and audio, guiding subject consistency, motion, timing, visual style, and scene continuity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

$0.1300

minimax / h3 / text-to-video

MiniMax H3 Text to Video generates coherent 2K videos from text prompts, with flexible 5-15 second duration and adaptive or custom aspect ratios for cinematic scenes, creative videos, and production workflows. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

digital-human

$0.1500

wavespeed-ai / infinitetalk / multi

InfiniteTalk Multi converts a single image and two audio inputs into multi-character talking or singing videos at up to 720p. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

digital-human

$0.1500

wavespeed-ai / infinitetalk

InfiniteTalk converts one photo and an audio track into audio-driven talking or singing avatar videos (Image-to-Video) up to 10 minutes long; the 720p tier costs $0.30 per 5 s. Ready-to-use REST API, no coldstarts, affordable pricing.

image-to-video

$0.6000

bytedance / seedance-2.0-mini / image-to-video

Seedance 2.0 Mini Image to Video is ByteDance's faster, lower-cost image-to-video model for cinematic multi-shot videos. It turns reference images and optional text prompts into narrative sequences with AI camera control, consistent characters across scenes, 480P / 720P / 1080P / 4K output, 4-15s duration, and flexible aspect ratios. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

$0.7500

bytedance / seedance-2.0-mini / video-edit

Seedance 2.0 Mini Video Edit is ByteDance's faster, lower-cost video editing model for prompt-guided video modification. It edits existing videos with cinematic multi-shot quality, AI camera control, consistent characters, 480P / 720P / 1080P / 4K output, 4-15s duration, and flexible aspect ratios. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.7000

bytedance / seedance-2.0-mini / image-to-video-turbo

Seedance 2.0 Mini Image to Video Turbo is ByteDance's faster, lower-cost image-to-video model for cinematic multi-shot videos. It turns reference images and optional text prompts into narrative sequences with AI camera control, consistent characters, 720P / 1080P output, 5-12s duration, and flexible aspect ratios. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

$0.9500

bytedance / seedance-2.0-mini / video-edit-turbo

Seedance 2.0 Mini is ByteDance's faster, lower-cost video generation model for text to video and image to video. It creates cinematic multi-shot videos with AI camera control, consistent characters across scenes, 720P / 1080P output, 5-12s duration, and flexible aspect ratios. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.7000

alibaba / happyhorse-1.1 / reference-to-video

Alibaba HappyHorse 1.1 Reference to Video generates new video scenes from reference images, preserving character identity, visual style, and scene consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

$0.7000

alibaba / happyhorse-1.0 / video-edit

Alibaba Happy Horse 1.0 (Video Edit) performs prompt-driven video editing with multi-image reference support, supporting 720p/1080p output. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.7000

alibaba / happyhorse-1.0 / reference-to-video

Alibaba Happy Horse 1.0 (Reference-to-Video) generates new video scenes guided by reference images, maintaining consistent characters, styles, and visual identity. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

video-extend

$0.7000

alibaba / happyhorse-1.1 / video-extend

Alibaba HappyHorse 1.1 Video Extend extends existing videos with seamless AI-generated continuation, supporting 720P / 1080P output while preserving visual continuity and motion consistency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-extend

$0.7000

alibaba / happyhorse-1.0 / video-extend

Alibaba Happy Horse 1.0 (Video Extend) extends existing videos with seamless AI-generated continuation, supporting 720p/1080p output. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

image-to-image

$0.0750

alibaba / wan-2.7 / image-edit-pro

Wan 2.7 Image Edit Pro performs prompt-driven image editing with multi-image reference support and up to 2K output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

$0.6000

alibaba / wan-2.7 / image-to-video-pro

Wan 2.7 Image to Video Pro is a fast AI image-to-video generation model that converts images into premium-quality videos with superior motion dynamics, enhanced visual fidelity, and professional cinematic output. Ready-to-use REST inference API for product videos, advertising creatives, cinematic clips, social media content, character animation, visual storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

image-to-video

$0.1000

wavespeed-ai / open-video / image-to-video

OpenVideo Image to Video is a fast AI image-to-video generation model that creates short cinematic clips with native audio from a single reference image. It gives users full creative control over scene, style, and motion prompts, with support for 480p, 720p, and 1080p output and 3–20 second duration tiers. Ready-to-use REST inference API for cinematic clips, product videos, social media content, advertising creatives, visual storytelling, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

text-to-video

$0.6000

bytedance / seedance-2.0-mini / text-to-video

Seedance 2.0 Mini Text to Video is ByteDance's faster, lower-cost text-to-video model for cinematic multi-shot videos. It generates narrative sequences from text prompts with AI camera control, consistent characters across scenes, 480P / 720P / 1080P / 4K output, 4-15s duration, and flexible aspect ratios. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

$0.0400

google / nano-banana-2-lite / edit

Google Nano Banana 2 Lite Edit transforms uploaded images with text instructions, supporting fast prompt-guided image editing, visual refinements, and creative changes with low latency. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image

$0.0400

google / nano-banana-2-lite / text-to-image

Google Nano Banana 2 Lite Text to Image generates high-quality images from text prompts with low latency, flexible aspect ratios, and fast image creation for creative and production workflows. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

$0.0100

pruna-ai / p-image / edit

Pruna AI P-Image Edit is a fast AI image editing model that edits and transforms images based on text instructions and reference images. Ready-to-use REST inference API for photo retouching, creative edits, product image updates, background changes, marketing assets, and AI image editing workflows with simple integration, no coldstarts, and affordable pricing.

lora-support

$0.1500

wavespeed-ai / open-video / image-to-video-lora

OpenVideo Image to Video LoRA is a fast AI image-to-video generation model that creates short cinematic clips with native audio from a single reference image, with optional preset control and per-LoRA strength settings for style, motion, and look-and-feel. Supports 480p, 720p, and 1080p output and 3–20 second duration tiers. Ready-to-use REST inference API for cinematic clips, character-consistent videos, stylized motion, product videos, social media content, advertising creatives, and professional LoRA-based image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

image-to-video

$0.0200

pruna-ai / p-video / image-to-video

Pruna AI P-Video Image to Video is a fast AI video generation model that transforms input images into high-quality videos. Ready-to-use REST inference API for animating product photos, character art, marketing creatives, social media content, visual storytelling, and image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

text-to-image

$0.0600

wavespeed-ai / krea-v2-large / text-to-image

Krea 2 Large Text to Image is a fast AI image generation model that creates high-fidelity images from text prompts with aspect ratio, creativity, and optional style reference controls. Ready-to-use REST inference API for creative design, marketing visuals, product mockups, brand assets, social media content, concept art, and professional text-to-image workflows with simple integration, no coldstarts, and affordable pricing.

Ultimate AI Media Generation Platform

Category

Grok Imagine Video V1.5 Models

MiniMax H3 Models

HappyHorse 1.1 Models

Luma AI Models

Bria AI Models

Sonilo AI Models

Skywork AI Models

Mureka AI Models

Clarity AI Models

Pruna AI Models

HappyHorse Models

Seedance 2.0 Models

Wan 2.7 Models

Qwen Image 2 Models

Grok Models

Seedance 1.5 Pro Models

Wan 2.6 Models

Kling O3 Models

OpenAI Models

Wan 2.5 Models

Seedream Models

Wan 2.2 Models

Dreamina AI Models

Seedance Models

Flux Image Tools

Minimax Hailuo Models

Kling Models

Google Models

Flux Kontext Models

Runwayml AI Models

Wan 2.1 Video Models

Hunyuan Models

Vidu Models

Ideogram Image Models

Recraft Image

Qwen AI Models

Pixverse AI Models

Stability AI Models

Video Extend

Object Detection and Segmentation

Content Detection Models

Motion Control Models

Best Video Models

Best Image Models

Swap Anything

Audio for Video

Video Edit

Ultra Selection

LoRA Generation

Generate Music

First and Last Frame Video

Remove Anything

3D Creation

Avatar Lipsync Models

Training Tools

Enhance Videos

Image Editing

Upscale Image

Speech Generation

text-to-video

text-to-image

lora-support

image-to-video

image-to-image

video-dubbing

training

video-to-video

upscaler

portrait-transfer

text-to-audio

audio-to-audio

ai-remover

digital-human

image-to-3d

motion-control

content-moderation

llm
