Introducing Kuaishou Kling V2.1 I2V Pro Start End Frame on WaveSpeedAI
Kling v2.1 I2V Pro Start-End Frame generates cinematic Image-to-Video clips with precise start/end frame control, enhanced visual fidelity, and dynamic camera motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing Kuaishou Kling Video O1 Std Text-to-Video on WaveSpeedAI
Kling Omni Video O1 (Standard) is Kuaishou's first unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Ready-to-use
Introducing Lightricks LTX 2 Retake on WaveSpeedAI
LTX-2 Retake performs targeted retakes on any section of a video—replace visuals, audio, or both—while preserving timing and continuity with $0.1 per output video second. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing Luma Ray 2 Flash T2V on WaveSpeedAI
Luma Ray 2 Flash turns text into high-quality videos with flexible sizes and built-in prompt optimization for precise outputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing Luma Ray 2 T2V on WaveSpeedAI
Luma Ray 2 is a Text-to-Video model that creates high-quality videos from text prompts, with advanced prompt optimization and support for various video sizes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing MiniMax Hailuo 02 Fast on WaveSpeedAI
Hailuo 02 Fast is a minimax image-to-video model that creates high-quality 6s and 10s clips at 512p for creators and marketers. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing WaveSpeedAI Steady Dancer on WaveSpeedAI
SteadyDancer is a 14B-parameter human image animation framework that transforms static images into coherent dance videos. Features first-frame preservation, robust identity consistency, and temporal coherence for realistic motion generation. Ready-to-use REST inference API, best performance, no cold
Introducing MiniMax Speech 2.5 Hd Preview on WaveSpeedAI
MiniMax Speech 2.5 HD Preview offers HD TTS with enhanced multilingual expressiveness, accurate voice cloning, and 40-language support. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
Introducing MiniMax Speech 2.6 Hd on WaveSpeedAI
Minimax Speech 2.6 HD: Ultra-human, low-latency (< 250ms) TTS with voice cloning, text normalization and support for 40+ languages. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing MiniMax Speech 2.6 Turbo on WaveSpeedAI
Minimax Speech 2.6 Turbo is a Text-to-Speech model offering ultra-human voice cloning, industry-leading text normalization, sub-250ms latency and 40+ language support. Pricing: $0.06 per 1000 characters. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing Clarity AI Crystal Upscaler on WaveSpeedAI
Clarity AI Crystal Upscaler boosts image resolution with AI upscaling and adjustable detail for portraits and landscapes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Introducing MiniMax Speech 2.5 Turbo Preview on WaveSpeedAI
Minimax Speech 2.5 Turbo Preview: HD TTS with multilingual support, accurate voice replication across 40 languages. $0.04/1000 chars. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.