Introducing Microsoft VibeVoice on WaveSpeedAI
Microsoft VibeVoice generates natural multi-speaker conversations with up to 4 voices. 9 preset voices across English, Chinese, and Indian languages. REST API, $0.12 per generation, no cold starts.
Introducing MiniMax Music 2.5 on WaveSpeedAI
MiniMax Music 2.5 is a full-dimensional breakthrough in AI music generation with high-fidelity audio, humanized vocals, and precise creative control. Ready-to-u
Introducing Sourceful Riverflow 2.0 Pro Edit on WaveSpeedAI
Sourceful Riverflow 2.0 Pro Edit is an agentic image model for high-precision editing and transformation. Up to 10 reference images, 4K output, transparent backgrounds. REST API, $0.135 per run, no cold starts.
Introducing Vidu Q2 Pro Image-to-Video Fast on WaveSpeedAI
Vidu Q2 Pro Fast Image to Video generates high-quality videos from a single image with faster generation speed. Ready-to-use REST inference API, best performanc
Introducing Vidu Q3 Start End To Video on WaveSpeedAI
Vidu Q3 Start End Image-to-Video turns text prompts into high-quality videos with exceptional visual fidelity and diverse motion. Ready-to-use REST inference AP
Introducing Vidu Q3 Turbo Start End To Video on WaveSpeedAI
Vidu Q3 Turbo Start-End-to-Video creates smooth transitions between two images with faster processing. Ready-to-use REST inference API, best performance, no col
Introducing Vidu Q3 Turbo Image-to-Video on WaveSpeedAI
Vidu Q3 Turbo Image-to-Video animates static images with high-quality motion and faster processing. Ready-to-use REST inference API, best performance, no coldst
Introducing Vidu Q3 Turbo Text-to-Video on WaveSpeedAI
Vidu Q3 Turbo Text-to-Video generates high-quality videos from text prompts with faster processing. Ready-to-use REST inference API, best performance, no coldst
Introducing WaveSpeedAI Ace Step 1.5 on WaveSpeedAI
ACE-Step 1.5 generates up to 4-minute music with lyrics from text. Supports 50+ languages, high acoustic fidelity, and runs efficiently on consumer hardware. Re
Introducing WaveSpeedAI Heartmula Generate Music on WaveSpeedAI
HeartMuLa is a state-of-the-art music generation model that creates high-quality songs from lyrics and style tags. Ready-to-use REST inference API with best per
Introducing WaveSpeedAI Heartmula Transcribe Lyrics on WaveSpeedAI
HeartMuLa Transcribe extracts lyrics from audio files using advanced AI. Supports multilingual transcription. Ready-to-use REST inference API with best performa
Introducing WaveSpeedAI Hunyuan 3d V3.1 Image To 3d Rapid on WaveSpeedAI
Hunyuan 3D V3.1 Rapid is a fast image-to-3D generation model, quickly converting 2D images into 3D models. Ready-to-use REST inference API, best performance, no