Introducing ByteDance Dreamina V3.1 Text-to-Image on WaveSpeedAI
ByteDance Dreamina V3.1 is a text-to-image model with enhanced aesthetics and style accuracy, producing richer, more polished images. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ByteDance LipSync Audio To Video on WaveSpeedAI
ByteDance LipSync turns audio into lifelike talking videos by generating precise lip movements synced to the input audio. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ByteDance Seedance V1 Lite Reference To Video on WaveSpeedAI
ByteDance Seedance V1 Lite converts one to four reference images into high-quality videos using reference-to-video (image-to-video) generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ByteDance Seedream V3.1 on WaveSpeedAI
Seedream V3.1 by ByteDance is a text-to-image model with upgraded visuals, stronger style fidelity, and rich detail from text prompts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ByteDance Seedream V4.5 Sequential on WaveSpeedAI
Seedream 4.5 Sequential generates multi-image sets with consistent characters and objects, unifying palette, lighting, and style across all outputs. Supports up to 4K results for campaigns, storyboards, and product lines. Ready-to-use REST inference API, best performance, no cold starts, affordable
Introducing ByteDance Video Upscaler on WaveSpeedAI
ByteDance Video Upscaler uses AI super-resolution to upscale videos to 4K and recover fine detail in a secure cloud environment. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ByteDance USO on WaveSpeedAI
USO (Unified Style-Subject Optimized) by ByteDance unifies style-driven and subject-driven generation to produce consistent outputs that blend artistic style with subject fidelity. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ElevenLabs Eleven V3 on WaveSpeedAI
ElevenLabs eleven-v3 is a text-to-speech model available as a hosted endpoint; requests cost $0.10 per 1,000 characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ElevenLabs Flash V2.5 on WaveSpeedAI
ElevenLabs Flash V2.5 is a text-to-speech model that converts text into spoken audio using the ElevenLabs Flash V2.5 engine. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ElevenLabs Flash V2 on WaveSpeedAI
ElevenLabs Flash V2 is a text-to-speech model that converts text into spoken audio using the ElevenLabs Flash V2 engine. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Introducing ElevenLabs Eleven V3 Timing on WaveSpeedAI
ElevenLabs Eleven-V3 Timing converts text to natural speech and returns alignment metadata—character/word timestamps in JSON—for precise subtitles, karaoke effects, and lip-sync. Supports voice_id, similarity/stability, and optional Speaker Boost. Priced at $0.10 per 1,000 characters. Ready-to-u
Introducing ElevenLabs Multilingual V1 on WaveSpeedAI
ElevenLabs Multilingual V1 provides natural-sounding text-to-speech across many languages. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.