Introducing WaveSpeedAI Molmo2 Prompt Optimizer on WaveSpeedAI
Molmo2-4B Prompt Optimizer: Enhance prompts for image and video generation with intelligent restructuring, style guidance, and context-aware improvements. Open-
Introducing WaveSpeedAI Molmo2 Text Content Moderator on WaveSpeedAI
Molmo2-4B Text Content Moderator: Analyze text content for safety, appropriateness, and policy compliance. Detects hate speech, violence, sexual content, and ot
Introducing WaveSpeedAI Molmo2 Video Captioner on WaveSpeedAI
Molmo2-4B Video Captioner: Generate detailed, accurate captions for videos with customizable detail levels (low, medium, high). Open-source vision-language mode
Introducing WaveSpeedAI Molmo2 Video Content Moderator on WaveSpeedAI
Molmo2-4B Video Content Moderator analyzes video content for safety, appropriateness, and policy compliance. Detects violence, nudity, gore, and other harmful v
Introducing WaveSpeedAI Molmo2 Video Qa on WaveSpeedAI
Molmo2-4B Video QA: Answer questions about video content with temporal understanding. Open-source vision-language model. Ready-to-use REST API, no cold starts,
Introducing WaveSpeedAI Molmo2 Video Understanding on WaveSpeedAI
Molmo2-4B Video Understanding: Analyze videos with specialized tasks (general, summary, analysis, counting, scene description). Open-source vision-language mode
Introducing WaveSpeedAI Openai Whisper With Video on WaveSpeedAI
OpenAI Whisper Large v3 (Video-to-Text) delivers high-accuracy multilingual transcription directly from video files, with automatic language detection and optio
Introducing WaveSpeedAI Paddle Ocr on WaveSpeedAI
PaddleOCR-VL is an ultra-compact 0.9B parameter vision-language model for document parsing, supporting 109 languages with text, table, formula, and chart recogn
Introducing WaveSpeedAI Qwen Image 2512 LoRA Trainer on WaveSpeedAI
Qwen-Image-2512 LoRA Trainer lets you train custom LoRA models 10x faster with style, character, and object training. From concept to model in minutes, not hour
Introducing WaveSpeedAI Qwen Image Text-to-Image 2512 LoRA on WaveSpeedAI
Qwen-Image-2512 LoRA is an enhanced 20B MMDiT text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST infer
Introducing WaveSpeedAI Video Background Remover on WaveSpeedAI
WaveSpeed Video Background Remover replaces or removes video backgrounds with a custom image. Upload or paste a link to your video, then provide a background im
Introducing WaveSpeedAI Z Image Turbo Controlnet on WaveSpeedAI
Z-Image-Turbo ControlNet generates images guided by structural control signals (depth, canny edge, pose) for precise composition control. Ready-to-use REST infe