Overview

Overview

WaveSpeedAI is a unified AI platform that provides access to 700+ state-of-the-art models for image generation, video creation, audio synthesis, and more. Whether you’re a developer building AI-powered applications or a creator producing visual content, WaveSpeedAI offers the tools you need through a single API.

Platform Pages

PageDescription
ModelsBrowse 700+ models for image, video, audio, and 3D generation
StudioAdvanced workspace for complex AI workflows
HistoryView your generation history (retained for 7 days)
LLMChat with leading large language models
ServerlessDeploy custom AI workers on our infrastructure
InspirationDiscover community creations and prompts
Desktop AppDownload native apps for Windows, macOS, Linux
BlogTutorials, announcements, and tips
AffiliateJoin our affiliate program and earn commissions

What You Can Create

Image Generation

CategoryDescription
Text to ImageGenerate images from text prompts
Image to ImageEdit, transform, or style transfer images
UpscalerEnhance resolution and image quality
AI RemoverRemove objects, backgrounds, or watermarks

Popular models: Seedream 4.5, FLUX 2, Nano Banana Pro, Qwen Image Edit

Video Generation

CategoryDescription
Image to VideoAnimate still images into video
Text to VideoCreate videos from text descriptions
Video EffectsApply visual effects and filters
Video to VideoEdit, restyle, or enhance videos
Video ExtendExtend video duration seamlessly
Motion ControlControl movement and animation

Popular models: Veo 3.1, Kling 2.6, Sora 2, Hailuo 2.3, Wan 2.6, Seedance

Audio & Speech

CategoryDescription
Text to SpeechGenerate natural voice from text
Text to AudioCreate music and sound effects
Speech to TextTranscribe audio to text
Audio EditingEdit and enhance audio files
Video to AudioGenerate audio/music for videos

Popular models: Minimax Speech, ElevenLabs, Minimax Music 02

Digital Human & Portrait

CategoryDescription
Digital HumanCreate talking avatar videos
Portrait TransferFace swap and portrait editing

Popular models: InfiniteTalk, MultiTalk, LatentSync, Kling Motion Control

3D Generation

CategoryDescription
Image to 3DGenerate 3D models from images
Text to 3DCreate 3D models from text

Popular models: Hunyuan3D, Tripo3D, Hyper3D

AI Vision & Analysis

CategoryDescription
Image to TextImage captioning and analysis
Content ModerationFilter inappropriate content
Video to TextVideo understanding and captioning

Popular models: Molmo2, Moondream3, Content Moderator

Training & Customization

CategoryDescription
Custom LoRAApply custom LoRA models to generation
TrainingTrain your own LoRA models

LLM

CategoryDescription
Large Language ModelsChat and text generation with leading LLMs

Account Levels

WaveSpeedAI offers four account tiers based on your usage needs:

LevelImages/minVideos/minConcurrent TasksHow to Unlock
Bronze1053Default for new users
Silver50060100Top-up $100 total
Gold3,0006002,000Top-up $1,000 total
Ultra5,0005,0005,000Top-up $10,000 total

New accounts receive $1 trial credit to explore the platform. Some premium models may require a paid balance.

Ways to Use WaveSpeedAI

MethodBest ForLink
Web InterfaceQuick testing, no coding requiredWeb Guide
REST APICustom integrations, production appsAPI Docs
Python SDKPython developersSDK Guide
JavaScript SDKNode.js and web developersnpm package
Desktop AppLocal usage, batch processingDesktop Guide
ComfyUINode-based AI workflowsComfyUI Guide
N8NNo-code automationN8N Guide

Next Steps

© 2025 WaveSpeedAI. All rights reserved.