Serverless Overview

Serverless Overview

Deploy and run your AI workloads on high-performance GPUs with Waverless, WaveSpeedAI’s serverless GPU platform.

What is Waverless?

Waverless is a serverless GPU task orchestration system designed for AI inference and training workloads. It provides on-demand access to powerful GPUs without managing infrastructure.

Key Features

FeatureDescription
RunPod CompatibleZero-code migration from RunPod with compatible API
Auto ScalingAutomatically adjusts worker count based on task queue depth
Multi-EndpointIsolate different applications through separate endpoints
Graceful ShutdownZero task loss during rolling updates and scale down
High AvailabilityMulti-replica deployment with no single point of failure

How It Works

1. Create Endpoint    →    Define your worker image and GPU spec
2. Deploy Workers     →    Workers auto-scale based on demand
3. Submit Tasks       →    Send tasks via API
4. Get Results        →    Receive results via polling or webhook

Use Cases

  • Custom Model Deployment — Run your own AI models on dedicated GPUs
  • Batch Processing — Process large volumes of data in parallel
  • Training Workloads — Fine-tune models with on-demand compute
  • High-Throughput Inference — Scale inference pipelines automatically

Architecture

Waverless uses a pull-based architecture where workers actively pull tasks from a queue:

  • Task Queue — Tasks are queued and distributed to available workers
  • Worker Pool — Workers pull tasks, execute them, and return results
  • Auto Scaler — Monitors queue depth and adjusts worker count

Getting Started

  1. View GPU Pricing — See available GPUs and costs
  2. Quick Start — Get up and running in minutes
  3. Create Endpoint — Deploy your first endpoint
  4. Build Worker — Write your handler code

Enterprise Access

Waverless is currently available for enterprise customers. To request access:

  1. Go to wavespeed.ai/serverless
  2. Fill out the request form
  3. Our team will contact you to discuss your use case
© 2025 WaveSpeedAI. All rights reserved.