Seedream 5.0 Pro ist LIVE | Jetzt im Bildgenerator testen →

Dashboard Entdecken KI-GeneratorBeliebt Desktop-App

LLM

API-Schlüssel Aufladen

Einstellungen

Object Detection and Segmentation

Detect, identify, and segment objects in images and videos with AI models on WaveSpeed

Unsere Auswahl

image-to-text

wavespeed-ai/moondream3-preview/point

Moondream3 Point finds objects in images and returns precise coordinate points for computer vision tasks, enabling accurate point localization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Jetzt ausprobieren!Dokumentation ansehen

Alle Modelle

10 Modelle

image-to-text

wavespeed-ai/moondream3-preview/point

image-to-text

wavespeed-ai/moondream3-preview/detect

Moondream3 Detect: Precise object bounding boxes in images for accurate computer vision localization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/sam-3d-body

Advanced SAM 3D body generation model for creating detailed 3D human body models from images with optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d

wavespeed-ai/sam-3d-objects

Advanced SAM 3D objects generation model for creating detailed 3D object models from images with text prompts and optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/sam3-video

SAM3 Video is a unified foundation model for prompt-based video segmentation. Provide text, point, box, or mask prompts and the model segments and tracks targets across frames with strong temporal consistency. Supports concept-level (“segment anything with concepts”) and multi-object masks for editing, analytics, and VFX. Ready-to-use REST inference API with fast response, no cold starts, and affordable pricing.

image-to-image

wavespeed-ai/sam3-image

SAM 3 is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-text

wavespeed-ai/sam3-video-rle

SAM 3 Video RLE is a unified foundation model for prompt-based segmentation in video. Track and segment objects across frames using text, points, or boxes, returning RLE encoded masks for efficient processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-text

wavespeed-ai/sam3-image-rle

SAM 3 RLE is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Returns RLE (Run-Length Encoding) encoded masks for efficient storage and processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image

bria/embed-product

Bria Embed Product seamlessly integrates product images into scene backgrounds with natural lighting and perspective matching. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

video-to-video

wavespeed-ai/void-video-inpainting/mask

VOID Video Inpainting removes objects from videos using mask-guided inpainting. Supports quad-mask or auto-generated SAM-3 masks, optional Pass 2 refinement for temporal consistency, adjustable denoising steps, guidance scale, and temporal window size. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Object Detection and Segmentation API — Preise und Performance

Nutzen Sie jedes Modell der Object Detection and Segmentation-Sammlung über eine einzige REST-API. Bezahlen Sie pro Generierung — keine Abos, keine Mindestbeträge — mit branchenführender Latenz auf einer Infrastruktur mit 99,9 % Verfügbarkeit.

Warum Object Detection and Segmentation auf WaveSpeedAI ausführen

Transparente Preise

Abrechnung pro Aufruf für jedes Object Detection and Segmentation-Modell. Der Preis ist auf jeder Modellseite ausgewiesen — keine Plattformgebühren obendrauf.

Auf niedrige Latenz optimiert

Die meisten Object Detection and Segmentation-Bildmodelle laufen in unter 2 Sekunden. Video- und 3D-Modelle sind mehrfach schneller als selbst gehostete Alternativen.

99,9 % Verfügbarkeit

Multi-Region-Failover und automatische Wiederholungen halten Ihren Produktionsverkehr online — auch bei Anbieter-Ausfällen.

Häufig gestellte Fragen

Wie viel kostet die Object Detection and Segmentation-API?+

Jedes Modell hat seinen eigenen Preis pro Aufruf, der auf der Modellseite angegeben ist. Wir rechnen pro erfolgreicher Generierung ab — ohne Abogebühren oder Mindestbeträge.

Wie schnell sind Object Detection and Segmentation-Modelle auf WaveSpeedAI?+

Bildmodelle in dieser Sammlung sind typischerweise in unter 2 Sekunden fertig. Video- und 3D-Modelle hängen von Dauer und Auflösung ab, sind aber meist mehrfach schneller als selbst gehostete Läufe.

Kann ich die API ohne Kreditkarte testen?+

Ja — jedes Konto erhält bei der Anmeldung 1 $ Startguthaben, genug, um die meisten Object Detection and Segmentation-Modelle ohne Kreditkarte auszuprobieren.

Gibt es Rate-Limits?+

Standardkonten haben großzügige Limits für gleichzeitige Jobs. Enterprise-Pläne bieten individuelle RPM, höhere Parallelität und reservierte Kapazität — bei Interesse den Vertrieb kontaktieren.

Entdecke 1.000+ KI-Modelle

Durchsuche unseren vollständigen Katalog modernster KI-Modelle — Bild, Video, 3D, Audio, LLM und mehr.

wavespeed.ai/models →

Bauen mit der API

Integriere KI in deine eigenen Apps. RESTful-API mit Client-Bibliotheken — keine Cold Starts, Pay-per-Use.

wavespeed.ai/docs →