Object Detection and Segmentation

Detect, identify, and segment objects in images and videos with AI models on WaveSpeed

Our pick

image-to-text

wavespeed-ai/moondream3-preview/point

Moondream3 Point finds objects in images and returns precise coordinate points for computer vision tasks, enabling accurate point localization. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

All models

10 models

image-to-text

wavespeed-ai/moondream3-preview/point

image-to-text

wavespeed-ai/moondream3-preview/detect

Moondream3 Detect returns precise object bounding boxes in images for accurate computer vision localization. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
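Bounding boxes from a detector usually need to be mapped onto the source image before drawing or cropping. The sketch below assumes each box arrives as normalized `x_min`, `y_min`, `x_max`, `y_max` values in the range 0 to 1; that output shape and the field names are assumptions for illustration, not the documented response of this endpoint, so check the model page before relying on them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PixelBox:
    left: int
    top: int
    right: int
    bottom: int


def to_pixel_box(box: dict, width: int, height: int) -> PixelBox:
    """Scale a normalized box (values in [0, 1]) to integer pixel coordinates."""

    def clamp(value: float) -> float:
        # Keep slightly out-of-range predictions inside the image.
        return min(1.0, max(0.0, value))

    return PixelBox(
        left=round(clamp(box["x_min"]) * width),
        top=round(clamp(box["y_min"]) * height),
        right=round(clamp(box["x_max"]) * width),
        bottom=round(clamp(box["y_max"]) * height),
    )


# Hypothetical detection on a 640x480 image; y_max overshoots and gets clamped.
example = to_pixel_box(
    {"x_min": 0.25, "y_min": 0.5, "x_max": 0.75, "y_max": 1.2}, 640, 480
)
print(example)  # PixelBox(left=160, top=240, right=480, bottom=480)
```

If the endpoint already returns pixel coordinates, the scaling step can be dropped and only the clamping kept.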

image-to-3d

wavespeed-ai/sam-3d-body

Advanced SAM 3D body generation model for creating detailed 3D human body models from images with optional mask-based segmentation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-3d

wavespeed-ai/sam-3d-objects

Advanced SAM 3D objects generation model for creating detailed 3D object models from images with text prompts and optional mask-based segmentation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-to-video

wavespeed-ai/sam3-video

SAM3 Video is a unified foundation model for prompt-based video segmentation. Provide text, point, box, or mask prompts and the model segments and tracks targets across frames with strong temporal consistency. Supports concept-level (“segment anything with concepts”) and multi-object masks for editing, analytics, and VFX. Ready-to-use REST inference API with fast response, no cold starts, and affordable pricing.

image-to-image

wavespeed-ai/sam3-image

SAM 3 is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-to-text

wavespeed-ai/sam3-video-rle

SAM 3 Video RLE is a unified foundation model for prompt-based segmentation in video. Track and segment objects across frames using text, points, or boxes, returning RLE-encoded masks for efficient processing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

image-to-text

wavespeed-ai/sam3-image-rle

SAM 3 RLE is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Returns RLE (Run-Length Encoding) masks for efficient storage and processing. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
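Run-length encoding stores a binary mask as a list of run lengths instead of one value per pixel. The listing does not say which RLE variant the API returns, so the sketch below assumes the uncompressed COCO-style convention: runs alternate between background and foreground, start with background, and walk the image column by column. Treat both the convention and the `decode_rle` helper as illustrative assumptions.

```python
def decode_rle(counts: list[int], height: int, width: int) -> list[list[int]]:
    """Decode uncompressed COCO-style RLE into a height x width 0/1 mask.

    Assumes runs alternate background/foreground, starting with background,
    and that pixels are stored in column-major order.
    """
    if sum(counts) != height * width:
        raise ValueError("run lengths do not cover the mask exactly")

    flat: list[int] = []
    value = 0  # the first run is background
    for run in counts:
        flat.extend([value] * run)
        value = 1 - value

    # Column-major layout: flat index = col * height + row.
    return [
        [flat[col * height + row] for col in range(width)]
        for row in range(height)
    ]


# A 2x3 mask: 2 background pixels, 3 foreground, 1 background.
mask = decode_rle([2, 3, 1], height=2, width=3)
for row in mask:
    print(row)
# [0, 1, 1]
# [0, 1, 0]
```

A mask that is mostly background compresses to a handful of integers this way, which is why RLE suits per-frame video masks.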

image-to-image

bria/embed-product

Bria Embed Product seamlessly integrates product images into scene backgrounds with natural lighting and perspective matching. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

video-to-video

wavespeed-ai/void-video-inpainting/mask

VOID Video Inpainting removes objects from videos using mask-guided inpainting. Supports quad-mask or auto-generated SAM-3 masks, optional Pass 2 refinement for temporal consistency, adjustable denoising steps, guidance scale, and temporal window size. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Object Detection and Segmentation API: pricing and performance

Run any model from the Object Detection and Segmentation collection through a single REST API. Pay per generation, with no subscriptions and no minimums, backed by industry-leading latency and infrastructure with 99.9% availability.
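Because every model in the collection sits behind the same REST API, switching models mostly means changing the model path. The sketch below only builds the HTTP request and does not send it. The base URL, the Bearer-token header, and the `image` and `prompt` payload fields are assumptions for illustration; confirm the real values at wavespeed.ai/docs.

```python
import json
import urllib.request

# Assumed base URL; verify against the official documentation.
API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(model: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one model."""
    return urllib.request.Request(
        url=f"{API_BASE}/{model}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_request(
    "wavespeed-ai/moondream3-preview/detect",
    {"image": "https://example.com/cat.jpg", "prompt": "cat"},  # hypothetical fields
    api_key="YOUR_API_KEY",
)
print(req.get_method(), req.full_url)
# To actually send it: urllib.request.urlopen(req, timeout=30)
```

Keeping request construction separate from sending makes it easy to inspect or unit-test the call without spending credits.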

Why run Object Detection and Segmentation on WaveSpeedAI

Transparent pricing

Every Object Detection and Segmentation model is priced per call. The price is listed on each model's page, with no additional platform fees.

Optimized for low latency

Most image models in the Object Detection and Segmentation collection finish in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Cross-region failover and automatic retries keep production traffic online, even during provider outages.

Frequently asked questions

How much does the Object Detection and Segmentation API cost?

Each model has its own per-call price, listed on the model's page. You are charged for each successful generation, with no subscription fees or minimums.

How fast are Object Detection and Segmentation models on WaveSpeedAI?

Image models in this collection usually finish in under 2 seconds. Video and 3D models depend on duration and resolution, but are typically several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account receives $1 in free credits at sign-up, which is enough to try most Object Detection and Segmentation models without a credit card.
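To get a feel for how far the $1 sign-up credit goes, divide it by a model's per-call price and round down. The price used below is a placeholder chosen for the arithmetic, not a real WaveSpeedAI price; substitute the figure from the model's page.

```python
from decimal import Decimal


def calls_covered(credit: Decimal, price_per_call: Decimal) -> int:
    """Number of whole successful calls a credit balance can pay for."""
    if price_per_call <= 0:
        raise ValueError("price_per_call must be positive")
    return int(credit // price_per_call)


FREE_CREDIT = Decimal("1.00")          # sign-up credit from the FAQ answer
HYPOTHETICAL_PRICE = Decimal("0.003")  # placeholder, not a real price

print(calls_covered(FREE_CREDIT, HYPOTHETICAL_PRICE))  # 333
```

`Decimal` avoids the rounding surprises that binary floats can introduce when working with currency amounts.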

Are there rate limits?

Standard accounts have generous limits on concurrent tasks. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity; contact the sales team for details.
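On the client side, one way to stay within a concurrent-task limit is to cap how many requests are in flight at once. The sketch below uses a thread pool for that. The limit of 4 is an arbitrary illustration (the actual limits are not stated here), and `fake_detect` is a hypothetical stand-in for a real API call.

```python
from concurrent.futures import ThreadPoolExecutor


def run_bounded(task, inputs, max_concurrent: int = 4) -> list:
    """Run task(item) for each input with at most max_concurrent in flight.

    Results are returned in the same order as the inputs.
    """
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        return list(pool.map(task, inputs))


def fake_detect(image_url: str) -> dict:
    # Hypothetical stand-in for a real API call; returns a canned result.
    return {"image": image_url, "objects": 0}


urls = [f"https://example.com/{i}.jpg" for i in range(10)]
results = run_bounded(fake_detect, urls)
print(len(results))  # 10
```

The pool size is the only knob: raising `max_concurrent` increases throughput until the account's own limit becomes the bottleneck.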

Browse over 1000 AI models

Browse our full catalog of state-of-the-art AI models: image, video, 3D, audio, LLM, and more.

wavespeed.ai/models →

Build with the API

Integrate AI into your own applications. A RESTful API with client libraries, no cold starts, and pay-per-use pricing.

wavespeed.ai/docs →