Seedance 2.0 立省 15% | 在 Video Generator 中創作 →
Object Detection and Segmentation

Object Detection and Segmentation

Detect, identify, and segment objects in images and videos with AI models on WaveSpeed

我們的選擇

wavespeed-ai/moondream3-preview/point
image-to-text

wavespeed-ai/moondream3-preview/point

Moondream3 Point finds objects in images and returns precise coordinate points for computer vision tasks, enabling accurate point localization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

所有模型

10 個模型
wavespeed-ai/moondream3-preview/point
image-to-text

wavespeed-ai/moondream3-preview/point

Moondream3 Point finds objects in images and returns precise coordinate points for computer vision tasks, enabling accurate point localization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/moondream3-preview/detect
image-to-text

wavespeed-ai/moondream3-preview/detect

Moondream3 Detect: Precise object bounding boxes in images for accurate computer vision localization. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/sam-3d-body
image-to-3d

wavespeed-ai/sam-3d-body

Advanced SAM 3D body generation model for creating detailed 3D human body models from images with optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/sam-3d-objects
image-to-3d

wavespeed-ai/sam-3d-objects

Advanced SAM 3D objects generation model for creating detailed 3D object models from images with text prompts and optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/sam3-video
video-to-video

wavespeed-ai/sam3-video

SAM3 Video is a unified foundation model for prompt-based video segmentation. Provide text, point, box, or mask prompts and the model segments and tracks targets across frames with strong temporal consistency. Supports concept-level (“segment anything with concepts”) and multi-object masks for editing, analytics, and VFX. Ready-to-use REST inference API with fast response, no cold starts, and affordable pricing.

wavespeed-ai/sam3-image
image-to-image

wavespeed-ai/sam3-image

SAM 3 is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/sam3-video-rle
video-to-text

wavespeed-ai/sam3-video-rle

SAM 3 Video RLE is a unified foundation model for prompt-based segmentation in video. Track and segment objects across frames using text, points, or boxes, returning RLE encoded masks for efficient processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/sam3-image-rle
image-to-text

wavespeed-ai/sam3-image-rle

SAM 3 RLE is a unified foundation model for promptable image segmentation using text, points, or boxes to detect and segment objects. Returns RLE (Run-Length Encoding) encoded masks for efficient storage and processing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

bria/embed-product
image-to-image

bria/embed-product

Bria Embed Product seamlessly integrates product images into scene backgrounds with natural lighting and perspective matching. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/void-video-inpainting/mask
video-to-video

wavespeed-ai/void-video-inpainting/mask

VOID Video Inpainting removes objects from videos using mask-guided inpainting. Supports quad-mask or auto-generated SAM-3 masks, optional Pass 2 refinement for temporal consistency, adjustable denoising steps, guidance scale, and temporal window size. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Object Detection and Segmentation API — 價格與效能

透過單一 REST API 執行 Object Detection and Segmentation 系列中的任何模型。按生成計費 — 無訂閱、無最低消費 — 在可用率 99.9% 的基礎架構上提供業界領先的延遲。

為什麼在 WaveSpeedAI 上執行 Object Detection and Segmentation

透明的價格

每個 Object Detection and Segmentation 模型都採按呼叫計費。價格列在每個模型的頁面上 — 不會額外加收平台費。

為低延遲最佳化

大多數 Object Detection and Segmentation 影像模型在 2 秒內完成。影片與 3D 模型比自架方案快數倍。

99.9% 可用率

多區域故障轉移與自動重試可在供應商故障期間 — 仍將您的生產流量保持線上。

常見問題

Object Detection and Segmentation API 多少錢?+

每個模型在其模型頁面上都列有自己的按呼叫價格。我們按每次成功生成計費,沒有訂閱費或最低消費。

Object Detection and Segmentation 模型在 WaveSpeedAI 上有多快?+

本系列中的影像模型通常在 2 秒內完成。影片與 3D 模型取決於長度與解析度,但通常比自架執行快數倍。

不用信用卡可以試用 API 嗎?+

可以 — 每個帳戶註冊時即可獲得 $1 的免費額度,足以在不使用信用卡的情況下試用大多數 Object Detection and Segmentation 模型。

有速率限制嗎?+

標準帳戶具有充足的並行任務限制。Enterprise 方案提供自訂 RPM、更高並行性和專屬容量 — 詳情請聯繫業務。