Content Detection Models

Detect objects, faces, poses, text, depth, and more with powerful AI detection and analysis models on WaveSpeed

Our selection

wavespeed-ai/content-moderator/text
content-moderation

Scalable Text Content Moderator for filtering and classifying user-generated text, ideal for safety and compliance workflows. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

All models

11 models
wavespeed-ai/content-moderator/text
content-moderation

Scalable Text Content Moderator for filtering and classifying user-generated text, ideal for safety and compliance workflows. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/content-moderator/image
content-moderation

Image Content Moderator provides automated image moderation that detects and flags policy-violating or inappropriate images. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/moondream3-preview/point
image-to-text

Moondream3 Point finds objects in images and returns precise coordinate points for computer vision tasks that need point localization. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/molmo2/image-captioner
image-to-text

Molmo2-4B Image Captioner: Generate detailed, accurate captions for images with customizable detail levels (low, medium, high). Open-source vision-language model with object grounding capabilities. Ready-to-use REST API, no cold starts, affordable pricing.

wavespeed-ai/molmo2/video-captioner
video-to-text

Molmo2-4B Video Captioner: Generate detailed, accurate captions for videos with customizable detail levels (low, medium, high). Open-source vision-language model with temporal understanding capabilities. Ready-to-use REST API, no cold starts, duration-based pricing.

wavespeed-ai/molmo2/video-qa
video-to-text

Molmo2-4B Video QA: Answer questions about video content with temporal understanding. Open-source vision-language model. Ready-to-use REST API, no cold starts, duration-based pricing.

wavespeed-ai/molmo2/video-understanding
video-to-text

Molmo2-4B Video Understanding: Analyze videos with specialized tasks (general, summary, analysis, counting, scene description). Open-source vision-language model with temporal understanding. Ready-to-use REST API, no cold starts, duration-based pricing.

wavespeed-ai/molmo2/image-qa
image-to-text

Molmo2-4B Image QA: Answer questions about images with support for multi-image comparison (1-2 images). Open-source vision-language model. Ready-to-use REST API, no cold starts, affordable pricing.

wavespeed-ai/molmo2/text-content-moderator
content-moderation

Molmo2-4B Text Content Moderator: Analyze text content for safety, appropriateness, and policy compliance. Detects hate speech, violence, sexual content, and other harmful categories. Open-source vision-language model. Ready-to-use REST API, no cold starts, affordable pricing.

wavespeed-ai/molmo2/image-content-moderator
content-moderation

Molmo2-4B Image Content Moderator: Analyze image content for safety, appropriateness, and policy compliance. Detects violence, nudity, gore, and other harmful visual content. Open-source vision-language model. Ready-to-use REST API, no cold starts, affordable pricing.

wavespeed-ai/molmo2/video-content-moderator
content-moderation

Molmo2-4B Video Content Moderator analyzes video content for safety, appropriateness, and policy compliance. Detects violence, nudity, gore, and other harmful visual content in videos using an open-source vision-language model. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Content Detection Models API — pricing & performance

Run any model in the Content Detection Models collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.
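As a rough sketch of what calling one model through a single REST API could look like from client code, the Python below builds a POST request for a model ID from the list above without sending it. The base URL, the `/api/v3/` path layout, the bearer-token header, and the `text` input field are assumptions made for illustration and are not stated on this page; the model page for each entry is the place to confirm the actual endpoint and input schema.

```python
import json
import urllib.request

# Assumption: base URL and path layout are illustrative, not confirmed by this page.
API_BASE = "https://api.wavespeed.ai/api/v3"


def build_request(model_id: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the given model ID."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{model_id}",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


req = build_request(
    "wavespeed-ai/content-moderator/text",
    {"text": "example user comment"},  # hypothetical input field name
    api_key="YOUR_API_KEY",
)
print(req.get_method(), req.full_url)
```

Because only the model ID changes between calls, the same helper could in principle target any entry in the collection; sending the request (for example with `urllib.request.urlopen`) is left out so the sketch runs without network access or a real key.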

Why run Content Detection Models on WaveSpeedAI

Transparent pricing

Per-call pricing for every model in the Content Detection Models collection. The price is listed on each model page, with no platform fees on top.

Optimized for low latency

Most image models in the Content Detection Models collection complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Content Detection Models API cost?

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.
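The billing rule above (a per-call price, charged only for successful calls) can be turned into a quick estimate. The sketch below is illustrative only: the price of 0.001 per call and the list of outcomes are made-up values, not figures from any model page.

```python
from decimal import Decimal


def estimate_cost(price_per_call: Decimal, outcomes: list[bool]) -> Decimal:
    """Sum the per-call price over successful calls only; failed calls cost nothing."""
    successful = sum(1 for ok in outcomes if ok)
    return price_per_call * successful


# Hypothetical price and outcomes: 3 of the 4 calls succeed.
cost = estimate_cost(Decimal("0.001"), [True, True, False, True])
print(cost)
```

`Decimal` is used instead of `float` so that small per-call prices add up without binary rounding drift.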

How fast are Content Detection Models on WaveSpeedAI?

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account receives $1 in free credit at sign-up, enough to try most models in the Content Detection Models collection without a credit card.

Are there rate limits?

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.
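One way a client might stay within a concurrent-job limit is to cap how many requests are in flight at once. The sketch below does this with a bounded thread pool; the limit of 3 and the `fake_submit` worker are hypothetical stand-ins, since this page does not state the actual limits or the request format.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENT_JOBS = 3  # hypothetical; use the limit that applies to your account


def run_bounded(jobs, worker, limit=MAX_CONCURRENT_JOBS):
    """Run worker(job) for each job with at most `limit` running at once.

    Results are returned in the same order as the input jobs.
    """
    with ThreadPoolExecutor(max_workers=limit) as pool:
        return list(pool.map(worker, jobs))


# Bookkeeping used only to observe peak concurrency in this demo.
in_flight = 0
peak = 0
lock = threading.Lock()


def fake_submit(text):
    """Stand-in for a real API call; sleeps briefly instead of using the network."""
    global in_flight, peak
    with lock:
        in_flight += 1
        peak = max(peak, in_flight)
    time.sleep(0.01)
    with lock:
        in_flight -= 1
    return f"checked:{text}"


results = run_bounded(["a", "b", "c", "d", "e"], fake_submit)
print(results, "peak concurrency:", peak)
```

Replacing `fake_submit` with a function that performs the real request keeps the same cap, because the pool never starts more than `limit` workers.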