
Molmo2-4B Image Captioner: Generate detailed, accurate captions for images at a selectable detail level (low, medium, or high). Built on an open-source vision-language model with object grounding capabilities. Available through a ready-to-use REST API with no cold starts and affordable pricing.
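As a rough illustration of how a caption request with a detail level might be assembled, here is a minimal Python sketch. The source does not document the endpoint, authentication scheme, or field names, so `API_URL`, the `Bearer` header, and the `image` and `detail` JSON keys are all hypothetical placeholders; only the three detail levels come from the description above. The function builds the request without sending it.

```python
import json
import urllib.request

# Hypothetical placeholder: the real endpoint URL is not given in the source.
API_URL = "https://api.example.com/v1/molmo2/image-captioner"

# The three detail levels named in the model description.
DETAIL_LEVELS = ("low", "medium", "high")


def build_caption_request(
    image_url: str,
    detail: str = "medium",
    api_key: str = "YOUR_API_KEY",
) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for an image caption.

    The JSON field names and the Bearer auth header are assumptions,
    not the provider's documented schema.
    """
    if detail not in DETAIL_LEVELS:
        raise ValueError(f"detail must be one of {DETAIL_LEVELS}, got {detail!r}")

    body = json.dumps({"image": image_url, "detail": detail}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_caption_request("https://example.com/cat.jpg", detail="high")
    # Inspect the request locally; nothing is sent over the network here.
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Once the placeholders are replaced with the provider's actual endpoint and schema, the request could be sent with `urllib.request.urlopen(req)`. Validating the detail level on the client side keeps an unsupported value from reaching the API.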

molmo2/image-captioner

molmo2/image-content-moderator

molmo2/image-qa

molmo2/text-content-moderator

molmo2/video-captioner

molmo2/video-content-moderator

molmo2/video-qa

molmo2/video-understanding

content-moderator/text

moondream3-preview/point

content-moderator/image