
Molmo2-4B Image Captioner: Generate detailed, accurate captions for images with customizable detail levels (low, medium, high). Open-source vision-language model with object grounding capabilities. Ready-to-use REST API, no cold starts, affordable pricing.
molmo2/image-captioner
molmo2/video-captioner
molmo2/video-qa
molmo2/video-understanding
molmo2/image-qa
molmo2/text-content-moderator
molmo2/image-content-moderator
molmo2/video-content-moderator
content-moderator/text
moondream3-preview/point
content-moderator/image