video-to-text
Idle
Your request will cost $0.015 per run.
For $1 you can run this model approximately 66 times.
MiniCPM-V is a series of efficient end-side multimodal LLMs (MLLMs), which accept images, videos and text as inputs and deliver high-quality text outputs, including support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.