Marlin-2B: a tiny VLM to extract structured information from videos

May 18, 2026 AI & Machine Learning
computer vision video analysis visual-language-model

About

Marlin-2B is a tiny visual language model (VLM) that extracts structured information from videos. It takes visual and textual input and generates structured output describing the video content. The model is available on Hugging Face and can be integrated into applications that need video analysis.
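
To illustrate what integrating structured output into an application might look like, here is a minimal sketch. It assumes the model's generated text is JSON. The schema (`events`, `label`, `start_s`, `end_s`), the `VideoEvent` class, and the `parse_events` helper are all hypothetical names chosen for illustration and are not taken from the Marlin-2B model card. The sample string is a stand-in, not real model output.

```python
import json
from dataclasses import dataclass


@dataclass
class VideoEvent:
    # Hypothetical schema: field names are illustrative,
    # not Marlin-2B's documented output format.
    label: str
    start_s: float
    end_s: float


def parse_events(raw: str) -> list[VideoEvent]:
    """Parse JSON of the assumed form {"events": [...]} into typed records."""
    data = json.loads(raw)
    events = []
    for item in data.get("events", []):
        event = VideoEvent(
            label=str(item["label"]),
            start_s=float(item["start_s"]),
            end_s=float(item["end_s"]),
        )
        # Reject records whose time span is negative or inverted.
        if event.start_s < 0 or event.end_s < event.start_s:
            raise ValueError(f"invalid time span for {event.label!r}")
        events.append(event)
    return events


# Stand-in for text a VLM might generate; not real model output.
sample = '{"events": [{"label": "door opens", "start_s": 1.5, "end_s": 3.0}]}'
print(parse_events(sample))
```

Validating the parsed records before use is a reasonable precaution with any generative model, since nothing guarantees that generated text follows the requested schema.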
