May 18, 2026
AI & Machine Learning
computer vision
video analysis
visual-language-model
About
Marlin-2B is a tiny visual language model (VLM) that extracts structured information from videos. It takes visual and textual input and generates output relevant to that input. The model is available on Hugging Face and can be integrated into applications for video analysis tasks.
Related Products
OpenBrief – Local-first video downloader/summarizer
Nerve – self hosted runtime for AI agents
skills-for-humanity – 171 structured reasoning skills for Claude Code
Bae – AI companion built around persistent memory architecture