2412.00142
Cited By
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
28 November 2024
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
Papers citing "Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"
3 / 3 papers shown
Title
On the Suitability of Reinforcement Fine-Tuning to Visual Tasks
X. Chen
Wei Li
Chunxu Liu
Chi Xie
Xiaoyan Hu
Chengqian Ma
Feng Zhu
Rui Zhao
ReLM
LRM
48
0
0
08 Apr 2025
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference
Hao Yin
Guangzong Si
Zilei Wang
40
0
0
17 Mar 2025
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence
Granite Vision Team
Leonid Karlinsky
Assaf Arbelle
Abraham Daniels
A. Nassar
...
Sriram Raghavan
T. Syeda-Mahmood
Peter W. J. Staar
Tal Drory
Rogerio Feris
VLM
AI4TS
102
0
0
14 Feb 2025