Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2412.18108
Cited By
v1
v2 (latest)
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
Computer Vision and Pattern Recognition (CVPR), 2024
24 December 2024
Jing Bi
Junjia Guo
Yunlong Tang
Lianggong Wen
Zhang Liu
Chenliang Xu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach"
3 / 3 papers shown
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
Yunlong Tang
Pinxin Liu
Mingqian Feng
Mingqian Feng
Rui Mao
...
Hang Hua
Ali Vosoughi
Luchuan Song
Zeliang Zhang
Chenliang Xu
LRM
468
4
0
26 May 2025
Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Rui Hu
Delai Qiu
Shuyu Wei
J.N. Zhang
Yining Wang
Shengping Liu
Jitao Sang
AuLLM
VLM
387
1
0
27 Feb 2025
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
Chenliang Xu
VLM
720
170
0
29 Dec 2023
1
Page 1 of 1