Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2025 |
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2024 |
Towards Interpreting Visual Information Processing in Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2024 |