Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering
International Journal of Computer Vision (IJCV), 2022
SimVQA: Exploring Simulated Environments for Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022