
Title |
|---|
![]() Why Relational Graphs Will Save the Next Generation of Vision Foundation Models?Social Science Research Network (SSRN), 2025 |
![]() Predicting Implicit Arguments in Procedural Video InstructionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal InconsistencyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |