Take A Step Back: Rethinking the Two Stages in Visual Reasoning. European Conference on Computer Vision (ECCV), 2024.
Multimodal Representations for Teacher-Guided Compositional Visual Reasoning. Advanced Concepts for Intelligent Vision Systems Conference (ACIVS), 2023.
Curriculum Learning for Compositional Visual Reasoning. VISIGRAPP, 2023.
Multimodal Learning with Transformers: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.
An experimental study of the vision-bottleneck in VQA. Social Science Research Network (SSRN), 2022.
VisQA: X-raying Vision and Language Reasoning in Transformers. IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021.