1910.03230
Meta Module Network for Compositional Visual Reasoning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
8 October 2019
Wenhu Chen
Zhe Gan
Linjie Li
Yu Cheng
Wenjie Wang
Jingjing Liu
LRM
Papers citing "Meta Module Network for Compositional Visual Reasoning"
35 / 35 papers shown
Explain Before You Answer: A Survey on Compositional Visual Reasoning
Fucai Ke
Joy Hsu
Zhixi Cai
Zixian Ma
Xin Zheng
...
P. D. Haghighi
Gholamreza Haffari
Ranjay Krishna
Jiajun Wu
H. Rezatofighi
ReLM
CoGe
LRM
419
13
0
24 Aug 2025
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A
Chen Li
Chinthani Sugandhika
Yeo Keat Ee
Eric Peh
Hao Zhang
Hong Yang
Deepu Rajan
Basura Fernando
LRM
229
3
0
04 Aug 2025
Multi-Sourced Compositional Generalization in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Chuanhao Li
Wenbo Ye
Zhen Li
Yuwei Wu
Yunde Jia
CoGe
316
0
0
29 May 2025
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
Thanh-Son Nguyen
Hong Yang
Tzeh Yuan Neoh
Hao Zhang
Ee Yeo Keat
Basura Fernando
NAI
534
3
0
19 Mar 2025
On the Role of Visual Grounding in VQA
Daniel Reich
Tanja Schultz
288
3
0
26 Jun 2024
VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images
Anna Penzkofer
Lei Shi
Andreas Bulling
251
1
0
06 May 2024
Detection-based Intermediate Supervision for Visual Question Answering
Yuhang Liu
Daowan Peng
Wei Wei
Yuanyuan Fu
Wenfeng Xie
Dangyang Chen
225
3
0
26 Dec 2023
Modularized Zero-shot VQA with Pre-trained Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Rui Cao
Jing Jiang
LRM
301
3
0
27 May 2023
Curriculum Learning for Compositional Visual Reasoning
International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP), 2023
Wafa Aissa
Marin Ferecatu
M. Crucianu
LRM
249
3
0
27 Mar 2023
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations
Computer Vision and Pattern Recognition (CVPR), 2023
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
338
78
0
23 Mar 2023
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning
Zhenfang Chen
Qinhong Zhou
Songlin Yang
Yining Hong
Hao Zhang
Chuang Gan
LRM
VLM
350
54
0
12 Jan 2023
Visually Grounded VQA by Lattice-based Retrieval
Daniel Reich
F. Putze
Tanja Schultz
212
3
0
15 Nov 2022
Declaration-based Prompt Tuning for Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Yuhang Liu
Wei Wei
Daowan Peng
Feida Zhu
MLLM
VLM
196
21
0
05 May 2022
Measuring Compositional Consistency for Video Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Mona Gandhi
Mustafa Omer Gul
Eva Prakash
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
309
19
0
14 Apr 2022
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
358
61
0
15 Dec 2021
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yi-Liang Nie
Linjie Li
Zhe Gan
Shuohang Wang
Chenguang Zhu
Michael Zeng
Zicheng Liu
Joey Tianyi Zhou
Lijuan Wang
189
10
0
08 Dec 2021
Coarse-to-Fine Reasoning for Visual Question Answering
Binh X. Nguyen
Tuong Khanh Long Do
Huy Tran
Erman Tjiputra
Quang-Dieu Tran
A. Nguyen
NAI
393
45
0
06 Oct 2021
ProTo: Program-Guided Transformer for Program-Guided Tasks
Zelin Zhao
Karan Samel
Binghong Chen
Le Song
ViT
LM&Ro
303
32
0
02 Oct 2021
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
Zhuowan Li
Elias Stengel-Eskin
Yixiao Zhang
Cihang Xie
Q. Tran
Benjamin Van Durme
Alan Yuille
VLM
217
19
0
01 Oct 2021
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
198
19
0
04 Sep 2021
X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering
ACM Multimedia (ACM MM), 2021
Jingjing Jiang
Zi-yi Liu
Yifan Liu
Zhixiong Nan
N. Zheng
OOD
306
20
0
24 Jul 2021
Supervising the Transfer of Reasoning Patterns in VQA
Neural Information Processing Systems (NeurIPS), 2021
Corentin Kervadec
Christian Wolf
G. Antipov
M. Baccouche
Madiha Nadri Wolf
250
11
0
10 Jun 2021
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
IEEE International Conference on Computer Vision (ICCV), 2021
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjD
VLM
687
1,108
0
26 Apr 2021
How Transferable are Reasoning Patterns in VQA?
Computer Vision and Pattern Recognition (CVPR), 2021
Corentin Kervadec
Theo Jaunet
G. Antipov
M. Baccouche
Romain Vuillemot
Christian Wolf
LRM
198
29
0
08 Apr 2021
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels
Chenliang Li
Ming Yan
Haiyang Xu
Fuli Luo
Wei Wang
Bin Bi
Songfang Huang
VLM
229
41
0
14 Mar 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
597
172
0
02 Jan 2021
Object-Centric Diagnosis of Visual Reasoning
Jianwei Yang
Jiayuan Mao
Jiajun Wu
Devi Parikh
David D. Cox
J. Tenenbaum
Chuang Gan
OCL
236
17
0
21 Dec 2020
A Closer Look at the Robustness of Vision-and-Language Pre-trained Models
Linjie Li
Zhe Gan
Jingjing Liu
VLM
355
50
0
15 Dec 2020
Interpretable Visual Reasoning via Induced Symbolic Space
IEEE International Conference on Computer Vision (ICCV), 2020
Zhonghao Wang
Kai Wang
Mo Yu
Jinjun Xiong
Wen-mei W. Hwu
M. Hasegawa-Johnson
Humphrey Shi
LRM
OCL
267
22
0
23 Nov 2020
LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering
Weixin Liang
Fei Niu
Aishwarya N. Reganti
Govind Thattai
Gokhan Tur
229
19
0
21 Nov 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
364
28
0
24 Oct 2020
Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Neural Information Processing Systems (NeurIPS), 2020
Zhe Gan
Yen-Chun Chen
Linjie Li
Chen Zhu
Yu Cheng
Jingjing Liu
ObjD
VLM
422
545
0
11 Jun 2020
Roses Are Red, Violets Are Blue... but Should VQA Expect Them To?
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
OOD
340
104
0
09 Jun 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
European Conference on Computer Vision (ECCV), 2020
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
1.0K
2,197
0
13 Apr 2020
VIOLIN: A Large-Scale Dataset for Video-and-Language Inference
Computer Vision and Pattern Recognition (CVPR), 2020
J. Liu
Wenhu Chen
Yu Cheng
Zhe Gan
Licheng Yu
Yiming Yang
Jingjing Liu
MLLM
VGen
350
77
0
25 Mar 2020