Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1511.02799
Cited By
v1
v2
v3
v4 (latest)
Neural Module Networks
9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Module Networks"
50 / 653 papers shown
Compositional Chain-of-Thought Prompting for Large Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2023
Chancharik Mitra
Brandon Huang
Trevor Darrell
Roei Herzig
MLLM
LRM
332
166
0
27 Nov 2023
VALUED -- Vision and Logical Understanding Evaluation Dataset
Soumadeep Saha
Saptarshi Saha
Utpal Garain
200
0
0
21 Nov 2023
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback
ACM Multimedia (ACM MM), 2023
Minghe Gao
Juncheng Li
Hao Fei
Liang Pang
Wei Ji
Guoming Wang
Wenqiao Zhang
Siliang Tang
Yueting Zhuang
170
12
0
21 Nov 2023
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
Sen Yang
Xin Li
Leyang Cui
Li Bing
Wai Lam
LRM
NAI
302
16
0
16 Nov 2023
Attribute Diversity Determines the Systematicity Gap in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ian Berlot-Attwell
Kumar Krishna Agrawal
A. M. Carrell
Yash Sharma
Naomi Saphra
254
2
0
15 Nov 2023
Analyzing Modular Approaches for Visual Question Decomposition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Apoorv Khandelwal
Ellie Pavlick
Chen Sun
258
5
0
10 Nov 2023
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Zhenfang Chen
Rui Sun
Wenjun Liu
Yining Hong
Chuang Gan
LRM
309
22
0
08 Nov 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
259
140
0
08 Nov 2023
Exploitation-Guided Exploration for Semantic Embodied Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Justin Wasserman
Girish Chowdhary
Abhinav Gupta
Unnat Jain
261
11
0
06 Nov 2023
RigLSTM: Recurrent Independent Grid LSTM for Generalizable Sequence Learning
Ziyu Wang
Wenhao Jiang
Zixuan Zhang
Wei Tang
Junchi Yan
213
0
0
03 Nov 2023
MetaReVision: Meta-Learning with Retrieval for Visually Grounded Compositional Concept Acquisition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Guangyue Xu
Parisa Kordjamshidi
Joyce Chai
162
2
0
02 Nov 2023
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Siming Lan
Rui Zhang
Qi Yi
Jiaming Guo
Shaohui Peng
...
Zidong Du
Xingui Hu
Xishan Zhang
Ling Li
Yunji Chen
288
15
0
02 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion (Inf. Fusion), 2023
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
398
71
0
01 Nov 2023
Meta-Learning Strategies through Value Maximization in Neural Networks
Rodrigo Carrasco-Davis
Javier Masís
Andrew M. Saxe
218
3
0
30 Oct 2023
LILO: Learning Interpretable Libraries by Compressing and Documenting Code
International Conference on Learning Representations (ICLR), 2023
Gabriel Grand
L. Wong
Matthew Bowers
Theo X. Olausson
Muxin Liu
Joshua B. Tenenbaum
Jacob Andreas
315
31
0
30 Oct 2023
OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Rim Assouel
Pau Rodríguez
Perouz Taslakian
David Vazquez
Yoshua Bengio
LRM
OCL
196
0
0
28 Oct 2023
ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model for Visual Question Answering in Vietnamese
Khiem Vinh Tran
Hao Phu Phan
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
154
15
0
27 Oct 2023
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
Neural Information Processing Systems (NeurIPS), 2023
Xingrui Wang
Wufei Ma
Zhuowan Li
Adam Kortylewski
Yaoyao Liu
CoGe
318
21
0
27 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
International Conference on Machine Learning (ICML), 2023
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
207
40
0
26 Oct 2023
Visually Grounded Continual Language Learning with Selective Specialization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kyra Ahrens
Lennart Bengtson
Jae Hee Lee
Stefan Wermter
306
0
0
24 Oct 2023
Cross-Modal Conceptualization in Bottleneck Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Danis Alukaev
S. Kiselev
Ilya Pershin
Bulat Ibragimov
Vladimir Ivanov
Alexey Kornaev
Ivan Titov
255
9
0
23 Oct 2023
API-Assisted Code Generation for Question Answering on Varied Table Structures
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yihan Cao
Shuyi Chen
Ryan Liu
Zhiruo Wang
Daniel Fried
LMTD
248
26
0
23 Oct 2023
MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model
Le Zhang
Yihong Wu
Fengran Mo
Jian-Yun Nie
Aishwarya Agrawal
MLLM
RALM
239
7
0
20 Oct 2023
Neurosymbolic Grounding for Compositional World Models
Atharva Sehgal
Arya Grayeli
Jennifer J. Sun
Swarat Chaudhuri
350
11
0
19 Oct 2023
Instilling Inductive Biases with Subnetworks
Enyan Zhang
Michael A. Lepori
Ellie Pavlick
AI4CE
262
5
0
17 Oct 2023
Neural Relational Inference with Fast Modular Meta-learning
Neural Information Processing Systems (NeurIPS), 2023
Ferran Alet
Erica Weng
Tomás Lozano Pérez
L. Kaelbling
303
58
0
10 Oct 2023
NEUCORE: Neural Concept Reasoning for Composed Image Retrieval
Shu Zhao
Huijuan Xu
147
9
0
02 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
312
5
0
02 Oct 2023
Compositional Program Generation for Few-Shot Systematic Generalization
Tim Klinger
Luke Liu
Soham Dan
A. Rezaee
Parikshit Ram
Ali Movaghar
NAI
215
4
0
28 Sep 2023
D3: Data Diversity Design for Systematic Generalization in Visual Question Answering
Amir Rahimi
Vanessa D’Amario
Moyuru Yamada
Kentaro Takemoto
Tomotake Sasaki
Xavier Boix
166
2
0
15 Sep 2023
Dynamic MOdularized Reasoning for Compositional Structured Explanation Generation
Xiyan Fu
Anette Frank
LRM
211
2
0
14 Sep 2023
Neurons in Large Language Models: Dead, N-gram, Positional
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Elena Voita
Javier Ferrando
Christoforos Nalmpantis
MILM
394
73
0
09 Sep 2023
A Survey on Interpretable Cross-modal Reasoning
Dizhan Xue
Shengsheng Qian
Zuyi Zhou
Changsheng Xu
LRM
400
5
0
05 Sep 2023
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Shouqing Yang
Hao Wen
Yunxin Liu
243
2
0
29 Aug 2023
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Moon Ye-Bin
Jisoo Kim
Hong-Kyu Kim
Kilho Son
Tae-Hyun Oh
290
11
0
27 Jul 2023
Efficient Learning of Discrete-Continuous Computation Graphs
Neural Information Processing Systems (NeurIPS), 2023
David Friede
Mathias Niepert
153
3
0
26 Jul 2023
Free-Form Composition Networks for Egocentric Action Recognition
Haoran Wang
Qinghua Cheng
Baosheng Yu
Yibing Zhan
Dapeng Tao
Liang Ding
Haibin Ling
EgoV
316
2
0
13 Jul 2023
AVSegFormer: Audio-Visual Segmentation with Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2023
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
VOS
370
80
0
03 Jul 2023
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2023
A. S. Penamakuri
Manish Gupta
Mithun Das Gupta
Anand Mishra
166
7
0
29 Jun 2023
Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation
Conference on Robot Learning (CoRL), 2023
Harel Biggie
Ajay Narasimha Mopidevi
Dusty Woods
Christoffer Heckman
LM&Ro
254
13
0
15 Jun 2023
Modularity Trumps Invariance for Compositional Robustness
I. Mason
Anirban Sarkar
Tomotake Sasaki
Xavier Boix
OOD
216
1
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
254
9
0
14 Jun 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn
Difei Gao
Lei Ji
Luowei Zhou
Kevin Lin
Joya Chen
Zihan Fan
Mike Zheng Shou
MLLM
419
108
0
14 Jun 2023
Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions
IEEE Access (IEEE Access), 2023
N. Rodis
Christos Sardianos
Panagiotis I. Radoglou-Grammatikis
Panagiotis G. Sarigiannidis
Iraklis Varlamis
Georgios Th. Papadopoulos
333
38
0
09 Jun 2023
ModuleFormer: Modularity Emerges from Mixture-of-Experts
Songlin Yang
Zheyu Zhang
Tianyou Cao
Shawn Tan
Zhenfang Chen
Chuang Gan
KELM
MoE
197
13
0
07 Jun 2023
M
3
^3
3
IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
Lei Li
Yuwei Yin
Shicheng Li
Liang Chen
Peiyi Wang
...
Yazheng Yang
Jingjing Xu
Xu Sun
Lingpeng Kong
Qi Liu
MLLM
VLM
376
136
0
07 Jun 2023
Learning Transformer Programs
Neural Information Processing Systems (NeurIPS), 2023
Dan Friedman
Alexander Wettig
Danqi Chen
291
47
0
01 Jun 2023
Differentiable Tree Operations Promote Compositional Generalization
International Conference on Machine Learning (ICML), 2023
Paul Soulos
J. E. Hu
Kate McCurdy
Yunmo Chen
Roland Fernandez
P. Smolensky
Jianfeng Gao
AI4CE
137
7
0
01 Jun 2023
Emergent Modularity in Pre-trained Transformers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Chaojun Xiao
Xiaozhi Wang
Xu Han
Zhiyuan Liu
Ruobing Xie
Maosong Sun
Jie Zhou
MoE
229
35
0
28 May 2023
Modularized Zero-shot VQA with Pre-trained Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Rui Cao
Jing Jiang
LRM
254
3
0
27 May 2023
Previous
1
2
3
4
5
6
...
12
13
14
Next