Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1511.02799
Cited By
v1
v2
v3
v4 (latest)
Neural Module Networks
9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Module Networks"
50 / 653 papers shown
Title
A Complexity-Based Theory of Compositionality
Eric Elmoznino
Thomas Jiralerspong
Yoshua Bengio
Guillaume Lajoie
CoGe
718
17
0
18 Oct 2024
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
International Conference on Learning Representations (ICLR), 2024
Wenhao Chai
Enxin Song
Y. Du
Chenlin Meng
Vashisht Madhavan
Omer Bar-Tal
Jeng-Neng Hwang
Saining Xie
Christopher D. Manning
3DV
605
89
0
04 Oct 2024
On The Specialization of Neural Modules
International Conference on Learning Representations (ICLR), 2024
Devon Jarvis
Richard Klein
Benjamin Rosman
Andrew M. Saxe
304
18
0
23 Sep 2024
Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs
IEEE International Conference on Robotics and Automation (ICRA), 2024
A. Mavrogiannis
Dehao Yuan
Yiannis Aloimonos
LM&Ro
283
2
0
23 Sep 2024
Breaking Neural Network Scaling Laws with Modularity
International Conference on Learning Representations (ICLR), 2024
Akhilan Boopathy
Sunshine Jiang
William Yue
Jaedong Hwang
Abhiram Iyer
Ila Fiete
OOD
388
6
0
09 Sep 2024
One-shot Video Imitation via Parameterized Symbolic Abstraction Graphs
IEEE International Conference on Robotics and Automation (ICRA), 2024
Jianren Wang
Kangni Liu
Dingkun Guo
Xian Zhou
Christopher G Atkeson
307
2
0
22 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yanjie Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
292
6
0
05 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
342
23
0
27 Jul 2024
Gradient-based inference of abstract task representations for generalization in neural networks
Ali Hummos
Felipe del-Rio
Brabeeba Mien Wang
Julio Hurtado
Cristian B. Calderon
G. Yang
182
4
0
24 Jul 2024
Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought
Jue Chen
Yongxin Deng
Xihe Qiu
Weidi Xu
Chao Qu
Wei Chu
Yinghui Xu
Yuan Qi
LRM
AI4CE
LM&Ro
188
14
0
18 Jul 2024
Compositional Models for Estimating Causal Effects
Purva Pruthi
David D. Jensen
CML
391
0
0
25 Jun 2024
UQE: A Query Engine for Unstructured Databases
Hanjun Dai
B. Wang
Xingchen Wan
Bo Dai
Sherry Yang
Azade Nova
Pengcheng Yin
P. Phothilimthana
Charles Sutton
Dale Schuurmans
159
17
0
23 Jun 2024
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Xiaocheng Yang
Bingsen Chen
Yik-Cheung Tam
LRM
201
21
0
28 May 2024
THREAD: Thinking Deeper with Recursive Spawning
Philip Schroeder
Nathaniel Morgan
Hongyin Luo
James R. Glass
LRM
LLMAG
ReLM
284
8
0
27 May 2024
A Survey of Multimodal Large Language Model from A Data-centric Perspective
Tianyi Bai
Hao Liang
Binwang Wan
Yanran Xu
Xi Li
...
Ping Huang
Jiulong Shan
Conghui He
Binhang Yuan
Wentao Zhang
335
64
0
26 May 2024
When does compositional structure yield compositional generalization? A kernel theory
Samuel Lippl
Kim Stachenfeld
NAI
CoGe
559
14
0
26 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
AI4CE
481
6
0
24 May 2024
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
International Conference on Language Resources and Evaluation (LREC), 2024
Guangmin Zheng
Jin Wang
Xiaobing Zhou
Xuejie Zhang
LRM
122
6
0
16 May 2024
Interpretability Needs a New Paradigm
Andreas Madsen
Himabindu Lakkaraju
Siva Reddy
Sarath Chandar
181
6
0
08 May 2024
Feature Map Convergence Evaluation for Functional Module
Ludan Zhang
Chaoyi Chen
Lei He
Keqiang Li
166
6
0
07 May 2024
VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images
Anna Penzkofer
Lei Shi
Andreas Bulling
198
1
0
06 May 2024
Large Language Models Synergize with Automated Machine Learning
Jinglue Xu
Jialong Li
Zhen Liu
Nagar Anthel Venkatesh Suryanarayanan
Guoyuan Zhou
Jia Guo
Hitoshi Iba
Kenji Tei
182
7
0
06 May 2024
AgentKit: Flow Engineering with Graphs, not Coding
Yue Wu
Yewen Fan
So Yeon Min
Shrimai Prabhumoye
Alexander Shmakov
Yonatan Bisk
Ruslan Salakhutdinov
Yuanzhi Li
Tom Michael Mitchell
AI4CE
308
0
0
17 Apr 2024
ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Quan Van Nguyen
Dan Quang Tran
Huy Quang Pham
Thang Kien-Bao Nguyen
Nghia Hieu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
CoGe
554
7
0
16 Apr 2024
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
Juhong Min
Shyamal Buch
Arsha Nagrani
Minsu Cho
Cordelia Schmid
LRM
356
61
0
09 Apr 2024
Visual Knowledge in the Big Model Era: Retrospect and Prospect
Wenguan Wang
Yi Yang
Yunhe Pan
VLM
264
29
0
05 Apr 2024
Self-Expansion of Pre-trained Models with Mixture of Adapters for Continual Learning
Huiyi Wang
Haodong Lu
Lina Yao
Dong Gong
KELM
CLL
320
24
0
27 Mar 2024
SYNAPSE: SYmbolic Neural-Aided Preference Synthesis Engine
Sadanand Modak
Noah T Patton
Işıl Dillig
Joydeep Biswas
263
2
0
25 Mar 2024
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Zhuowan Li
Bhavan A. Jasani
Peng Tang
Shabnam Ghadar
LRM
282
24
0
25 Mar 2024
Conditional computation in neural networks: principles and research trends
Intelligenza Artificiale (IA), 2024
Simone Scardapane
Alessandro Baiocchi
Alessio Devoto
V. Marsocci
Pasquale Minervini
Jary Pomponi
343
6
0
12 Mar 2024
How Far Are We from Intelligent Visual Deductive Reasoning?
Yizhe Zhang
Richard He Bai
Ruixiang Zhang
Jiatao Gu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
ReLM
LRM
355
26
0
07 Mar 2024
CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning
Yanqi Dai
Dong Jing
Nanyi Fei
Zhiwu Lu
Nanyi Fei
Guoxing Yang
Zhiwu Lu
286
4
0
07 Mar 2024
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World
Weiyun Wang
Yiming Ren
Hao Luo
Tiantong Li
Chenxiang Yan
...
Qingyun Li
Lewei Lu
Xizhou Zhu
Yu Qiao
Jifeng Dai
MLLM
274
85
0
29 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Kushagra Pandey
Robert Bamler
Sina Daubener
...
Yixin Wang
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
720
40
0
28 Feb 2024
REFACTOR: Learning to Extract Theorems from Proofs
Jin Peng Zhou
Yuhuai Wu
Qiyang Li
Roger C. Grosse
AIMat
177
9
0
26 Feb 2024
Inference of Abstraction for a Unified Account of Reasoning and Learning
Hiroyuki Kido
48
0
0
14 Feb 2024
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Zhicheng Zheng
Xin Yan
Zhenfang Chen
Jingzhou Wang
Qin Zhi Eddie Lim
Joshua B. Tenenbaum
Chuang Gan
LRM
190
13
0
09 Feb 2024
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning
Yanbin Wei
Shuai Fu
Weisen Jiang
Zejian Zhang
Zhixiong Zeng
Qi Wu
James T. Kwok
Yu Zhang
175
27
0
03 Feb 2024
Neural Language of Thought Models
Yi-Fu Wu
Minseung Lee
Sungjin Ahn
MLLM
VLM
312
9
0
02 Feb 2024
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
Elias Stengel-Eskin
Archiki Prasad
Mohit Bansal
221
19
0
29 Jan 2024
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yueqian Wang
Yuxuan Wang
Kai Chen
Dongyan Zhao
192
2
0
08 Jan 2024
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Aleksandar Stanić
Sergi Caelles
Michael Tschannen
LRM
VLM
270
12
0
03 Jan 2024
Detection-based Intermediate Supervision for Visual Question Answering
Yuhang Liu
Daowan Peng
Wei Wei
Yuanyuan Fu
Wenfeng Xie
Dangyang Chen
159
3
0
26 Dec 2023
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
213
0
0
20 Dec 2023
Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows
Madeleine Grunde-McLaughlin
Michelle S. Lam
Ranjay Krishna
Daniel S. Weld
Jeffrey Heer
AI4CE
276
25
0
18 Dec 2023
InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs
International Conference on Human Factors in Computing Systems (CHI), 2023
Zhongyi Zhou
Jing Jin
Vrushank Phadnis
Xiuxiu Yuan
Jun Jiang
...
A. Olwal
David Kim
Ram Iyengar
Na Li
Andrea Colaço
213
6
0
15 Dec 2023
Recursive Visual Programming
European Conference on Computer Vision (ECCV), 2023
Jiaxin Ge
Sanjay Subramanian
Baifeng Shi
Roei Herzig
Trevor Darrell
178
9
0
04 Dec 2023
Zero-Shot Video Question Answering with Procedural Programs
Rohan Choudhury
Koichiro Niinuma
Kishore Venkateshan
László A. Jeni
159
37
0
01 Dec 2023
Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Xiaoqian Wu
Yong-Lu Li
Jianhua Sun
Cewu Lu
158
29
0
29 Nov 2023
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation
Christel Chappuis
Eliot Walt
Vincent Mendez
Sylvain Lobry
B. L. Saux
D. Tuia
226
7
0
28 Nov 2023
Previous
1
2
3
4
5
...
12
13
14
Next