Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.03067
Cited By
v1
v2 (latest)
Compositional Attention Networks for Machine Reasoning
8 March 2018
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Compositional Attention Networks for Machine Reasoning"
50 / 330 papers shown
Title
SRNN: Spatiotemporal Relational Neural Network for Intuitive Physics Understanding
Fei Yang
105
0
0
10 Nov 2025
Memorizing Long-tail Data Can Help Generalization Through Composition
Mo Zhou
Haoyang Ma
Rong Ge
TDI
345
0
0
18 Oct 2025
Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs
SuYang Xi
Chenxi Yang
Hong Ding
Yiqing Ni
Catherine C. Liu
Yunhao Liu
Chengqi Zhang
LRM
98
0
0
12 Oct 2025
The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara
Varindra V. Persad Maharaj
ELM
149
1
0
05 Oct 2025
Can Constructions "SCAN" Compositionality ?
Ganesh Katrapati
Manish Shrivastava
88
0
0
24 Sep 2025
Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies
Ayushman Sarkar
Mohd Yamani Idna Idris
Zhenyu Yu
LRM
144
8
0
14 Aug 2025
Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance
Yuchu Jiang
Jian Zhao
Yuchen Yuan
Tianle Zhang
Yao Huang
...
Ya Zhang
Shuicheng Yan
Chi Zhang
Z. He
Xuelong Li
SILM
442
2
0
12 Aug 2025
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A
Chen Li
Chinthani Sugandhika
Yeo Keat Ee
Eric Peh
Hao Zhang
Hong Yang
Deepu Rajan
Basura Fernando
LRM
136
1
0
04 Aug 2025
Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Adam Ishay
Zhun Yang
Joohyung Lee
Ilgu Kang
Dongjae Lim
NAI
254
1
0
12 Jun 2025
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding
Bin Guo
John H.L. Hansen
214
0
0
11 Jun 2025
Inherently Faithful Attention Maps for Vision Transformers
Ananthu Aniraj
C. Dantas
Dino Ienco
Diego Marcos
OOD
OCL
490
1
0
10 Jun 2025
A Good CREPE needs more than just Sugar: Investigating Biases in Compositional Vision-Language Benchmarks
Vishaal Udandarao
Mehdi Cherti
Shyamgopal Karthik
J. Jitsev
Samuel Albanie
Matthias Bethge
CoGe
184
1
0
09 Jun 2025
Multi-Sourced Compositional Generalization in Visual Question Answering
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Chuanhao Li
Wenbo Ye
Zhen Li
Yuwei Wu
Yunde Jia
CoGe
193
0
0
29 May 2025
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning
Tien Pham
Angelo Cangelosi
202
1
0
14 Apr 2025
Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Davide Berasi
Matteo Farina
Goran Frehse
Elisa Ricci
Nicola Strisciuglio
CoGe
273
3
0
21 Mar 2025
Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering
Thanh-Son Nguyen
Hong Yang
Tzeh Yuan Neoh
Hao Zhang
Ee Yeo Keat
Basura Fernando
NAI
355
3
0
19 Mar 2025
Visual Graph Question Answering with ASP and LLMs for Language Parsing
International Conference on Logic Programming (ICLP), 2025
Jakob Johannes Bauer
Thomas Eiter
Nelson Higuera Ruiz
J. Oetsch
GNN
336
0
0
13 Feb 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
407
2
0
13 Jan 2025
Consistency of Compositional Generalization across Multiple Levels
AAAI Conference on Artificial Intelligence (AAAI), 2024
Chuanhao Li
Zhen Li
Chenchen Jing
Xiaomeng Fan
Wenbo Ye
Yuwei Wu
Yunde Jia
CoGe
233
0
0
18 Dec 2024
Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing
Bin Chen
Wenbo Yu
Qinshan Zhang
Tianqu Zhuang
Yong Jiang
Shu-Tao Xia
686
1
0
24 Nov 2024
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
Neural Information Processing Systems (NeurIPS), 2024
Shantanu Jaiswal
Debaditya Roy
Basura Fernando
Cheston Tan
ReLM
LRM
327
5
0
20 Nov 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
261
8
0
17 Nov 2024
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem
Neural Information Processing Systems (NeurIPS), 2024
Declan Campbell
Sunayana Rane
Tyler Giallanza
Nicolò De Sabbata
Kia Ghods
...
Alexander Ku
Steven M. Frankland
Thomas Griffiths
Jonathan D. Cohen
Taylor W. Webb
385
50
0
31 Oct 2024
Compositional Physical Reasoning of Objects and Events from Videos
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zhenfang Chen
Shilong Dong
Kexin Yi
Yunzhu Li
Mingyu Ding
Antonio Torralba
Joshua B. Tenenbaum
Chuang Gan
OCL
325
9
0
02 Aug 2024
SADL: An Effective In-Context Learning Method for Compositional Visual QA
Long Hoang Dang
T. Le
Vuong Le
Tu Minh Phuong
Truyen Tran
ReLM
CoGe
262
4
0
02 Jul 2024
On the Role of Visual Grounding in VQA
Daniel Reich
Tanja Schultz
201
2
0
26 Jun 2024
How Could AI Support Design Education? A Study Across Fields Fuels Situating Analytics
Ajit Jain
Andruid Kerne
Hannah Fowler
Jinsil Seo
Galen Newman
Nic Lupfer
Aaron Perrine
95
3
0
26 Apr 2024
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
288
18
0
12 Apr 2024
REFACTOR: Learning to Extract Theorems from Proofs
Jin Peng Zhou
Yuhuai Wu
Qiyang Li
Roger C. Grosse
AIMat
177
9
0
26 Feb 2024
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Zhicheng Zheng
Xin Yan
Zhenfang Chen
Jingzhou Wang
Qin Zhi Eddie Lim
Joshua B. Tenenbaum
Chuang Gan
LRM
194
14
0
09 Feb 2024
Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model
European Conference on Artificial Intelligence (ECAI), 2024
Taehee Kim
Yeongjae Cho
Heejun Shin
Yohan Jo
Dongmyung Shin
319
6
0
12 Jan 2024
Towards Goal-Oriented Agents for Evolving Problems Observed via Conversation
SGAI Conferences (SGAI), 2024
Michael Free
Andrew Langworthy
Mary Dimitropoulaki
Simon Thompson
LLMAG
149
1
0
11 Jan 2024
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yueqian Wang
Yuxuan Wang
Kai Chen
Dongyan Zhao
196
2
0
08 Jan 2024
Detection-based Intermediate Supervision for Visual Question Answering
Yuhang Liu
Daowan Peng
Wei Wei
Yuanyuan Fu
Wenfeng Xie
Dangyang Chen
163
3
0
26 Dec 2023
Interactive Visual Task Learning for Robots
Weiwei Gu
Anant Sah
N. Gopalan
219
7
0
20 Dec 2023
EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering
Junjue Wang
Zhuo Zheng
Zihang Chen
A. Ma
Yanfei Zhong
165
43
0
19 Dec 2023
Benchmarks for Physical Reasoning AI
Andrew Melnik
Robin Schiewer
Moritz Lange
Andrei Muresanu
Mozhgan Saeidi
Animesh Garg
Helge J. Ritter
334
9
0
17 Dec 2023
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models
Haicheng Liao
Huanming Shen
Zhenning Li
Chengyue Wang
Guofa Li
Yiming Bie
Chengzhong Xu
214
79
0
06 Dec 2023
Attribute Diversity Determines the Systematicity Gap in VQA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ian Berlot-Attwell
Kumar Krishna Agrawal
A. M. Carrell
Yash Sharma
Naomi Saphra
226
2
0
15 Nov 2023
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Zhenfang Chen
Rui Sun
Wenjun Liu
Yining Hong
Chuang Gan
LRM
294
22
0
08 Nov 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Te-Lin Wu
Zi-Yi Dou
Qingyuan Hu
Yu Hou
Nischal Reddy Chandra
Marjorie Freedman
R. Weischedel
Nanyun Peng
274
9
0
02 Nov 2023
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
Neural Information Processing Systems (NeurIPS), 2023
Xingrui Wang
Wufei Ma
Zhuowan Li
Adam Kortylewski
Yaoyao Liu
CoGe
270
20
0
27 Oct 2023
What's Left? Concept Grounding with Logic-Enhanced Foundation Models
Neural Information Processing Systems (NeurIPS), 2023
Joy Hsu
Jiayuan Mao
Joshua B. Tenenbaum
Jiajun Wu
VLM
ReLM
LRM
380
38
0
24 Oct 2023
Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Osman Batur .Ince
Tanin Zeraati
Semih Yagcioglu
Yadollah Yaghoobzadeh
Erkut Erdem
Aykut Erdem
146
3
0
18 Oct 2023
LLark: A Multimodal Instruction-Following Language Model for Music
International Conference on Machine Learning (ICML), 2023
Josh Gardner
Simon Durand
Daniel Stoller
Rachel M. Bittner
AuLLM
289
28
0
11 Oct 2023
InstructDET: Diversifying Referring Object Detection with Generalized Instructions
International Conference on Learning Representations (ICLR), 2023
Ronghao Dang
Jiangyan Feng
Haodong Zhang
Chongjian Ge
Lin Song
...
Chengju Liu
Qi Chen
Feng Zhu
Rui Zhao
Yibing Song
ObjD
399
16
0
08 Oct 2023
Sentence Attention Blocks for Answer Grounding
IEEE International Conference on Computer Vision (ICCV), 2023
Seyedalireza Khoshsirat
Chandra Kambhamettu
168
8
0
20 Sep 2023
D3: Data Diversity Design for Systematic Generalization in Visual Question Answering
Amir Rahimi
Vanessa D’Amario
Moyuru Yamada
Kentaro Takemoto
Tomotake Sasaki
Xavier Boix
153
2
0
15 Sep 2023
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
European Conference on Computer Vision (ECCV), 2023
Cheng Shi
Sibei Yang
LRM
162
12
0
03 Sep 2023
Learning the meanings of function words from grounded language using a visual question answering model
Cognitive Sciences (CogSci), 2023
Eva Portelance
Michael C. Frank
Dan Jurafsky
NAI
263
7
0
16 Aug 2023
1
2
3
4
5
6
7
Next