ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.03067
  4. Cited By
Compositional Attention Networks for Machine Reasoning
v1v2 (latest)

Compositional Attention Networks for Machine Reasoning

8 March 2018
Drew A. Hudson
Christopher D. Manning
    BDLOODLRM
ArXiv (abs)PDFHTML

Papers citing "Compositional Attention Networks for Machine Reasoning"

50 / 330 papers shown
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
MDETR -- Modulated Detection for End-to-End Multi-Modal UnderstandingIEEE International Conference on Computer Vision (ICCV), 2021
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjDVLM
612
1,051
0
26 Apr 2021
Jointly Learning Truth-Conditional Denotations and Groundings using
  Parallel Attention
Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention
Leon Bergen
Dzmitry Bahdanau
Timothy J. O'Donnell
FedML
168
1
0
14 Apr 2021
Object-Centric Representation Learning for Video Question Answering
Object-Centric Representation Learning for Video Question AnsweringIEEE International Joint Conference on Neural Network (IJCNN), 2021
Long Hoang Dang
T. Le
Vuong Le
T. Tran
230
8
0
12 Apr 2021
Explainability-aided Domain Generalization for Image Classification
Explainability-aided Domain Generalization for Image Classification
Robin M. Schmidt
FAttOOD
187
2
0
05 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep LearningArtificial Intelligence Review (AIR), 2021
Alana de Santana Correia
Esther Luna Colombini
HAI
328
255
0
31 Mar 2021
Grounding Physical Concepts of Objects and Events Through Dynamic Visual
  Reasoning
Grounding Physical Concepts of Objects and Events Through Dynamic Visual ReasoningInternational Conference on Learning Representations (ICLR), 2021
Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kwan-Yee K. Wong
J. Tenenbaum
Chuang Gan
VGen
236
100
0
30 Mar 2021
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning
AGQA: A Benchmark for Compositional Spatio-Temporal ReasoningComputer Vision and Pattern Recognition (CVPR), 2021
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
219
146
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Domain-robust VQA with diverse datasets and methods but no target labelsComputer Vision and Pattern Recognition (CVPR), 2021
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
286
25
0
29 Mar 2021
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network
  for Video Reasoning over Traffic Events
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic EventsComputer Vision and Pattern Recognition (CVPR), 2021
Kepeng Xu
He Huang
Jun Liu
ViTLRM
391
110
0
29 Mar 2021
How to Design Sample and Computationally Efficient VQA Models
How to Design Sample and Computationally Efficient VQA Models
Karan Samel
Zelin Zhao
Binghong Chen
Kuan-Chieh Wang
Haozheng Luo
Le Song
159
4
0
22 Mar 2021
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Hopper: Multi-hop Transformer for Spatiotemporal ReasoningInternational Conference on Learning Representations (ICLR), 2021
Honglu Zhou
Asim Kadav
Farley Lai
Alexandru Niculescu-Mizil
Martin Renqiang Min
Mubbasir Kapadia
H. Graf
LRM
207
18
0
19 Mar 2021
Automatic Generation of Contrast Sets from Scene Graphs: Probing the
  Compositional Consistency of GQA
Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQANorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Yonatan Bitton
Gabriel Stanovsky
Roy Schwartz
Michael Elhadad
CoGe
198
33
0
17 Mar 2021
Tune-In: Training Under Negative Environments with Interference for
  Attention Networks Simulating Cocktail Party Effect
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party EffectAAAI Conference on Artificial Intelligence (AAAI), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
125
7
0
02 Mar 2021
Contrastive Separative Coding for Self-supervised Representation
  Learning
Contrastive Separative Coding for Self-supervised Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
SSL
123
3
0
01 Mar 2021
KANDINSKYPatterns -- An experimental exploration environment for Pattern
  Analysis and Machine Intelligence
KANDINSKYPatterns -- An experimental exploration environment for Pattern Analysis and Machine Intelligence
Andreas Holzinger
Anna Saranti
Heimo Mueller
286
11
0
28 Feb 2021
ViLT: Vision-and-Language Transformer Without Convolution or Region
  Supervision
ViLT: Vision-and-Language Transformer Without Convolution or Region SupervisionInternational Conference on Machine Learning (ICML), 2021
Wonjae Kim
Bokyung Son
Ildoo Kim
VLMCLIP
547
2,101
0
05 Feb 2021
Open World Compositional Zero-Shot Learning
Open World Compositional Zero-Shot LearningComputer Vision and Pattern Recognition (CVPR), 2021
Goran Frehse
Muhammad Ferjad Naeem
Yongqin Xian
Zeynep Akata
CoGe
406
160
0
29 Jan 2021
HySTER: A Hybrid Spatio-Temporal Event Reasoner
HySTER: A Hybrid Spatio-Temporal Event Reasoner
Théophile Sautory
Nuri Cingillioglu
A. Russo
NAI
167
4
0
17 Jan 2021
Understanding the Role of Scene Graphs in Visual Question Answering
Understanding the Role of Scene Graphs in Visual Question Answering
Vinay Damodaran
Sharanya Chakravarthy
Akshay Kumar
Anjana Umapathy
Teruko Mitamura
Yuta Nakashima
Noa Garcia
Chenhui Chu
GNN
260
38
0
14 Jan 2021
Improving Multi-hop Knowledge Base Question Answering by Learning
  Intermediate Supervision Signals
Improving Multi-hop Knowledge Base Question Answering by Learning Intermediate Supervision SignalsWeb Search and Data Mining (WSDM), 2021
Gaole He
Yunshi Lan
Jing Jiang
Wayne Xin Zhao
Ji-Rong Wen
437
243
0
11 Jan 2021
Progressive Interpretation Synthesis: Interpreting Task Solving by
  Quantifying Previously Used and Unused Information
Progressive Interpretation Synthesis: Interpreting Task Solving by Quantifying Previously Used and Unused InformationNeural Computation (Neural Comput.), 2021
Zhengqi He
Taro Toyoizumi
242
1
0
08 Jan 2021
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Minne Li
Mengyue Yang
Furui Liu
Xu Chen
Zhitang Chen
Jun Wang
SyDaCML
182
16
0
28 Dec 2020
Object-Centric Diagnosis of Visual Reasoning
Object-Centric Diagnosis of Visual Reasoning
Jianwei Yang
Jiayuan Mao
Jiajun Wu
Devi Parikh
David D. Cox
J. Tenenbaum
Chuang Gan
OCL
190
17
0
21 Dec 2020
Attention over learned object embeddings enables complex visual
  reasoning
Attention over learned object embeddings enables complex visual reasoningNeural Information Processing Systems (NeurIPS), 2020
David Ding
Felix Hill
Adam Santoro
Malcolm Reynolds
M. Botvinick
OCL
357
78
0
15 Dec 2020
On the Binding Problem in Artificial Neural Networks
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
575
288
0
09 Dec 2020
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
Tayfun Ates
Muhammed Samil Atesoglu
Cagatay Yigit
.Ilker Kesen
Mert Kobaş
Erkut Erdem
Aykut Erdem
T. Goksun
Deniz Yuret
311
38
0
08 Dec 2020
Revisiting Iterative Back-Translation from the Perspective of
  Compositional Generalization
Revisiting Iterative Back-Translation from the Perspective of Compositional Generalization
Yinuo Guo
Hualei Zhu
Zeqi Lin
Bei Chen
Jian-Guang Lou
Dongmei Zhang
BDL
430
28
0
08 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering
WeaQA: Weak Supervision via Captions for Visual Question AnsweringFindings (Findings), 2020
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
329
38
0
04 Dec 2020
Learning from Lexical Perturbations for Consistent Visual Question
  Answering
Learning from Lexical Perturbations for Consistent Visual Question Answering
Spencer Whitehead
Hui Wu
Yi R. Fung
Heng Ji
Rogerio Feris
Kate Saenko
145
11
0
26 Nov 2020
Transformation Driven Visual Reasoning
Transformation Driven Visual ReasoningComputer Vision and Pattern Recognition (CVPR), 2020
Xin Hong
Yanyan Lan
Liang Pang
Jiafeng Guo
Xueqi Cheng
LRM
175
25
0
26 Nov 2020
Interpretable Visual Reasoning via Induced Symbolic Space
Interpretable Visual Reasoning via Induced Symbolic SpaceIEEE International Conference on Computer Vision (ICCV), 2020
Zhonghao Wang
Kai Wang
Mo Yu
Jinjun Xiong
Wen-mei W. Hwu
M. Hasegawa-Johnson
Humphrey Shi
LRMOCL
212
21
0
23 Nov 2020
LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular
  Supervision for Visual Question Answering
LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering
Weixin Liang
Fei Niu
Aishwarya N. Reganti
Govind Thattai
Gokhan Tur
191
19
0
21 Nov 2020
Logically Consistent Loss for Visual Question Answering
Logically Consistent Loss for Visual Question Answering
Anh-Cat Le-Ngo
T. Tran
Santu Rana
Sunil R. Gupta
Svetha Venkatesh
OOD
191
0
0
19 Nov 2020
Reasoning Over History: Context Aware Visual Dialog
Reasoning Over History: Context Aware Visual Dialog
Muhammad A. Shah
Shikib Mehri
Tejas Srinivasan
157
4
0
02 Nov 2020
Measuring non-trivial compositionality in emergent communication
Measuring non-trivial compositionality in emergent communication
Tomasz Korbak
Julian Zubek
Joanna Rkaczaszek-Leonardi
224
12
0
28 Oct 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual
  Question Answering
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question AnsweringFindings (Findings), 2020
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
213
61
0
27 Oct 2020
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question
  Answering
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question AnsweringIEEE transactions on multimedia (TMM), 2020
Zanxia Jin
Heran Wu
Chun Yang
Fang Zhou
Jingyan Qin
Lei Xiao
Xu-Cheng Yin
232
37
0
24 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual
  Questions
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
266
26
0
24 Oct 2020
Deep Reinforcement Learning with Stacked Hierarchical Attention for
  Text-based Games
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games
Yunqiu Xu
Meng Fang
Ling-Hao Chen
Yali Du
Qiufeng Wang
Chengqi Zhang
OffRL
286
48
0
22 Oct 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing
  Functional Entropies
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional EntropiesNeural Information Processing Systems (NeurIPS), 2020
Itai Gat
Idan Schwartz
Alex Schwing
Tamir Hazan
258
98
0
21 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A ReviewNeurocomputing (Neurocomputing), 2020
Wei Chen
Weiping Wang
Tianpeng Liu
M. Lew
VLM
329
36
0
16 Oct 2020
Interpretable Neural Computation for Real-World Compositional Visual
  Question Answering
Interpretable Neural Computation for Real-World Compositional Visual Question AnsweringChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2020
Ruixue Tang
Chao Ma
CoGe
78
2
0
10 Oct 2020
Think before you act: A simple baseline for compositional generalization
Think before you act: A simple baseline for compositional generalization
C. Heinze-Deml
Diane Bouchacourt
CoGe
295
16
0
29 Sep 2020
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language
  Grounded Image Scenes
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes
Raeid Saqur
Ameet Deshpande
GNNNAI
137
0
0
19 Sep 2020
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary
Thierry Deruyttere
Simon Vandenhende
Dusan Grujicic
Yu Liu
Luc Van Gool
Matthew Blaschko
Tinne Tuytelaars
Marie-Francine Moens
227
6
0
18 Sep 2020
Cosine meets Softmax: A tough-to-beat baseline for visual grounding
Cosine meets Softmax: A tough-to-beat baseline for visual grounding
N. Rufus
U. R. Nair
K. M. Krishna
Vineet Gandhi
201
15
0
13 Sep 2020
Span-based Semantic Parsing for Compositional Generalization
Span-based Semantic Parsing for Compositional GeneralizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Jonathan Herzig
Jonathan Berant
ReLMLRM
238
103
0
13 Sep 2020
AttnGrounder: Talking to Cars with Attention
AttnGrounder: Talking to Cars with Attention
Vivek Mittal
ViT
254
16
0
11 Sep 2020
Systematic Generalization on gSCAN with Language Conditioned Embedding
Systematic Generalization on gSCAN with Language Conditioned Embedding
Tong Gao
Qi Huang
Raymond J. Mooney
228
22
0
11 Sep 2020
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
249
13
0
18 Aug 2020
Previous
1234567
Next