v1v2 (latest)

Compositional Attention Networks for Machine Reasoning

8 March 2018

Drew A. Hudson

Christopher D. Manning

Papers citing "Compositional Attention Networks for Machine Reasoning"

50 / 330 papers shown

MDETR -- Modulated Detection for End-to-End Multi-Modal UnderstandingIEEE International Conference on Computer Vision (ICCV), 2021

612

1,051

26 Apr 2021

Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention

168

14 Apr 2021

Object-Centric Representation Learning for Video Question AnsweringIEEE International Joint Conference on Neural Network (IJCNN), 2021

230

12 Apr 2021

Explainability-aided Domain Generalization for Image Classification

Robin M. Schmidt

FAtt OOD

187

05 Apr 2021

Attention, please! A survey of Neural Attention Models in Deep LearningArtificial Intelligence Review (AIR), 2021

Alana de Santana Correia

Esther Luna Colombini

HAI

328

255

31 Mar 2021

Grounding Physical Concepts of Objects and Events Through Dynamic Visual ReasoningInternational Conference on Learning Representations (ICLR), 2021

Zhenfang Chen

Jiayuan Mao

Jiajun Wu

Kwan-Yee K. Wong

J. Tenenbaum

Chuang Gan

VGen

236

100

30 Mar 2021

AGQA: A Benchmark for Compositional Spatio-Temporal ReasoningComputer Vision and Pattern Recognition (CVPR), 2021

Madeleine Grunde-McLaughlin

Ranjay Krishna

Maneesh Agrawala

CoGe

219

146

30 Mar 2021

Domain-robust VQA with diverse datasets and methods but no target labelsComputer Vision and Pattern Recognition (CVPR), 2021

286

29 Mar 2021

SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic EventsComputer Vision and Pattern Recognition (CVPR), 2021

Kepeng Xu

He Huang

Jun Liu

ViT LRM

391

110

29 Mar 2021

How to Design Sample and Computationally Efficient VQA Models

159

22 Mar 2021

Hopper: Multi-hop Transformer for Spatiotemporal ReasoningInternational Conference on Learning Representations (ICLR), 2021

Honglu Zhou

Asim Kadav

Farley Lai

Alexandru Niculescu-Mizil

207

19 Mar 2021

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQANorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Gabriel Stanovsky

198

17 Mar 2021

Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party EffectAAAI Conference on Artificial Intelligence (AAAI), 2021

125

02 Mar 2021

Contrastive Separative Coding for Self-supervised Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

123

01 Mar 2021

KANDINSKYPatterns -- An experimental exploration environment for Pattern Analysis and Machine Intelligence

Andreas Holzinger

Anna Saranti

Heimo Mueller

286

28 Feb 2021

ViLT: Vision-and-Language Transformer Without Convolution or Region SupervisionInternational Conference on Machine Learning (ICML), 2021

547

2,101

05 Feb 2021

Open World Compositional Zero-Shot LearningComputer Vision and Pattern Recognition (CVPR), 2021

Goran Frehse

Muhammad Ferjad Naeem

Yongqin Xian

Zeynep Akata

CoGe

406

160

29 Jan 2021

HySTER: A Hybrid Spatio-Temporal Event Reasoner

167

17 Jan 2021

Understanding the Role of Scene Graphs in Visual Question Answering

Vinay Damodaran

Sharanya Chakravarthy

260

14 Jan 2021

Improving Multi-hop Knowledge Base Question Answering by Learning Intermediate Supervision SignalsWeb Search and Data Mining (WSDM), 2021

437

243

11 Jan 2021

Progressive Interpretation Synthesis: Interpreting Task Solving by Quantifying Previously Used and Unused InformationNeural Computation (Neural Comput.), 2021

Zhengqi He

Taro Toyoizumi

242

08 Jan 2021

Causal World Models by Unsupervised Deconfounding of Physical Dynamics

182

28 Dec 2020

Object-Centric Diagnosis of Visual Reasoning

Jianwei Yang

Jiayuan Mao

Jiajun Wu

Devi Parikh

David D. Cox

J. Tenenbaum

Chuang Gan

OCL

190

21 Dec 2020

Attention over learned object embeddings enables complex visual reasoningNeural Information Processing Systems (NeurIPS), 2020

357

15 Dec 2020

On the Binding Problem in Artificial Neural Networks

Klaus Greff

Sjoerd van Steenkiste

Jürgen Schmidhuber

OCL

575

288

09 Dec 2020

CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions

Tayfun Ates

Muhammed Samil Atesoglu

311

08 Dec 2020

Revisiting Iterative Back-Translation from the Perspective of Compositional Generalization

430

08 Dec 2020

WeaQA: Weak Supervision via Captions for Visual Question AnsweringFindings (Findings), 2020

Pratyay Banerjee

Tejas Gokhale

Yezhou Yang

Chitta Baral

329

04 Dec 2020

Learning from Lexical Perturbations for Consistent Visual Question Answering

Heng Ji

145

26 Nov 2020

Transformation Driven Visual ReasoningComputer Vision and Pattern Recognition (CVPR), 2020

Liang Pang

175

26 Nov 2020

Interpretable Visual Reasoning via Induced Symbolic SpaceIEEE International Conference on Computer Vision (ICCV), 2020

Jinjun Xiong

212

23 Nov 2020

LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering

Govind Thattai

191

21 Nov 2020

Logically Consistent Loss for Visual Question Answering

191

19 Nov 2020

Reasoning Over History: Context Aware Visual Dialog

Muhammad A. Shah

Shikib Mehri

Tejas Srinivasan

157

02 Nov 2020

Measuring non-trivial compositionality in emergent communication

Tomasz Korbak

Julian Zubek

Joanna Rkaczaszek-Leonardi

224

28 Oct 2020

MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question AnsweringFindings (Findings), 2020

213

27 Oct 2020

RUArt: A Novel Text-Centered Solution for Text-Based Visual Question AnsweringIEEE transactions on multimedia (TMM), 2020

232

24 Oct 2020

Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions

Radhika Dua

Sai Srinivas Kancheti

V. Balasubramanian

LRM

266

24 Oct 2020

Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games

286

22 Oct 2020

Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional EntropiesNeural Information Processing Systems (NeurIPS), 2020

258

21 Oct 2020

New Ideas and Trends in Deep Multimodal Content Understanding: A ReviewNeurocomputing (Neurocomputing), 2020

329

16 Oct 2020

Interpretable Neural Computation for Real-World Compositional Visual Question AnsweringChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2020

Ruixue Tang

Chao Ma

CoGe

10 Oct 2020

Think before you act: A simple baseline for compositional generalization

C. Heinze-Deml

Diane Bouchacourt

CoGe

295

29 Sep 2020

CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes

Raeid Saqur

Ameet Deshpande

GNN NAI

137

19 Sep 2020

Commands 4 Autonomous Vehicles (C4AV) Workshop Summary

Luc Van Gool

Matthew Blaschko

Tinne Tuytelaars

Marie-Francine Moens

227

18 Sep 2020

Cosine meets Softmax: A tough-to-beat baseline for visual grounding

201

13 Sep 2020

Span-based Semantic Parsing for Compositional GeneralizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Jonathan Herzig

Jonathan Berant

ReLM LRM

238

103

13 Sep 2020

AttnGrounder: Talking to Cars with Attention

Vivek Mittal

ViT

254

11 Sep 2020

Systematic Generalization on gSCAN with Language Conditioned Embedding

Tong Gao

Qi Huang

Raymond J. Mooney

228

11 Sep 2020

Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents

Ye Zhu

Yu Wu

Yi Yang

Yan Yan

249

18 Aug 2020