Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.06890
Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
20 December 2016
Justin Johnson
B. Hariharan
L. V. D. van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"
50 / 1,475 papers shown
Title
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Medhini Narasimhan
Svetlana Lazebnik
A. Schwing
NAI
GNN
ReLM
26
11
0
01 Nov 2018
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
LRM
39
589
0
01 Nov 2018
Dilated DenseNets for Relational Reasoning
Antreas Antoniou
Agnieszka Słowik
Elliot J. Crowley
Amos Storkey
GNN
NAI
23
2
0
01 Nov 2018
Textbook Question Answering with Multi-modal Context Graph Understanding and Self-supervised Open-set Comprehension
Daesik Kim
Seonhoon Kim
Nojun Kwak
22
2
0
01 Nov 2018
TallyQA: Answering Complex Counting Questions
Manoj Acharya
Kushal Kafle
Christopher Kanan
19
112
0
29 Oct 2018
ReviewQA: a relational aspect-based opinion reading dataset
Quentin Grail
J. Perez
RALM
14
4
0
29 Oct 2018
Understand, Compose and Respond - Answering Visual Questions by a Composition of Abstract Procedures
B. Vatashsky
S. Ullman
CoGe
26
1
0
25 Oct 2018
PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution
Hong Chen
Zhenhua Fan
Hao Lu
Alan Yuille
Shu Rong
27
59
0
23 Oct 2018
Investigating Object Compositionality in Generative Adversarial Networks
Sjoerd van Steenkiste
Karol Kurach
Jürgen Schmidhuber
Sylvain Gelly
GAN
OCL
29
20
0
17 Oct 2018
Visual Semantic Navigation using Scene Priors
Wei Yang
Xueliang Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
33
320
0
15 Oct 2018
Knowing Where to Look? Analysis on Attention of Visual Question Answering System
Wei Li
Zehuan Yuan
Xiangzhong Fang
Changhu Wang
24
8
0
09 Oct 2018
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
AAML
20
235
0
08 Oct 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
46
599
0
04 Oct 2018
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
Hyeonwoo Noh
Taehoon Kim
Jonghwan Mun
Bohyung Han
31
17
0
03 Oct 2018
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
16
42
0
01 Oct 2018
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA
Shailza Jolly
Sandro Pezzelle
T. Klein
Andreas Dengel
Moin Nabi
27
2
0
12 Sep 2018
The Visual QA Devil in the Details: The Impact of Early Fusion and Batch Norm on CLEVR
Mateusz Malinowski
Carl Doersch
ReLM
19
12
0
11 Sep 2018
How clever is the FiLM model, and how clever can it be?
A. Kuhnle
Huiyuan Xie
Ann A. Copestake
30
6
0
09 Sep 2018
Cascaded Mutual Modulation for Visual Reasoning
Yiqun Yao
Jiaming Xu
Feng Wang
Bo Xu
LRM
23
14
0
06 Sep 2018
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
24
164
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
TVQA: Localized, Compositional Video Question Answering
Muhammad Abdul Wahab
Licheng Yu
Mounir Nasr Allah
Tamara L. Berg
36
617
0
05 Sep 2018
Localizing Moments in Video with Temporal Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
27
158
0
05 Sep 2018
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Medhini Narasimhan
A. Schwing
24
105
0
04 Sep 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
18
171
0
04 Sep 2018
Towards a Better Metric for Evaluating Question Generation Systems
Preksha Nema
Mitesh M. Khapra
18
108
0
30 Aug 2018
Neural Compositional Denotational Semantics for Question Answering
Nitish Gupta
M. Lewis
BDL
KELM
CoGe
14
23
0
29 Aug 2018
Human-centric Indoor Scene Synthesis Using Stochastic Grammar
Siyuan Qi
Yixin Zhu
Siyuan Huang
Chenfanfu Jiang
Song-Chun Zhu
3DV
23
182
0
25 Aug 2018
Question-Guided Hybrid Convolution for Visual Question Answering
Peng Gao
Pan Lu
Hongsheng Li
Shuang Li
Yikang Li
Guosheng Lin
Xiaogang Wang
29
68
0
08 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
28
26
0
03 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
27
96
0
01 Aug 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
57
836
0
01 Aug 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
41
220
0
28 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRM
OCL
30
197
0
23 Jul 2018
Measuring abstract reasoning in neural networks
David Barrett
Felix Hill
Adam Santoro
Ari S. Morcos
Timothy Lillicrap
OOD
25
356
0
11 Jul 2018
Deep Structured Generative Models
Kun Xu
Haoyun Liang
Jun Zhu
Hang Su
Bo Zhang
GAN
15
7
0
10 Jul 2018
Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful
R. Chidambaram
Michael C. Kampffmeyer
Willie Neiswanger
Xiaodan Liang
T. Lachmann
Eric Xing
23
0
0
10 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
27
124
0
09 Jul 2018
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu
Joel Lehman
Piero Molino
F. Such
Eric Frank
Alexander Sergeev
J. Yosinski
27
884
0
09 Jul 2018
COSMO: Contextualized Scene Modeling with Boltzmann Machines
Ilker Bozcan
Sinan Kalkan
20
15
0
02 Jul 2018
A New Benchmark and Progress Toward Improved Weakly Supervised Learning
Jason Ramapuram
Russ Webb
SSL
14
3
0
30 Jun 2018
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Jason Jo
Vikas Verma
Yoshua Bengio
OOD
9
8
0
18 Jun 2018
Grounded Textual Entailment
H. Vu
Claudio Greco
A. Erofeeva
Somayeh Jafaritazehjan
Guido M. Linders
Marc Tanti
A. Testoni
Raffaella Bernardi
Albert Gatt
19
29
0
14 Jun 2018
FigureNet: A Deep Learning model for Question-Answering on Scientific Plots
Revanth Reddy Gangi Reddy
Rahul Ramesh
Ameet Deshpande
Mitesh M. Khapra
AIMat
OOD
GNN
35
22
0
12 Jun 2018
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
22
143
0
11 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
27
49
0
10 Jun 2018
Visual Reasoning by Progressive Module Networks
Seung Wook Kim
Makarand Tapaswi
Sanja Fidler
ReLM
LRM
36
13
0
06 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li-Jia Li
Alexander G. Hauptmann
33
110
0
05 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wei Liu
Syed Zulqarnain Gilani
Mubarak Shah
11
91
0
01 Jun 2018
Think Visually: Question Answering through Virtual Imagery
Ankit Goyal
Jian Wang
Jia Deng
27
2
0
25 May 2018
Previous
1
2
3
...
27
28
29
30
Next