ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.06890
  4. Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016
Justin Johnson
B. Hariharan
L. V. D. van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
    CoGe
ArXivPDFHTML

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,475 papers shown
Title
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual
  Question Answering
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Medhini Narasimhan
Svetlana Lazebnik
A. Schwing
NAI
GNN
ReLM
26
11
0
01 Nov 2018
A Corpus for Reasoning About Natural Language Grounded in Photographs
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
LRM
39
589
0
01 Nov 2018
Dilated DenseNets for Relational Reasoning
Dilated DenseNets for Relational Reasoning
Antreas Antoniou
Agnieszka Słowik
Elliot J. Crowley
Amos Storkey
GNN
NAI
23
2
0
01 Nov 2018
Textbook Question Answering with Multi-modal Context Graph Understanding
  and Self-supervised Open-set Comprehension
Textbook Question Answering with Multi-modal Context Graph Understanding and Self-supervised Open-set Comprehension
Daesik Kim
Seonhoon Kim
Nojun Kwak
22
2
0
01 Nov 2018
TallyQA: Answering Complex Counting Questions
TallyQA: Answering Complex Counting Questions
Manoj Acharya
Kushal Kafle
Christopher Kanan
19
112
0
29 Oct 2018
ReviewQA: a relational aspect-based opinion reading dataset
ReviewQA: a relational aspect-based opinion reading dataset
Quentin Grail
J. Perez
RALM
14
4
0
29 Oct 2018
Understand, Compose and Respond - Answering Visual Questions by a
  Composition of Abstract Procedures
Understand, Compose and Respond - Answering Visual Questions by a Composition of Abstract Procedures
B. Vatashsky
S. Ullman
CoGe
26
1
0
25 Oct 2018
PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference
  Resolution
PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution
Hong Chen
Zhenhua Fan
Hao Lu
Alan Yuille
Shu Rong
27
59
0
23 Oct 2018
Investigating Object Compositionality in Generative Adversarial Networks
Investigating Object Compositionality in Generative Adversarial Networks
Sjoerd van Steenkiste
Karol Kurach
Jürgen Schmidhuber
Sylvain Gelly
GAN
OCL
29
20
0
17 Oct 2018
Visual Semantic Navigation using Scene Priors
Visual Semantic Navigation using Scene Priors
Wei Yang
Xueliang Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
33
320
0
15 Oct 2018
Knowing Where to Look? Analysis on Attention of Visual Question
  Answering System
Knowing Where to Look? Analysis on Attention of Visual Question Answering System
Wei Li
Zehuan Yuan
Xiangzhong Fang
Changhu Wang
24
8
0
09 Oct 2018
Overcoming Language Priors in Visual Question Answering with Adversarial
  Regularization
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization
S. Ramakrishnan
Aishwarya Agrawal
Stefan Lee
AAML
20
235
0
08 Oct 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language
  Understanding
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
46
599
0
04 Oct 2018
Transfer Learning via Unsupervised Task Discovery for Visual Question
  Answering
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
Hyeonwoo Noh
Taehoon Kim
Jonghwan Mun
Bohyung Han
31
17
0
03 Oct 2018
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
16
42
0
01 Oct 2018
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in
  the Evaluation of VQA
The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA
Shailza Jolly
Sandro Pezzelle
T. Klein
Andreas Dengel
Moin Nabi
27
2
0
12 Sep 2018
The Visual QA Devil in the Details: The Impact of Early Fusion and Batch
  Norm on CLEVR
The Visual QA Devil in the Details: The Impact of Early Fusion and Batch Norm on CLEVR
Mateusz Malinowski
Carl Doersch
ReLM
19
12
0
11 Sep 2018
How clever is the FiLM model, and how clever can it be?
How clever is the FiLM model, and how clever can it be?
A. Kuhnle
Huiyuan Xie
Ann A. Copestake
30
6
0
09 Sep 2018
Cascaded Mutual Modulation for Visual Reasoning
Cascaded Mutual Modulation for Visual Reasoning
Yiqun Yao
Jiaming Xu
Feng Wang
Bo Xu
LRM
23
14
0
06 Sep 2018
Visual Coreference Resolution in Visual Dialog using Neural Module
  Networks
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Satwik Kottur
José M. F. Moura
Devi Parikh
Dhruv Batra
Marcus Rohrbach
24
164
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
55
0
06 Sep 2018
TVQA: Localized, Compositional Video Question Answering
TVQA: Localized, Compositional Video Question Answering
Muhammad Abdul Wahab
Licheng Yu
Mounir Nasr Allah
Tamara L. Berg
36
617
0
05 Sep 2018
Localizing Moments in Video with Temporal Language
Localizing Moments in Video with Temporal Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
27
158
0
05 Sep 2018
Straight to the Facts: Learning Knowledge Base Retrieval for Factual
  Visual Question Answering
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Medhini Narasimhan
A. Schwing
24
105
0
04 Sep 2018
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking
  Recipes
RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes
Semih Yagcioglu
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
CoGe
18
171
0
04 Sep 2018
Towards a Better Metric for Evaluating Question Generation Systems
Towards a Better Metric for Evaluating Question Generation Systems
Preksha Nema
Mitesh M. Khapra
18
108
0
30 Aug 2018
Neural Compositional Denotational Semantics for Question Answering
Neural Compositional Denotational Semantics for Question Answering
Nitish Gupta
M. Lewis
BDL
KELM
CoGe
14
23
0
29 Aug 2018
Human-centric Indoor Scene Synthesis Using Stochastic Grammar
Human-centric Indoor Scene Synthesis Using Stochastic Grammar
Siyuan Qi
Yixin Zhu
Siyuan Huang
Chenfanfu Jiang
Song-Chun Zhu
3DV
23
182
0
25 Aug 2018
Question-Guided Hybrid Convolution for Visual Question Answering
Question-Guided Hybrid Convolution for Visual Question Answering
Peng Gao
Pan Lu
Hongsheng Li
Shuang Li
Yikang Li
Guosheng Lin
Xiaogang Wang
29
68
0
08 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
28
26
0
03 Aug 2018
Learning Visual Question Answering by Bootstrapping Hard Attention
Learning Visual Question Answering by Bootstrapping Hard Attention
Mateusz Malinowski
Carl Doersch
Adam Santoro
Peter W. Battaglia
OOD
27
96
0
01 Aug 2018
Graph R-CNN for Scene Graph Generation
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
57
836
0
01 Aug 2018
Actor-Centric Relation Network
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
41
220
0
28 Jul 2018
Explainable Neural Computation via Stack Neural Module Networks
Explainable Neural Computation via Stack Neural Module Networks
Ronghang Hu
Jacob Andreas
Trevor Darrell
Kate Saenko
LRM
OCL
30
197
0
23 Jul 2018
Measuring abstract reasoning in neural networks
Measuring abstract reasoning in neural networks
David Barrett
Felix Hill
Adam Santoro
Ari S. Morcos
Timothy Lillicrap
OOD
25
356
0
11 Jul 2018
Deep Structured Generative Models
Deep Structured Generative Models
Kun Xu
Haoyun Liang
Jun Zhu
Hang Su
Bo Zhang
GAN
15
7
0
10 Jul 2018
Geometric Generalization Based Zero-Shot Learning Dataset Infinite
  World: Simple Yet Powerful
Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful
R. Chidambaram
Michael C. Kampffmeyer
Willie Neiswanger
Xiaodan Liang
T. Lachmann
Eric Xing
23
0
0
10 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
27
124
0
09 Jul 2018
An Intriguing Failing of Convolutional Neural Networks and the CoordConv
  Solution
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu
Joel Lehman
Piero Molino
F. Such
Eric Frank
Alexander Sergeev
J. Yosinski
27
884
0
09 Jul 2018
COSMO: Contextualized Scene Modeling with Boltzmann Machines
COSMO: Contextualized Scene Modeling with Boltzmann Machines
Ilker Bozcan
Sinan Kalkan
20
15
0
02 Jul 2018
A New Benchmark and Progress Toward Improved Weakly Supervised Learning
A New Benchmark and Progress Toward Improved Weakly Supervised Learning
Jason Ramapuram
Russ Webb
SSL
14
3
0
30 Jun 2018
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Modularity Matters: Learning Invariant Relational Reasoning Tasks
Jason Jo
Vikas Verma
Yoshua Bengio
OOD
9
8
0
18 Jun 2018
Grounded Textual Entailment
Grounded Textual Entailment
H. Vu
Claudio Greco
A. Erofeeva
Somayeh Jafaritazehjan
Guido M. Linders
Marc Tanti
A. Testoni
Raffaella Bernardi
Albert Gatt
19
29
0
14 Jun 2018
FigureNet: A Deep Learning model for Question-Answering on Scientific
  Plots
FigureNet: A Deep Learning model for Question-Answering on Scientific Plots
Revanth Reddy Gangi Reddy
Rahul Ramesh
Ameet Deshpande
Mitesh M. Khapra
AIMat
OOD
GNN
35
22
0
12 Jun 2018
Interactive Visual Grounding of Referring Expressions for Human-Robot
  Interaction
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
22
143
0
11 Jun 2018
Cross-Dataset Adaptation for Visual Question Answering
Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao
Hexiang Hu
Fei Sha
OOD
27
49
0
10 Jun 2018
Visual Reasoning by Progressive Module Networks
Visual Reasoning by Progressive Module Networks
Seung Wook Kim
Makarand Tapaswi
Sanja Fidler
ReLM
LRM
36
13
0
06 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li-Jia Li
Alexander G. Hauptmann
33
110
0
05 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wei Liu
Syed Zulqarnain Gilani
Mubarak Shah
11
91
0
01 Jun 2018
Think Visually: Question Answering through Virtual Imagery
Think Visually: Question Answering through Virtual Imagery
Ankit Goyal
Jian Wang
Jia Deng
27
2
0
25 May 2018
Previous
123...27282930
Next