ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.06890
  4. Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
    CoGe
ArXivPDFHTML

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,480 papers shown
Title
Learning to Recombine and Resample Data for Compositional Generalization
Learning to Recombine and Resample Data for Compositional Generalization
Ekin Akyürek
Afra Feyza Akyürek
Jacob Andreas
34
79
0
08 Oct 2020
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Ramakrishna Vedantam
Arthur Szlam
Maximilian Nickel
Ari S. Morcos
Brenden M. Lake
UQLM
LRM
32
26
0
06 Oct 2020
Pathological Visual Question Answering
Pathological Visual Question Answering
Xuehai He
Zhuo Cai
Wenlan Wei
Yichen Zhang
Luntian Mou
Eric Xing
P. Xie
88
24
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
70
217
0
06 Oct 2020
Meta-Learning of Structured Task Distributions in Humans and Machines
Meta-Learning of Structured Task Distributions in Humans and Machines
Sreejan Kumar
Ishita Dasgupta
Jonathan Cohen
Nathaniel D. Daw
Thomas Griffiths
OffRL
27
3
0
05 Oct 2020
Improving Generative Imagination in Object-Centric World Models
Improving Generative Imagination in Object-Centric World Models
Zhixuan Lin
Yi-Fu Wu
Skand Peri
Bofeng Fu
Jindong Jiang
Sungjin Ahn
OCL
27
80
0
05 Oct 2020
Attention Guided Semantic Relationship Parsing for Visual Question
  Answering
Attention Guided Semantic Relationship Parsing for Visual Question Answering
M. Farazi
Salman Khan
Nick Barnes
19
2
0
05 Oct 2020
Static and Animated 3D Scene Generation from Free-form Text Descriptions
Static and Animated 3D Scene Generation from Free-form Text Descriptions
Faria Huq
Nafees Ahmed
Anindya Iqbal
3DV
21
1
0
04 Oct 2020
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of
  Objects using only Nouns
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns
L. Ferreira
Douglas De Rizzo Meneghetti
P. Santos
14
2
0
02 Oct 2020
Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and
  Reasoning
Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning
Weili Nie
Zhiding Yu
Lei Mao
Ankit B. Patel
Yuke Zhu
Anima Anandkumar
VLM
LRM
28
75
0
02 Oct 2020
RefVOS: A Closer Look at Referring Expressions for Video Object
  Segmentation
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
Míriam Bellver
Carles Ventura
Carina Silberer
Ioannis V. Kazakos
Jordi Torres
Xavier Giró-i-Nieto
VOS
34
32
0
01 Oct 2020
Graph-based Heuristic Search for Module Selection Procedure in Neural
  Module Network
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network
Yuxuan Wu
Hideki Nakayama
GNN
25
3
0
30 Sep 2020
Think before you act: A simple baseline for compositional generalization
Think before you act: A simple baseline for compositional generalization
C. Heinze-Deml
Diane Bouchacourt
CoGe
30
16
0
29 Sep 2020
Extending Answer Set Programs with Neural Networks
Extending Answer Set Programs with Neural Networks
Zhun Yang
ReLM
NAI
LRM
25
0
0
22 Sep 2020
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language
  Grounded Image Scenes
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes
Raeid Saqur
Ameet Deshpande
GNN
NAI
13
0
0
19 Sep 2020
ShapeAssembly: Learning to Generate Programs for 3D Shape Structure
  Synthesis
ShapeAssembly: Learning to Generate Programs for 3D Shape Structure Synthesis
R. K. Jones
Theresa Barton
Xianghao Xu
Kai Wang
Ellen Jiang
Paul Guerrero
Niloy J. Mitra
Daniel E. Ritchie
36
31
0
17 Sep 2020
Synbols: Probing Learning Algorithms with Synthetic Datasets
Synbols: Probing Learning Algorithms with Synthetic Datasets
Alexandre Lacoste
Pau Rodríguez
Frederic Branchaud-Charron
Parmida Atighehchian
Massimo Caccia
I. Laradji
Alexandre Drouin
Matt Craddock
Laurent Charlin
David Vázquez
33
14
0
14 Sep 2020
Span-based Semantic Parsing for Compositional Generalization
Span-based Semantic Parsing for Compositional Generalization
Jonathan Herzig
Jonathan Berant
ReLM
LRM
25
101
0
13 Sep 2020
A Dataset and Baselines for Visual Question Answering on Art
A Dataset and Baselines for Visual Question Answering on Art
Noa Garcia
Chentao Ye
Zihua Liu
Qingtao Hu
Mayu Otani
Chenhui Chu
Yuta Nakashima
Teruko Mitamura
CoGe
16
52
0
28 Aug 2020
Visual Question Answering on Image Sets
Visual Question Answering on Image Sets
Ankan Bansal
Yuting Zhang
Rama Chellappa
CoGe
22
40
0
27 Aug 2020
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
William S. Peebles
John Peebles
Jun-Yan Zhu
Alexei A. Efros
Antonio Torralba
45
115
0
24 Aug 2020
INSIDE: Steering Spatial Attention with Non-Imaging Information in CNNs
INSIDE: Steering Spatial Attention with Non-Imaging Information in CNNs
Grzegorz Jacenków
Alison Q. OÑeil
Brian Mohr
Sotirios A. Tsaftaris
22
9
0
21 Aug 2020
AutoSimulate: (Quickly) Learning Synthetic Data Generation
AutoSimulate: (Quickly) Learning Synthetic Data Generation
Harkirat Singh Behl
A. G. Baydin
Ran Gal
Philip Torr
Vibhav Vineet
16
23
0
16 Aug 2020
Compositional Generalization via Neural-Symbolic Stack Machines
Compositional Generalization via Neural-Symbolic Stack Machines
Xinyun Chen
Chen Liang
Adams Wei Yu
D. Song
Denny Zhou
BDL
19
99
0
15 Aug 2020
Graph Edit Distance Reward: Learning to Edit Scene Graph
Graph Edit Distance Reward: Learning to Edit Scene Graph
Lichang Chen
Guosheng Lin
Shijie Wang
Qingyao Wu
11
18
0
15 Aug 2020
Text as Neural Operator: Image Manipulation by Text Instruction
Text as Neural Operator: Image Manipulation by Text Instruction
Tianhao Zhang
Hung-Yu Tseng
Lu Jiang
Weilong Yang
Honglak Lee
Irfan Essa
DiffM
26
40
0
11 Aug 2020
A Neural-Symbolic Framework for Mental Simulation
A Neural-Symbolic Framework for Mental Simulation
Michael D Kissner
22
0
0
05 Aug 2020
Word meaning in minds and machines
Word meaning in minds and machines
Brenden M. Lake
G. Murphy
NAI
26
117
0
04 Aug 2020
Presentation and Analysis of a Multimodal Dataset for Grounded Language
  Learning
Presentation and Analysis of a Multimodal Dataset for Grounded Language Learning
Patrick Jenkins
Rishabh Sachdeva
Gaoussou Youssouf Kebe
Padraig Higgins
Kasra Darvish
Edward Raff
Don Engel
J. Winder
Francis Ferraro
Cynthia Matuszek
20
5
0
29 Jul 2020
Towards Ecologically Valid Research on Language User Interfaces
Towards Ecologically Valid Research on Language User Interfaces
H. D. Vries
Dzmitry Bahdanau
Christopher D. Manning
215
51
0
28 Jul 2020
AiR: Attention with Reasoning Capability
AiR: Attention with Reasoning Capability
Shi Chen
Ming Jiang
Jinhui Yang
Qi Zhao
LRM
13
36
0
28 Jul 2020
REXUP: I REason, I EXtract, I UPdate with Structured Compositional
  Reasoning for Visual Question Answering
REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering
Siwen Luo
S. Han
Kaiyuan Sun
Josiah Poon
CoGe
LRM
ReLM
26
4
0
27 Jul 2020
Knowledge Graph Extraction from Videos
Knowledge Graph Extraction from Videos
Louis Mahon
Eleonora Giunchiglia
Bowen Li
Thomas Lukasiewicz
19
19
0
20 Jul 2020
Understanding Spatial Relations through Multiple Modalities
Understanding Spatial Relations through Multiple Modalities
Soham Dan
Hangfeng He
Dan Roth
16
6
0
19 Jul 2020
Compositional Generalization in Semantic Parsing: Pre-training vs.
  Specialized Architectures
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures
Daniel Furrer
Marc van Zee
Nathan Scales
Nathanael Scharli
CoGe
31
113
0
17 Jul 2020
Knowledge-Based Video Question Answering with Unsupervised Scene
  Descriptions
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions
Noa Garcia
Yuta Nakashima
46
32
0
17 Jul 2020
Detecting Human-Object Interactions with Action Co-occurrence Priors
Detecting Human-Object Interactions with Action Co-occurrence Priors
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
32
124
0
17 Jul 2020
On Robustness and Transferability of Convolutional Neural Networks
On Robustness and Transferability of Convolutional Neural Networks
Josip Djolonga
Jessica Yung
Michael Tschannen
Rob Romijnders
Lucas Beyer
...
D. Moldovan
Sylvain Gelly
N. Houlsby
Xiaohua Zhai
Mario Lucic
OOD
18
154
0
16 Jul 2020
Deep Learning for Abstract Argumentation Semantics
Deep Learning for Abstract Argumentation Semantics
Dennis Craandijk
Floris Bex
SSeg
24
30
0
15 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving
Learning Reasoning Strategies in End-to-End Differentiable Proving
Pasquale Minervini
Sebastian Riedel
Pontus Stenetorp
Edward Grefenstette
Tim Rocktaschel
LRM
50
96
0
13 Jul 2020
Generative Compositional Augmentations for Scene Graph Prediction
Generative Compositional Augmentations for Scene Graph Prediction
Boris Knyazev
H. D. Vries
Cătălina Cangea
Graham W. Taylor
Aaron Courville
Eugene Belilovsky
23
25
0
11 Jul 2020
INT: An Inequality Benchmark for Evaluating Generalization in Theorem
  Proving
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
Yuhuai Wu
Albert Qiaochu Jiang
Jimmy Ba
Roger C. Grosse
AIMat
26
55
0
06 Jul 2020
A Competence-aware Curriculum for Visual Concepts Learning via Question
  Answering
A Competence-aware Curriculum for Visual Concepts Learning via Question Answering
Qing Li
Siyuan Huang
Yining Hong
Song-Chun Zhu
28
29
0
03 Jul 2020
RELATE: Physically Plausible Multi-Object Scene Synthesis Using
  Structured Latent Spaces
RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces
Sébastien Ehrhardt
Oliver Groth
Áron Monszpart
Martin Engelcke
Ingmar Posner
Niloy Mitra
Andrea Vedaldi
3DPC
21
54
0
02 Jul 2020
Scene Graph Reasoning for Visual Question Answering
Scene Graph Reasoning for Visual Question Answering
Marcel Hildebrandt
Hang Li
Rajat Koner
Volker Tresp
Stephan Günnemann
GNN
22
61
0
02 Jul 2020
DocVQA: A Dataset for VQA on Document Images
DocVQA: A Dataset for VQA on Document Images
Minesh Mathew
Dimosthenis Karatzas
C. V. Jawahar
40
675
0
01 Jul 2020
Latent Compositional Representations Improve Systematic Generalization
  in Grounded Question Answering
Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
Ben Bogin
Sanjay Subramanian
Matt Gardner
Jonathan Berant
ReLM
OOD
BDL
LRM
22
28
0
01 Jul 2020
Conditional Set Generation with Transformers
Conditional Set Generation with Transformers
Adam R. Kosiorek
Hyunjik Kim
Danilo Jimenez Rezende
29
40
0
26 Jun 2020
Object-Centric Learning with Slot Attention
Object-Centric Learning with Slot Attention
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
G. Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
OCL
41
824
0
26 Jun 2020
A causal view of compositional zero-shot recognition
A causal view of compositional zero-shot recognition
Yuval Atzmon
Felix Kreuk
Uri Shalit
Gal Chechik
OCL
BDL
CML
61
118
0
25 Jun 2020
Previous
123...212223...282930
Next