CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
    CoGe

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,475 papers shown
Learning Geometry-aware Representations by Sketching
Hyun-Dong Lee
Inwoo Hwang
Hyun-Young Go
Won-Seok Choi
Kibeom Kim
Byoung-Tak Zhang
SSL
33
3
0
17 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Joey Tianyi Zhou
EGVM
16
5
0
13 Apr 2023
PDFVQA: A New Dataset for Real-World VQA on PDF Documents
Yihao Ding
Siwen Luo
Hyunsuk Chung
S. Han
33
17
0
13 Apr 2023
A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Florian Bordes
Samuel Lavoie
Randall Balestriero
Nicolas Ballas
Pascal Vincent
SSL
45
5
0
11 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
31
30
0
10 Apr 2023
Probing Conceptual Understanding of Large Visual-Language Models
Madeline Chantry Schiappa
Raiyaan Abdullah
Shehreen Azad
Jared Claypoole
Michael Cogswell
Ajay Divakaran
Yogesh S Rawat
58
14
0
07 Apr 2023
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
35
60
0
06 Apr 2023
Datamator: An Intelligent Authoring Tool for Creating Datamations via Data Query Decomposition
Yi Guo
Nana Cao
Ligan Cai
Yanqiu Wu
Daniel Weiskopf
Danqing Shi
Qing Chen
30
1
0
06 Apr 2023
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Jingdong Sun
Teruko Mitamura
Alexander G. Hauptmann
31
21
0
05 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas J. Guibas
Yin Zhou
Drago Anguelov
3DV
32
20
0
04 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
59
496
0
03 Apr 2023
Grounding Object Relations in Language-Conditioned Robotic Manipulation with Semantic-Spatial Reasoning
Qian Luo
Yunfei Li
Yi Wu
LM&Ro
48
5
0
31 Mar 2023
Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning
Jinwoo Kim
Janghyuk Choi
Ho-Jin Choi
Seon Joo Kim
OCL
VLM
29
14
0
31 Mar 2023
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning
Zheyuan Liu
Weixuan Sun
Yicong Hong
Damien Teney
Stephen Gould
40
30
0
29 Mar 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
38
107
0
27 Mar 2023
Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification
Chunpu Xu
Jing Li
VLM
26
5
0
27 Mar 2023
Curriculum Learning for Compositional Visual Reasoning
Wafa Aissa
Marin Ferecatu
M. Crucianu
LRM
36
3
0
27 Mar 2023
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh
Hyeji Hwang
Hee-young Lee
Yongtaek Lim
Geunyoung Jung
Jiyoung Jung
Hosik Choi
Kyungwoo Song
VLM
VPVLM
85
57
0
26 Mar 2023
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
30
58
0
26 Mar 2023
BlobGAN-3D: A Spatially-Disentangled 3D-Aware Generative Model for Indoor Scenes
Qian Wang
Yiqun Wang
Michael Birsak
Peter Wonka
73
7
0
26 Mar 2023
UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields
Yuanbo Yang
Yifei Yang
Hanlei Guo
R. Xiong
Yue Wang
Yiyi Liao
36
18
0
24 Mar 2023
First Session Adaptation: A Strong Replay-Free Baseline for Class-Incremental Learning
A. Panos
Yuriko Kobe
Daniel Olmeda Reino
Rahaf Aljundi
Richard Turner
CLL
106
42
0
23 Mar 2023
CC3D: Layout-Conditioned Generation of Compositional 3D Scenes
Sherwin Bahmani
Jeong Joon Park
Despoina Paschalidou
Xingguang Yan
Gordon Wetzstein
Leonidas J. Guibas
Andrea Tagliasacchi
3DV
52
44
0
21 Mar 2023
3D Concept Learning and Reasoning from Multi-View Images
Yining Hong
Chun-Tse Lin
Yilun Du
Zhenfang Chen
J. Tenenbaum
Chuang Gan
3DV
32
52
0
20 Mar 2023
Object-Centric Slot Diffusion
Jindong Jiang
Fei Deng
Gautam Singh
S. Ahn
DiffM
BDL
OCL
30
57
0
20 Mar 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
47
5
0
18 Mar 2023
Logical Implications for Visual Question Answering Consistency
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
23
9
0
16 Mar 2023
Identifiability Results for Multimodal Contrastive Learning
Imant Daunhawer
Alice Bizeul
Emanuele Palumbo
Alexander Marx
Julia E. Vogt
39
39
0
16 Mar 2023
Towards Commonsense Knowledge based Fuzzy Systems for Supporting Size-Related Fine-Grained Object Detection
Pufen Zhang
Tianhua Chen
Bing-Quan Liu
ObjD
13
1
0
16 Mar 2023
Investigating GANsformer: A Replication Study of a State-of-the-Art Image Generation Model
Giorgia Adorni
Felix Boelter
Stefano Carlo Lambertenghi
24
1
0
15 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
29
5
0
14 Mar 2023
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation
Qihao Liu
Adam Kortylewski
Alan Yuille
OODD
48
12
0
13 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
31
1
0
13 Mar 2023
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
30
0
0
10 Mar 2023
Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE
Yucheng Xu
Nanbo Li
A. Goel
Zijian Guo
Zonghai Yao
Hamidreza Kasaei
Mohammad-Sajad Kasaei
Zhibin Li
47
5
0
09 Mar 2023
Probabilistic 3d regression with projected huber distribution
David Mohlin
Josephine Sullivan
27
0
0
09 Mar 2023
Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection
M. Oldenhof
Adam Arany
Yves Moreau
E. Brouwer
21
3
0
09 Mar 2023
Toward Unsupervised Realistic Visual Question Answering
Yuwei Zhang
Chih-Hui Ho
Nuno Vasconcelos
CoGe
24
2
0
09 Mar 2023
Learning Exploration Strategies to Solve Real-World Marble Runs
Alisa Allaire
C. Atkeson
34
0
0
08 Mar 2023
Learning to reason over visual objects
S. S. Mondal
Taylor Webb
Jonathan D. Cohen
OCL
33
29
0
03 Mar 2023
Towards Democratizing Joint-Embedding Self-Supervised Learning
Florian Bordes
Randall Balestriero
Pascal Vincent
30
20
0
03 Mar 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
94
11
0
03 Mar 2023
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
23
0
0
02 Mar 2023
Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue
Holy Lovenia
Samuel Cahyawijaya
Pascale Fung
26
1
0
28 Feb 2023
Sequential Query Encoding For Complex Query Answering on Knowledge Graphs
Jiaxin Bai
Tianshi Zheng
Yangqiu Song
26
13
0
25 Feb 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
Paul Pu Liang
Yun Cheng
Xiang Fan
Chun Kai Ling
Suzanne Nie
...
Nicholas B. Allen
Randy P. Auerbach
Faisal Mahmood
Ruslan Salakhutdinov
Louis-Philippe Morency
43
32
0
23 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
49
4
0
23 Feb 2023
MVTrans: Multi-View Perception of Transparent Objects
Yi Ru Wang
Yuchi Zhao
Haoping Xu
Saggi Eppel
Alán Aspuru-Guzik
Florian Shkurti
Animesh Garg
34
19
0
22 Feb 2023
Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC
Yilun Du
Conor Durkan
Robin Strudel
J. Tenenbaum
Sander Dieleman
Rob Fergus
Jascha Narain Sohl-Dickstein
Arnaud Doucet
Will Grathwohl
DiffM
32
133
0
22 Feb 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
Lianghua Huang
Di Chen
Yu Liu
Yujun Shen
Deli Zhao
Jingren Zhou
DiffM
22
279
0
20 Feb 2023