Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.06890
Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"
50 / 1,475 papers shown
Title
Learning Geometry-aware Representations by Sketching
Hyun-Dong Lee
Inwoo Hwang
Hyun-Young Go
Won-Seok Choi
Kibeom Kim
Byoung-Tak Zhang
SSL
33
3
0
17 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Joey Tianyi Zhou
EGVM
16
5
0
13 Apr 2023
PDFVQA: A New Dataset for Real-World VQA on PDF Documents
Yihao Ding
Siwen Luo
Hyunsuk Chung
S. Han
33
17
0
13 Apr 2023
A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Florian Bordes
Samuel Lavoie
Randall Balestriero
Nicolas Ballas
Pascal Vincent
SSL
45
5
0
11 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
31
30
0
10 Apr 2023
Probing Conceptual Understanding of Large Visual-Language Models
Madeline Chantry Schiappa
Raiyaan Abdullah
Shehreen Azad
Jared Claypoole
Michael Cogswell
Ajay Divakaran
Yogesh S Rawat
58
14
0
07 Apr 2023
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
35
60
0
06 Apr 2023
Datamator: An Intelligent Authoring Tool for Creating Datamations via Data Query Decomposition
Yi Guo
Nana Cao
Ligan Cai
Yanqiu Wu
Daniel Weiskopf
Danqing Shi
Qing Chen
30
1
0
06 Apr 2023
ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Jingdong Sun
Teruko Mitamura
Alexander G. Hauptmann
31
21
0
05 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas J. Guibas
Yin Zhou
Drago Anguelov
3DV
32
20
0
04 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
59
496
0
03 Apr 2023
Grounding Object Relations in Language-Conditioned Robotic Manipulation with Semantic-Spatial Reasoning
Qian Luo
Yunfei Li
Yi Wu
LM&Ro
48
5
0
31 Mar 2023
Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning
Jinwoo Kim
Janghyuk Choi
Ho-Jin Choi
Seon Joo Kim
OCL
VLM
29
14
0
31 Mar 2023
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning
Zheyuan Liu
Weixuan Sun
Yicong Hong
Damien Teney
Stephen Gould
40
30
0
29 Mar 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
38
107
0
27 Mar 2023
Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification
Chunpu Xu
Jing Li
VLM
26
5
0
27 Mar 2023
Curriculum Learning for Compositional Visual Reasoning
Wafa Aissa
Marin Ferecatu
M. Crucianu
LRM
36
3
0
27 Mar 2023
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh
Hyeji Hwang
Hee-young Lee
Yongtaek Lim
Geunyoung Jung
Jiyoung Jung
Hosik Choi
Kyungwoo Song
VLM
VPVLM
85
57
0
26 Mar 2023
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
30
58
0
26 Mar 2023
BlobGAN-3D: A Spatially-Disentangled 3D-Aware Generative Model for Indoor Scenes
Qian Wang
Yiqun Wang
Michael Birsak
Peter Wonka
73
7
0
26 Mar 2023
UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields
Yuanbo Yang
Yifei Yang
Hanlei Guo
R. Xiong
Yue Wang
Yiyi Liao
36
18
0
24 Mar 2023
First Session Adaptation: A Strong Replay-Free Baseline for Class-Incremental Learning
A. Panos
Yuriko Kobe
Daniel Olmeda Reino
Rahaf Aljundi
Richard Turner
CLL
106
42
0
23 Mar 2023
CC3D: Layout-Conditioned Generation of Compositional 3D Scenes
Sherwin Bahmani
Jeong Joon Park
Despoina Paschalidou
Xingguang Yan
Gordon Wetzstein
Leonidas J. Guibas
Andrea Tagliasacchi
3DV
52
44
0
21 Mar 2023
3D Concept Learning and Reasoning from Multi-View Images
Yining Hong
Chun-Tse Lin
Yilun Du
Zhenfang Chen
J. Tenenbaum
Chuang Gan
3DV
32
52
0
20 Mar 2023
Object-Centric Slot Diffusion
Jindong Jiang
Fei Deng
Gautam Singh
S. Ahn
DiffM
BDL
OCL
30
57
0
20 Mar 2023
Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning
Shi Chen
Qi Zhao
47
5
0
18 Mar 2023
Logical Implications for Visual Question Answering Consistency
Sergio Tascon-Morales
Pablo Márquez-Neila
Raphael Sznitman
23
9
0
16 Mar 2023
Identifiability Results for Multimodal Contrastive Learning
Imant Daunhawer
Alice Bizeul
Emanuele Palumbo
Alexander Marx
Julia E. Vogt
39
39
0
16 Mar 2023
Towards Commonsense Knowledge based Fuzzy Systems for Supporting Size-Related Fine-Grained Object Detection
Pufen Zhang
Tianhua Chen
Bing-Quan Liu
ObjD
13
1
0
16 Mar 2023
Investigating GANsformer: A Replication Study of a State-of-the-Art Image Generation Model
Giorgia Adorni
Felix Boelter
Stefano Carlo Lambertenghi
24
1
0
15 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
29
5
0
14 Mar 2023
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation
Qihao Liu
Adam Kortylewski
Alan Yuille
OODD
48
12
0
13 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
31
1
0
13 Mar 2023
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
30
0
0
10 Mar 2023
Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE
Yucheng Xu
Nanbo Li
A. Goel
Zijian Guo
Zonghai Yao
Hamidreza Kasaei
Mohammad-Sajad Kasaei
Zhibin Li
47
5
0
09 Mar 2023
Probabilistic 3d regression with projected huber distribution
David Mohlin
Josephine Sullivan
27
0
0
09 Mar 2023
Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection
M. Oldenhof
Adam Arany
Yves Moreau
E. Brouwer
21
3
0
09 Mar 2023
Toward Unsupervised Realistic Visual Question Answering
Yuwei Zhang
Chih-Hui Ho
Nuno Vasconcelos
CoGe
24
2
0
09 Mar 2023
Learning Exploration Strategies to Solve Real-World Marble Runs
Alisa Allaire
C. Atkeson
34
0
0
08 Mar 2023
Learning to reason over visual objects
S. S. Mondal
Taylor Webb
Jonathan D. Cohen
OCL
33
29
0
03 Mar 2023
Towards Democratizing Joint-Embedding Self-Supervised Learning
Florian Bordes
Randall Balestriero
Pascal Vincent
30
20
0
03 Mar 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
94
11
0
03 Mar 2023
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
23
0
0
02 Mar 2023
Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue
Holy Lovenia
Samuel Cahyawijaya
Pascale Fung
26
1
0
28 Feb 2023
Sequential Query Encoding For Complex Query Answering on Knowledge Graphs
Jiaxin Bai
Tianshi Zheng
Yangqiu Song
26
13
0
25 Feb 2023
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework
Paul Pu Liang
Yun Cheng
Xiang Fan
Chun Kai Ling
Suzanne Nie
...
Nicholas B. Allen
Randy P. Auerbach
Faisal Mahmood
Ruslan Salakhutdinov
Louis-Philippe Morency
43
32
0
23 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
49
4
0
23 Feb 2023
MVTrans: Multi-View Perception of Transparent Objects
Yi Ru Wang
Yuchi Zhao
Haoping Xu
Saggi Eppel
Alán Aspuru-Guzik
Florian Shkurti
Animesh Garg
34
19
0
22 Feb 2023
Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC
Yilun Du
Conor Durkan
Robin Strudel
J. Tenenbaum
Sander Dieleman
Rob Fergus
Jascha Narain Sohl-Dickstein
Arnaud Doucet
Will Grathwohl
DiffM
32
133
0
22 Feb 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
Lianghua Huang
Di Chen
Yu Liu
Yujun Shen
Deli Zhao
Jingren Zhou
DiffM
22
279
0
20 Feb 2023
Previous
1
2
3
...
10
11
12
...
28
29
30
Next