1811.00982
Cited By
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
How stable are Transferability Metrics evaluations?
European Conference on Computer Vision (ECCV), 2022
A. Agostinelli
Michal Pándy
J. Uijlings
Thomas Mensink
V. Ferrari
377
32
0
04 Apr 2022
Data Cards: Purposeful and Transparent Dataset Documentation for Responsible AI
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Mahima Pushkarna
Andrew Zaldivar
Oddur Kjartansson
AI4TS
241
266
0
03 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
International Conference on Learning Representations (ICLR), 2022
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
594
681
0
01 Apr 2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
European Conference on Computer Vision (ECCV), 2022
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
138
7
0
31 Mar 2022
Acknowledging the Unknown for Multi-label Learning with Single Positive Labels
European Conference on Computer Vision (ECCV), 2022
Donghao Zhou
Pengfei Chen
Qiong Wang
Guangyong Chen
Pheng-Ann Heng
145
43
0
30 Mar 2022
Learning Program Representations for Food Images and Cooking Recipes
Computer Vision and Pattern Recognition (CVPR), 2022
Dim P. Papadopoulos
Enrique Mora
Nadiia Chepurko
Kuan-Wei Huang
Ferda Ofli
Antonio Torralba
149
42
0
30 Mar 2022
Image Retrieval from Contextual Descriptions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Benno Krojer
Vaibhav Adlakha
Vibhav Vineet
Yash Goyal
Edoardo Ponti
Siva Reddy
256
38
0
29 Mar 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Computer Vision and Pattern Recognition (CVPR), 2022
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
262
114
0
28 Mar 2022
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Likun Cai
Zhi-Li Zhang
Yi Zhu
Li Zhang
Mu Li
Xiangyang Xue
VLM
ObjD
263
46
0
24 Mar 2022
A Real World Dataset for Multi-view 3D Reconstruction
European Conference on Computer Vision (ECCV), 2022
Rakesh Shrestha
Siqi Hu
Minghao Gou
Ziyuan Liu
P. Tan
3DH
3DV
221
15
0
22 Mar 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
Findings of the Association for Computational Linguistics (Findings of ACL), 2022
Wei Li
Can Gao
Guocheng Niu
Xinyan Xiao
Hao Liu
Jiachen Liu
Hua Wu
Haifeng Wang
MLLM
147
24
0
17 Mar 2022
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
International Journal of Computer Vision (IJCV), 2022
Yuanhan Zhang
Qi Sun
Yichun Zhou
Zexin He
Zhen-fei Yin
Kunze Wang
Lu Sheng
Yu Qiao
Jing Shao
Ziwei Liu
ObjD
VLM
289
22
0
15 Mar 2022
SuperAnimal pretrained pose estimation models for behavioral analysis
Nature Communications (Nat Commun), 2022
Shaokai Ye
Anastasiia Filippova
Jessy Lauer
Steffen Schneider
Maxime Vidal
Tian Qiu
Alexander Mathis
Mackenzie W. Mathis
342
72
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLM
CLIP
218
158
0
14 Mar 2022
Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Thomas Verelst
Paul Kishan Rubenstein
M. Eichner
Tinne Tuytelaars
Maxim Berman
146
26
0
11 Mar 2022
Peng Cheng Object Detection Benchmark for Smart City
Yaowei Wang
Zhouxin Yang
R. Liu
Deng Li
Yuandu Lai
Leyuan Fang
Yahong Han
ObjD
3DPC
87
1
0
11 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
447
11
0
11 Mar 2022
Weakly Supervised Semantic Segmentation using Out-of-Distribution Data
Computer Vision and Pattern Recognition (CVPR), 2022
Jungbeom Lee
Seong Joon Oh
Sangdoo Yun
Junsuk Choe
Eunji Kim
Sung-Hoon Yoon
WSOL
OOD
1.1K
111
0
08 Mar 2022
Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and Semantic Attention
IEEE transactions on multimedia (IEEE TMM), 2022
Ziming Liu
Song Guo
Jingcai Guo
Yuanyuan Xu
Fushuo Huo
209
26
0
07 Mar 2022
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition
IEEE transactions on multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Yong Luo
Zhenglong Sun
Wei-Shi Zheng
Yaowei Wang
Chen Chen
216
15
0
07 Mar 2022
Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yue Yao
Liang Zheng
Xiaodong Yang
Milind Napthade
Tom Gedeon
214
18
0
28 Feb 2022
Optical flow-based branch segmentation for complex orchard environments
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
A. You
C. Grimm
J. Davidson
102
14
0
26 Feb 2022
Speciesist bias in AI -- How AI applications perpetuate discrimination and unfair outcomes against animals
AI and Ethics (AE), 2022
Thilo Hagendorff
L. Bossert
Yip Fai Tse
P. Singer
FaML
212
56
0
22 Feb 2022
Privacy Preserving Visual Question Answering
Cristian-Paul Bara
Q. Ping
Abhinav Mathur
Govind Thattai
M. Rohith
Gaurav Sukhatme
190
1
0
15 Feb 2022
Fairness Indicators for Systematic Assessments of Visual Feature Extractors
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Priya Goyal
Adriana Romero Soriano
C. Hazirbas
Levent Sagun
Nicolas Usunier
EGVM
219
35
0
15 Feb 2022
Using Social Media Images for Building Function Classification
Cities (Cities), 2022
E. J. Hoffmann
Karam Abdulahhad
Xiao Xiang Zhu
144
37
0
15 Feb 2022
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?
Conference on Fairness, Accountability and Transparency (FAccT), 2022
P. Schramowski
Christopher Tauchmann
Kristian Kersting
FaML
362
147
0
14 Feb 2022
Object-Guided Day-Night Visual Localization in Urban Scenes
International Conference on Pattern Recognition (ICPR), 2022
Assia Benbihi
Cédric Pradalier
Ondřej Chum
176
4
0
09 Feb 2022
Recent Trends in 2D Object Detection and Applications in Video Event Recognition
Prithwish Jana
Partha Pratim Mohanta
175
1
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
International Conference on Machine Learning (ICML), 2022
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
521
1,009
0
07 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
196
7
0
02 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
ACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
480
154
0
31 Jan 2022
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning
ACM Multimedia (ACM MM), 2022
Zejun Li
Zhihao Fan
Huaixiao Tou
Jingjing Chen
Zhongyu Wei
Xuanjing Huang
239
23
0
29 Jan 2022
RelTR: Relation Transformer for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuren Cong
M. Yang
Bodo Rosenhahn
ViT
464
181
0
27 Jan 2022
CrossRectify: Leveraging Disagreement for Semi-supervised Object Detection
Pattern Recognition (Pattern Recogn.), 2022
Cheng Ma
Xingjia Pan
QiXiang Ye
Fan Tang
Weiming Dong
Changsheng Xu
214
16
0
26 Jan 2022
Visual Identification of Problematic Bias in Large Label Spaces
Alex Bauerle
Aybuke Turker
Ken Burke
Osman Aka
Timo Ropinski
Christina Greer
Mani Varadarajan
151
2
0
17 Jan 2022
CLIP-Event: Connecting Text and Images with Event Structures
Computer Vision and Pattern Recognition (CVPR), 2022
Pengfei Yu
Ruochen Xu
Shuohang Wang
Luowei Zhou
Xudong Lin
Chenguang Zhu
Michael Zeng
Heng Ji
Shih-Fu Chang
VLM
CLIP
170
145
0
13 Jan 2022
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining
IEEE International Conference on Computer Vision (ICCV), 2022
Saksham Suri
Sai Saketh Rambhatla
Rama Chellappa
Abhinav Shrivastava
ObjD
331
15
0
12 Jan 2022
Detecting Twenty-thousand Classes using Image-level Supervision
European Conference on Computer Vision (ECCV), 2022
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIP
VLM
489
755
0
07 Jan 2022
Equalized Focal Loss for Dense Long-Tailed Object Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Yue Liu
Yongqiang Yao
Jingru Tan
Qiang Chen
F. Yu
Jianwei Lu
Ye Luo
246
119
0
07 Jan 2022
Scene Graph Generation: A Comprehensive Survey
Neurocomputing (Neurocomputing), 2022
Guangming Zhu
Liang Zhang
Youliang Jiang
Yixuan Dang
Haoran Hou
...
Mingtao Feng
Xia Zhao
Qiguang Miao
Syed Afaq Ali Shah
Bennamoun
3DV
444
131
0
03 Jan 2022
LaTr: Layout-Aware Transformer for Scene-Text VQA
Computer Vision and Pattern Recognition (CVPR), 2021
Ali Furkan Biten
Ron Litman
Yusheng Xie
Srikar Appalaraju
R. Manmatha
ViT
378
116
0
23 Dec 2021
Few-Shot Object Detection: A Comprehensive Survey
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Mona Köhler
M. Eisenbach
H. Groß
ObjD
243
98
0
22 Dec 2021
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
VOS
334
29
0
16 Dec 2021
Reliable Multi-Object Tracking in the Presence of Unreliable Detections
Travis Mandel
Mark Jimenez
Emily Risley
Taishi Nammoto
Rebekka Williams
Max Panoff
Meynard Ballesteros
Bobbie Suarez
VOT
171
2
0
15 Dec 2021
CPPE-5: Medical Personal Protective Equipment Dataset
Rishit Dagli
A. Shaikh
273
14
0
15 Dec 2021
Simple and Robust Loss Design for Multi-Label Learning with Missing Labels
Youcai Zhang
Y. Cheng
Xinyu Huang
Fei Wen
Rui Feng
Yaqian Li
Yandong Guo
VLM
160
47
0
13 Dec 2021
Holistic Interpretation of Public Scenes Using Computer Vision and Temporal Graphs to Identify Social Distancing Violations
Gihan Chanaka Jayatilaka
Jameel Hassan
Suren Sritharan
J. B. Senanayaka
H. Weligampola
Roshan Godaliyadda
Parakrama Ekanayake
Vijitha Herath
Janaka Ekanayake
S. Dharmaratne
419
6
0
13 Dec 2021
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViT
VLM
240
120
0
09 Dec 2021
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization
The Web Conference (WWW), 2021
Mesut Erhan Unal
Adriana Kovashka
Wen-Ting Chung
Yu-Ru Lin
200
5
0
05 Dec 2021