v1v2 (latest)

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown

How stable are Transferability Metrics evaluations?European Conference on Computer Vision (ECCV), 2022

377

04 Apr 2022

Data Cards: Purposeful and Transparent Dataset Documentation for Responsible AIConference on Fairness, Accountability and Transparency (FAccT), 2022

241

266

03 Apr 2022

Socratic Models: Composing Zero-Shot Multimodal Reasoning with LanguageInternational Conference on Learning Representations (ICLR), 2022

...

594

681

01 Apr 2022

GALA: Toward Geometry-and-Lighting-Aware Object Search for CompositingEuropean Conference on Computer Vision (ECCV), 2022

Zhifei Zhang

138

31 Mar 2022

Acknowledging the Unknown for Multi-label Learning with Single Positive LabelsEuropean Conference on Computer Vision (ECCV), 2022

Pheng-Ann Heng

145

30 Mar 2022

Learning Program Representations for Food Images and Cooking RecipesComputer Vision and Pattern Recognition (CVPR), 2022

Antonio Torralba

149

30 Mar 2022

Image Retrieval from Contextual DescriptionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Siva Reddy

256

29 Mar 2022

Towards End-to-End Unified Scene Text Detection and Layout AnalysisComputer Vision and Pattern Recognition (CVPR), 2022

Yasuhisa Fujii

262

114

28 Mar 2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Li Zhang

263

24 Mar 2022

A Real World Dataset for Multi-view 3D ReconstructionEuropean Conference on Computer Vision (ECCV), 2022

221

22 Mar 2022

UNIMO-2: End-to-End Unified Vision-Language Grounded LearningFindings (Findings), 2022

147

17 Mar 2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine SynergyInternational Journal of Computer Vision (IJCV), 2022

Yu Qiao

Ziwei Liu

289

15 Mar 2022

SuperAnimal pretrained pose estimation models for behavioral analysisNature Communications (Nat Commun), 2022

342

14 Mar 2022

CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual EntailmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

218

158

14 Mar 2022

Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label AnnotationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Thomas Verelst

Paul Kishan Rubenstein

M. Eichner

Tinne Tuytelaars

Maxim Berman

146

11 Mar 2022

Peng Cheng Object Detection Benchmark for Smart City

Yaowei Wang

11 Mar 2022

Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding

447

11 Mar 2022

Weakly Supervised Semantic Segmentation using Out-of-Distribution DataComputer Vision and Pattern Recognition (CVPR), 2022

1.1K

111

08 Mar 2022

Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and Semantic AttentionIEEE transactions on multimedia (IEEE TMM), 2022

209

07 Mar 2022

Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept RecognitionIEEE transactions on multimedia (IEEE TMM), 2022

Yaowei Wang

216

07 Mar 2022

Attribute Descent: Simulating Object-Centric Datasets on the Content Level and BeyondIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yue Yao

Liang Zheng

Xiaodong Yang

Milind Napthade

Tom Gedeon

214

28 Feb 2022

Optical flow-based branch segmentation for complex orchard environmentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022

A. You

C. Grimm

J. Davidson

102

26 Feb 2022

Speciesist bias in AI -- How AI applications perpetuate discrimination and unfair outcomes against animalsAI and Ethics (AE), 2022

212

22 Feb 2022

Privacy Preserving Visual Question Answering

Govind Thattai

190

15 Feb 2022

Fairness Indicators for Systematic Assessments of Visual Feature ExtractorsConference on Fairness, Accountability and Transparency (FAccT), 2022

Priya Goyal

Adriana Romero Soriano

219

15 Feb 2022

Using Social Media Images for Building Function ClassificationCities (Cities), 2022

E. J. Hoffmann

Karam Abdulahhad

Xiao Xiang Zhu

144

15 Feb 2022

Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?Conference on Fairness, Accountability and Transparency (FAccT), 2022

P. Schramowski

Christopher Tauchmann

Kristian Kersting

FaML

362

147

14 Feb 2022

Object-Guided Day-Night Visual Localization in Urban ScenesInternational Conference on Pattern Recognition (ICPR), 2022

Assia Benbihi

C´edric Pradalier

Ondřej Chum

176

09 Feb 2022

Recent Trends in 2D Object Detection and Applications in Video Event Recognition

Prithwish Jana

Partha Pratim Mohanta

175

07 Feb 2022

OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkInternational Conference on Machine Learning (ICML), 2022

Peng Wang

An Yang

Rui Men

Junyang Lin

Shuai Bai

Zhikang Li

Jianxin Ma

Chang Zhou

Jingren Zhou

Hongxia Yang

MLLM ObjD

521

1,009

07 Feb 2022

Keyword localisation in untranscribed speech using visually grounded speech modelsIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Kayode Olaleye

Dan Oneaţă

Herman Kamper

196

02 Feb 2022

Deep Learning Approaches on Image Captioning: A ReviewACM Computing Surveys (ACM CSUR), 2022

480

154

31 Jan 2022

MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage LearningACM Multimedia (ACM MM), 2022

Xuanjing Huang

239

29 Jan 2022

RelTR: Relation Transformer for Scene Graph GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

464

181

27 Jan 2022

CrossRectify: Leveraging Disagreement for Semi-supervised Object DetectionPattern Recognition (Pattern Recogn.), 2022

214

26 Jan 2022

Visual Identification of Problematic Bias in Large Label Spaces

Timo Ropinski

151

17 Jan 2022

CLIP-Event: Connecting Text and Images with Event StructuresComputer Vision and Pattern Recognition (CVPR), 2022

Heng Ji

170

145

13 Jan 2022

SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive MiningIEEE International Conference on Computer Vision (ICCV), 2022

331

12 Jan 2022

Detecting Twenty-thousand Classes using Image-level SupervisionEuropean Conference on Computer Vision (ECCV), 2022

489

755

07 Jan 2022

Equalized Focal Loss for Dense Long-Tailed Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022

246

119

07 Jan 2022

Scene Graph Generation: A Comprehensive SurveyNeurocomputing (Neurocomputing), 2022

...

444

131

03 Jan 2022

LaTr: Layout-Aware Transformer for Scene-Text VQAComputer Vision and Pattern Recognition (CVPR), 2021

378

116

23 Dec 2021

Few-Shot Object Detection: A Comprehensive SurveyIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

243

22 Dec 2021

HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

334

16 Dec 2021

Reliable Multi-Object Tracking in the Presence of Unreliable Detections

171

15 Dec 2021

CPPE-5: Medical Personal Protective Equipment Dataset

Rishit Dagli

A. Shaikh

273

15 Dec 2021

Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

Xinyu Huang

Rui Feng

160

13 Dec 2021

Holistic Interpretation of Public Scenes Using Computer Vision and Temporal Graphs to Identify Social Distancing Violations

Gihan Chanaka Jayatilaka

419

13 Dec 2021

Injecting Semantic Concepts into End-to-End Image Captioning

Xiaowei Hu

Yezhou Yang

Zicheng Liu

ViT VLM

240

120

09 Dec 2021

Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal CharacterizationThe Web Conference (WWW), 2021

200

05 Dec 2021