ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00982
  4. Cited By
The Open Images Dataset V4: Unified image classification, object
  detection, and visual relationship detection at scale
v1v2 (latest)

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
    ObjDVLM
ArXiv (abs)PDFHTML

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown
How stable are Transferability Metrics evaluations?
How stable are Transferability Metrics evaluations?European Conference on Computer Vision (ECCV), 2022
A. Agostinelli
Michal Pándy
J. Uijlings
Thomas Mensink
V. Ferrari
377
32
0
04 Apr 2022
Data Cards: Purposeful and Transparent Dataset Documentation for
  Responsible AI
Data Cards: Purposeful and Transparent Dataset Documentation for Responsible AIConference on Fairness, Accountability and Transparency (FAccT), 2022
Mahima Pushkarna
Andrew Zaldivar
Oddur Kjartansson
AI4TS
241
266
0
03 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Socratic Models: Composing Zero-Shot Multimodal Reasoning with LanguageInternational Conference on Learning Representations (ICLR), 2022
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLMLRM
594
681
0
01 Apr 2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
GALA: Toward Geometry-and-Lighting-Aware Object Search for CompositingEuropean Conference on Computer Vision (ECCV), 2022
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
138
7
0
31 Mar 2022
Acknowledging the Unknown for Multi-label Learning with Single Positive
  Labels
Acknowledging the Unknown for Multi-label Learning with Single Positive LabelsEuropean Conference on Computer Vision (ECCV), 2022
Donghao Zhou
Pengfei Chen
Qiong Wang
Guangyong Chen
Pheng-Ann Heng
145
43
0
30 Mar 2022
Learning Program Representations for Food Images and Cooking Recipes
Learning Program Representations for Food Images and Cooking RecipesComputer Vision and Pattern Recognition (CVPR), 2022
Dim P. Papadopoulos
Enrique Mora
Nadiia Chepurko
Kuan-Wei Huang
Ferda Ofli
Antonio Torralba
149
42
0
30 Mar 2022
Image Retrieval from Contextual Descriptions
Image Retrieval from Contextual DescriptionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Benno Krojer
Vaibhav Adlakha
Vibhav Vineet
Yash Goyal
Edoardo Ponti
Siva Reddy
256
38
0
29 Mar 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Towards End-to-End Unified Scene Text Detection and Layout AnalysisComputer Vision and Pattern Recognition (CVPR), 2022
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
262
114
0
28 Mar 2022
BigDetection: A Large-scale Benchmark for Improved Object Detector
  Pre-training
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Likun Cai
Zhi-Li Zhang
Yi Zhu
Li Zhang
Mu Li
Xiangyang Xue
VLMObjD
263
46
0
24 Mar 2022
A Real World Dataset for Multi-view 3D Reconstruction
A Real World Dataset for Multi-view 3D ReconstructionEuropean Conference on Computer Vision (ECCV), 2022
Rakesh Shrestha
Siqi Hu
Minghao Gou
Ziyuan Liu
P. Tan
3DH3DV
221
15
0
22 Mar 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
UNIMO-2: End-to-End Unified Vision-Language Grounded LearningFindings (Findings), 2022
Wei Li
Can Gao
Guocheng Niu
Xinyan Xiao
Hao Liu
Jiachen Liu
Hua Wu
Haifeng Wang
MLLM
147
24
0
17 Mar 2022
Bamboo: Building Mega-Scale Vision Dataset Continually with
  Human-Machine Synergy
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine SynergyInternational Journal of Computer Vision (IJCV), 2022
Yuanhan Zhang
Qi Sun
Yichun Zhou
Zexin He
Zhen-fei Yin
Kunze Wang
Lu Sheng
Yu Qiao
Jing Shao
Ziwei Liu
ObjDVLM
289
22
0
15 Mar 2022
SuperAnimal pretrained pose estimation models for behavioral analysis
SuperAnimal pretrained pose estimation models for behavioral analysisNature Communications (Nat Commun), 2022
Shaokai Ye
Anastasiia Filippova
Jessy Lauer
Steffen Schneider
Maxime Vidal
Tian Qiu
Alexander Mathis
Mackenzie W. Mathis
342
72
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual
  Entailment
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual EntailmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLMCLIP
218
158
0
14 Mar 2022
Spatial Consistency Loss for Training Multi-Label Classifiers from
  Single-Label Annotations
Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label AnnotationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Thomas Verelst
Paul Kishan Rubenstein
M. Eichner
Tinne Tuytelaars
Maxim Berman
146
26
0
11 Mar 2022
Peng Cheng Object Detection Benchmark for Smart City
Peng Cheng Object Detection Benchmark for Smart City
Yaowei Wang
Zhouxin Yang
R. Liu
Deng Li
Yuandu Lai
Leyuan Fang
Yahong Han
ObjD3DPC
87
1
0
11 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story
  Understanding
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
447
11
0
11 Mar 2022
Weakly Supervised Semantic Segmentation using Out-of-Distribution Data
Weakly Supervised Semantic Segmentation using Out-of-Distribution DataComputer Vision and Pattern Recognition (CVPR), 2022
Jungbeom Lee
Seong Joon Oh
Sangdoo Yun
Junsuk Choe
Eunji Kim
Sung-Hoon Yoon
WSOLOOD
1.1K
111
0
08 Mar 2022
Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and
  Semantic Attention
Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and Semantic AttentionIEEE transactions on multimedia (IEEE TMM), 2022
Ziming Liu
Song Guo
Jingcai Guo
Yuanyuan Xu
Fushuo Huo
209
26
0
07 Mar 2022
Unpaired Image Captioning by Image-level Weakly-Supervised Visual
  Concept Recognition
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept RecognitionIEEE transactions on multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Yong Luo
Zhenglong Sun
Wei-Shi Zheng
Yaowei Wang
Chen Chen
216
15
0
07 Mar 2022
Attribute Descent: Simulating Object-Centric Datasets on the Content
  Level and Beyond
Attribute Descent: Simulating Object-Centric Datasets on the Content Level and BeyondIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yue Yao
Liang Zheng
Xiaodong Yang
Milind Napthade
Tom Gedeon
214
18
0
28 Feb 2022
Optical flow-based branch segmentation for complex orchard environments
Optical flow-based branch segmentation for complex orchard environmentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
A. You
C. Grimm
J. Davidson
102
14
0
26 Feb 2022
Speciesist bias in AI -- How AI applications perpetuate discrimination
  and unfair outcomes against animals
Speciesist bias in AI -- How AI applications perpetuate discrimination and unfair outcomes against animalsAI and Ethics (AE), 2022
Thilo Hagendorff
L. Bossert
Yip Fai Tse
P. Singer
FaML
212
56
0
22 Feb 2022
Privacy Preserving Visual Question Answering
Privacy Preserving Visual Question Answering
Cristian-Paul Bara
Q. Ping
Abhinav Mathur
Govind Thattai
M. Rohith
Gaurav Sukhatme
190
1
0
15 Feb 2022
Fairness Indicators for Systematic Assessments of Visual Feature
  Extractors
Fairness Indicators for Systematic Assessments of Visual Feature ExtractorsConference on Fairness, Accountability and Transparency (FAccT), 2022
Priya Goyal
Adriana Romero Soriano
C. Hazirbas
Levent Sagun
Nicolas Usunier
EGVM
219
35
0
15 Feb 2022
Using Social Media Images for Building Function Classification
Using Social Media Images for Building Function ClassificationCities (Cities), 2022
E. J. Hoffmann
Karam Abdulahhad
Xiao Xiang Zhu
144
37
0
15 Feb 2022
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn
  Reflecting on Inappropriate Content?
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?Conference on Fairness, Accountability and Transparency (FAccT), 2022
P. Schramowski
Christopher Tauchmann
Kristian Kersting
FaML
362
147
0
14 Feb 2022
Object-Guided Day-Night Visual Localization in Urban Scenes
Object-Guided Day-Night Visual Localization in Urban ScenesInternational Conference on Pattern Recognition (ICPR), 2022
Assia Benbihi
C´edric Pradalier
Ondřej Chum
176
4
0
09 Feb 2022
Recent Trends in 2D Object Detection and Applications in Video Event
  Recognition
Recent Trends in 2D Object Detection and Applications in Video Event Recognition
Prithwish Jana
Partha Pratim Mohanta
175
1
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
  Sequence-to-Sequence Learning Framework
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkInternational Conference on Machine Learning (ICML), 2022
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLMObjD
521
1,009
0
07 Feb 2022
Keyword localisation in untranscribed speech using visually grounded
  speech models
Keyword localisation in untranscribed speech using visually grounded speech modelsIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
196
7
0
02 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
Deep Learning Approaches on Image Captioning: A ReviewACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
480
154
0
31 Jan 2022
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training
  via Multi-Stage Learning
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage LearningACM Multimedia (ACM MM), 2022
Zejun Li
Zhihao Fan
Huaixiao Tou
Jingjing Chen
Zhongyu Wei
Xuanjing Huang
239
23
0
29 Jan 2022
RelTR: Relation Transformer for Scene Graph Generation
RelTR: Relation Transformer for Scene Graph GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuren Cong
M. Yang
Bodo Rosenhahn
ViT
464
181
0
27 Jan 2022
CrossRectify: Leveraging Disagreement for Semi-supervised Object
  Detection
CrossRectify: Leveraging Disagreement for Semi-supervised Object DetectionPattern Recognition (Pattern Recogn.), 2022
Cheng Ma
Xingjia Pan
QiXiang Ye
Fan Tang
Weiming Dong
Changsheng Xu
214
16
0
26 Jan 2022
Visual Identification of Problematic Bias in Large Label Spaces
Visual Identification of Problematic Bias in Large Label Spaces
Alex Bauerle
Aybuke Turker
Ken Burke
Osman Aka
Timo Ropinski
Christina Greer
Mani Varadarajan
151
2
0
17 Jan 2022
CLIP-Event: Connecting Text and Images with Event Structures
CLIP-Event: Connecting Text and Images with Event StructuresComputer Vision and Pattern Recognition (CVPR), 2022
Pengfei Yu
Ruochen Xu
Shuohang Wang
Luowei Zhou
Xudong Lin
Chenguang Zhu
Michael Zeng
Heng Ji
Shih-Fu Chang
VLMCLIP
170
145
0
13 Jan 2022
SparseDet: Improving Sparsely Annotated Object Detection with
  Pseudo-positive Mining
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive MiningIEEE International Conference on Computer Vision (ICCV), 2022
Saksham Suri
Sai Saketh Rambhatla
Rama Chellappa
Abhinav Shrivastava
ObjD
331
15
0
12 Jan 2022
Detecting Twenty-thousand Classes using Image-level Supervision
Detecting Twenty-thousand Classes using Image-level SupervisionEuropean Conference on Computer Vision (ECCV), 2022
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIPVLM
489
755
0
07 Jan 2022
Equalized Focal Loss for Dense Long-Tailed Object Detection
Equalized Focal Loss for Dense Long-Tailed Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Yue Liu
Yongqiang Yao
Jingru Tan
Qiang Chen
F. Yu
Jianwei Lu
Ye Luo
246
119
0
07 Jan 2022
Scene Graph Generation: A Comprehensive Survey
Scene Graph Generation: A Comprehensive SurveyNeurocomputing (Neurocomputing), 2022
Guangming Zhu
Liang Zhang
Youliang Jiang
Yixuan Dang
Haoran Hou
...
Mingtao Feng
Xia Zhao
Qiguang Miao
Syed Afaq Ali Shah
Bennamoun
3DV
444
131
0
03 Jan 2022
LaTr: Layout-Aware Transformer for Scene-Text VQA
LaTr: Layout-Aware Transformer for Scene-Text VQAComputer Vision and Pattern Recognition (CVPR), 2021
Ali Furkan Biten
Ron Litman
Yusheng Xie
Srikar Appalaraju
R. Manmatha
ViT
378
116
0
23 Dec 2021
Few-Shot Object Detection: A Comprehensive Survey
Few-Shot Object Detection: A Comprehensive SurveyIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Mona Köhler
M. Eisenbach
H. Groß
ObjD
243
98
0
22 Dec 2021
HODOR: High-level Object Descriptors for Object Re-segmentation in Video
  Learned from Static Images
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images
A. Athar
Jonathon Luiten
Alexander Hermans
Deva Ramanan
Bastian Leibe
VOS
334
29
0
16 Dec 2021
Reliable Multi-Object Tracking in the Presence of Unreliable Detections
Reliable Multi-Object Tracking in the Presence of Unreliable Detections
Travis Mandel
Mark Jimenez
Emily Risley
Taishi Nammoto
Rebekka Williams
Max Panoff
Meynard Ballesteros
Bobbie Suarez
VOT
171
2
0
15 Dec 2021
CPPE-5: Medical Personal Protective Equipment Dataset
CPPE-5: Medical Personal Protective Equipment Dataset
Rishit Dagli
A. Shaikh
273
14
0
15 Dec 2021
Simple and Robust Loss Design for Multi-Label Learning with Missing
  Labels
Simple and Robust Loss Design for Multi-Label Learning with Missing Labels
Youcai Zhang
Y. Cheng
Xinyu Huang
Fei Wen
Rui Feng
Yaqian Li
Yandong Guo
VLM
160
47
0
13 Dec 2021
Holistic Interpretation of Public Scenes Using Computer Vision and
  Temporal Graphs to Identify Social Distancing Violations
Holistic Interpretation of Public Scenes Using Computer Vision and Temporal Graphs to Identify Social Distancing Violations
Gihan Chanaka Jayatilaka
Jameel Hassan
Suren Sritharan
J. B. Senanayaka
H. Weligampola
Roshan Godaliyadda
Parakrama Ekanayake
Vijitha Herath
Janaka Ekanayake
S. Dharmaratne
419
6
0
13 Dec 2021
Injecting Semantic Concepts into End-to-End Image Captioning
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViTVLM
240
120
0
09 Dec 2021
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal
  Characterization
Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal CharacterizationThe Web Conference (WWW), 2021
Mesut Erhan Unal
Adriana Kovashka
Wen-Ting Chung
Yu-Ru Lin
200
5
0
05 Dec 2021
Previous
123...789...111213
Next