The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD, VLM

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown
Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning
European Conference on Computer Vision (ECCV), 2022
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
VLM
274
68
0
17 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation
ACM Multimedia (ACM MM), 2022
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
196
30
0
03 Aug 2022
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
British Machine Vision Conference (BMVC), 2022
Jun Wang
M. Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
Joseph Jaja
Larry S. Davis
ViT
221
22
0
03 Aug 2022
Visual Recognition by Request
Computer Vision and Pattern Recognition (CVPR), 2022
Chufeng Tang
Lingxi Xie
Xiaopeng Zhang
Xiaolin Hu
Qi Tian
VLM
232
16
0
28 Jul 2022
Careful What You Wish For: on the Extraction of Adversarially Trained Models
Conference on Privacy, Security and Trust (PST), 2022
Kacem Khaled
Gabriela Nicolescu
F. Magalhães
MIACV, AAML
172
6
0
21 Jul 2022
Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context
European Conference on Computer Vision (ECCV), 2022
Chongyu Liu
Lianwen Jin
Yuliang Liu
Canjie Luo
Bangdong Chen
Fengjun Guo
Kai Ding
156
24
0
21 Jul 2022
On Label Granularity and Object Localization
European Conference on Computer Vision (ECCV), 2022
Elijah Cole
Kimberly Wilber
Grant Van Horn
Xuan S. Yang
Marco Fornoni
Pietro Perona
Serge Belongie
Andrew G. Howard
Oisin Mac Aodha
WSOL
282
15
0
20 Jul 2022
Visual Knowledge Tracing
European Conference on Computer Vision (ECCV), 2022
Neehar Kondapaneni
Pietro Perona
Oisin Mac Aodha
169
1
0
20 Jul 2022
DataPerf: Benchmarks for Data-Centric AI Development
Neural Information Processing Systems (NeurIPS), 2022
Mark Mazumder
Colby R. Banbury
Xiaozhe Yao
Bojan Karlaš
W. G. Rojas
...
Carole-Jean Wu
Cody Coleman
Andrew Y. Ng
Peter Mattson
Vijay Janapa Reddi
VLM
279
129
0
20 Jul 2022
Robust Object Detection With Inaccurate Bounding Boxes
European Conference on Computer Vision (ECCV), 2022
Chengxin Liu
Kewei Wang
Hao Lu
Zhiguo Cao
Ziming Zhang
213
35
0
20 Jul 2022
Cycle Self-Training for Semi-Supervised Object Detection with Distribution Consistency Reweighting
ACM Multimedia (ACM MM), 2022
Hao Liu
Bin Chen
Bo Wang
Chunpeng Wu
Feng Dai
Peng Wu
166
10
0
12 Jul 2022
IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training
ACM Multimedia (ACM MM), 2022
Xinyu Huang
Youcai Zhang
Ying Cheng
Weiwei Tian
Ruiwei Zhao
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Xiao-Yong Zhang
VLM
212
15
0
12 Jul 2022
Scaling Novel Object Detection with Weakly Supervised Detection Transformers
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
T. LaBonte
Ya-heng Song
Xin Eric Wang
Vibhav Vineet
Neel Joshi
ViT
196
13
0
11 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
Neural Information Processing Systems (NeurIPS), 2022
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD, VLM
329
183
0
07 Jul 2022
FewSOL: A Dataset for Few-Shot Object Learning in Robotic Environments
IEEE International Conference on Robotics and Automation (ICRA), 2022
Jishnu Jaykumar P
Yu-Wei Chao
Yu Xiang
145
13
0
06 Jul 2022
Image Amodal Completion: A Survey
Computer Vision and Image Understanding (CVIU), 2022
Jiayang Ao
Qiuhong Ke
Krista A. Ehinger
437
25
0
05 Jul 2022
Vision-and-Language Pretraining
Thong Nguyen
Cong-Duy Nguyen
Xiaobao Wu
See-Kiong Ng
Anh Tuan Luu
VLM, CLIP
276
2
0
05 Jul 2022
Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation
Ming Li
Jie Wu
Jin Cai
J. Qin
Yuxi Ren
Xu Xiao
Min Zheng
Rui Wang
X. Pan
ViT
172
2
0
22 Jun 2022
Deep Learning Models on CPUs: A Methodology for Efficient Training
Quchen Fu
Ramesh Chukka
Keith Achorn
Thomas Atta-fosu
Deepak R. Canchi
Zhongwei Teng
Jules White
Douglas C. Schmidt
158
3
0
20 Jun 2022
DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations
Neural Information Processing Systems (NeurIPS), 2022
Ximeng Sun
Ping Hu
Kate Saenko
VLM
252
161
0
20 Jun 2022
Gender Artifacts in Visual Datasets
IEEE International Conference on Computer Vision (ICCV), 2022
Nicole Meister
Dora Zhao
Angelina Wang
V. V. Ramaswamy
Ruth C. Fong
Olga Russakovsky
288
36
0
18 Jun 2022
All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP)
A. Vaswani
Gaurav Aggarwal
Praneeth Netrapalli
N. Hegde
251
4
0
17 Jun 2022
It's Time for Artistic Correspondence in Music and Video
Computer Vision and Pattern Recognition (CVPR), 2022
Dídac Surís
Carl Vondrick
Bryan C. Russell
Justin Salamon
147
42
0
14 Jun 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
319
371
0
14 Jun 2022
Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation
Wouter Van Gansbeke
Simon Vandenhende
Luc Van Gool
228
63
0
13 Jun 2022
A Semantic Consistency Feature Alignment Object Detection Model Based on Mixed-Class Distribution Metrics
Lijun Gou
Jinrong Yang
Hangchen Yu
Pan Wang
Xiaoping Li
Chao Deng
96
2
0
12 Jun 2022
Gradient Obfuscation Gives a False Sense of Security in Federated Learning
USENIX Security Symposium (USENIX Security), 2022
Kai Yue
Richeng Jin
Chau-Wai Wong
D. Baron
H. Dai
FedML
253
66
0
08 Jun 2022
A Survey on Long-Tailed Visual Recognition
International Journal of Computer Vision (IJCV), 2022
Pu Cao
He Jiang
Q. Song
Jun Guo
301
161
0
27 May 2022
Penalizing Proposals using Classifiers for Semi-Supervised Object Detection
Computer Vision and Image Understanding (CVIU), 2022
S. Hazra
P. Dasgupta
198
0
0
26 May 2022
Perceptual Learned Source-Channel Coding for High-Fidelity Image Semantic Transmission
Global Communications Conference (GLOBECOM), 2022
Jun Wang
Sixian Wang
Jincheng Dai
Zhongwei Si
Dekun Zhou
K. Niu
171
40
0
26 May 2022
Charon: a FrameNet Annotation Tool for Multimodal Corpora
Linguistic Annotation Workshop (LAW), 2022
Frederico Belcavello
Marcelo Viridiano
E. Matos
Haiyue Song
101
6
0
24 May 2022
Deep Image Retrieval is not Robust to Label Noise
Stanislav Dereka
I. Karpukhin
Sergey Kolesnikov
NoLa, VLM
173
2
0
23 May 2022
The Case for Perspective in Multimodal Datasets
Marcelo Viridiano
Haiyue Song
Oliver Czulo
Arthur Lorenzi
E. Matos
Frederico Belcavello
106
7
0
22 May 2022
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Neural Information Processing Systems (NeurIPS), 2022
Zhenhailong Wang
Pengfei Yu
Ruochen Xu
Luowei Zhou
Jie Lei
...
Chenguang Zhu
Derek Hoiem
Shih-Fu Chang
Joey Tianyi Zhou
Heng Ji
MLLM, VLM
533
162
0
22 May 2022
Deep transfer learning for image classification: a survey
J. Plested
Musa Phiri
Tom Gedeon
OOD
210
47
0
20 May 2022
Simple Open-Vocabulary Object Detection with Vision Transformers
Matthias Minderer
A. Gritsenko
Austin Stone
Maxim Neumann
Dirk Weissenborn
...
Zhuoran Shen
Tianlin Li
Xiaohua Zhai
Thomas Kipf
N. Houlsby
ObjD, CLIP, VLM, ViT, OCL
319
367
0
12 May 2022
Deep Learning and Computer Vision Techniques for Microcirculation Analysis: A Review
Patterns (Patterns), 2022
Maged Abdalla Helmy Abdou
T. Truong
E. Jul
Paulo Ferreira
238
9
0
11 May 2022
Beyond Bounding Box: Multimodal Knowledge Learning for Object Detection
Wei Feng
Xingyuan Bu
Chenchen Zhang
Xubin Li
VLM
152
5
0
09 May 2022
HL-Net: Heterophily Learning Network for Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Xin Lin
Changxing Ding
Yibing Zhan
Zijian Li
Dacheng Tao
198
44
0
03 May 2022
RU-Net: Regularized Unrolling Network for Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Xin Lin
Changxing Ding
Jing Zhang
Yibing Zhan
Dacheng Tao
207
38
0
03 May 2022
Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels
Kaidi Wang
Xiang Peng
Shuo Yang
Jianfei Yang
Zheng Hua Zhu
Xinchao Wang
Yang You
NoLa
226
9
0
30 Apr 2022
Seeing without Looking: Analysis Pipeline for Child Sexual Abuse Datasets
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Camila Laranjeira
João Macedo
Sandra Avila
J. A. dos Santos
108
21
0
29 Apr 2022
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
Dan Oneaţă
H. Cucu
118
24
0
27 Apr 2022
Training and challenging models for text-guided fashion image retrieval
Eric Dodds
Jack Culpepper
Gaurav Srivastava
145
10
0
23 Apr 2022
Fast AdvProp
International Conference on Learning Representations (ICLR), 2022
Jieru Mei
Yucheng Han
Yutong Bai
Yixiao Zhang
Yingwei Li
Xianhang Li
Alan Yuille
Cihang Xie
AAML
180
9
0
21 Apr 2022
A Tour of Visualization Techniques for Computer Vision Datasets
B. Alsallakh
P. Bhattacharya
V. Feng
Narine Kokhlikyan
Orion Reblitz-Richardson
Rahul Rajan
David Yan
128
4
0
19 Apr 2022
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Sanjay Subramanian
William Merrill
Trevor Darrell
Matt Gardner
Sameer Singh
Anna Rohrbach
ObjD
284
156
0
12 Apr 2022
Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2022
Kalyan Vasudev Alwala
Abhinav Gupta
Shubham Tulsiani
172
34
0
07 Apr 2022
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
European Conference on Computer Vision (ECCV), 2022
Sanghyuk Chun
Wonjae Kim
Song Park
Minsuk Chang
Seong Joon Oh
VLM
1.4K
51
0
07 Apr 2022
"This is my unicorn, Fluffy": Personalizing frozen vision-language representations
European Conference on Computer Vision (ECCV), 2022
Niv Cohen
Rinon Gal
E. Meirom
Gal Chechik
Yuval Atzmon
VLM, MLLM
351
102
0
04 Apr 2022