The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD, VLM

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown
Grafit: Learning fine-grained image representations with coarse labels
IEEE International Conference on Computer Vision (ICCV), 2020
Hugo Touvron
Alexandre Sablayrolles
Matthijs Douze
Matthieu Cord
Edouard Grave
SSL
143
73
0
25 Nov 2020
Insights From A Large-Scale Database of Material Depictions In Paintings
Hubert Lin
Mitchell J. P. van Zuijlen
M. Wijntjes
S. Pont
Kavita Bala
186
8
0
24 Nov 2020
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
Computer Vision and Pattern Recognition (CVPR), 2020
Mahmoud Afifi
Marcus A. Brubaker
M. S. Brown
GAN
240
126
0
23 Nov 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kemal Oksuz
Baris Can Cam
Sinan Kalkan
Emre Akbas
204
39
0
21 Nov 2020
Open-Vocabulary Object Detection Using Captions
Computer Vision and Pattern Recognition (CVPR), 2020
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLM, ObjD
384
531
0
20 Nov 2020
Towards Abstract Relational Learning in Human Robot Interaction
Mohamadreza Faridghasemnia
Daniele Nardi
A. Saffiotti
96
2
0
20 Nov 2020
Efficient Conditional Pre-training for Transfer Learning
Shuvam Chakraborty
Burak Uzkent
Kumar Ayush
Kumar Tanmay
Evan Sheehan
Stefano Ermon
VLM
490
20
0
20 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases
Conference on Fairness, Accountability and Transparency (FAccT), 2020
Ryan Steed
Aylin Caliskan
SSL
290
172
0
28 Oct 2020
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
Liunian Harold Li
Haoxuan You
Zhecan Wang
Alireza Zareian
Shih-Fu Chang
Kai-Wei Chang
SSL, VLM
183
12
0
24 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph
Jingkang Yang
Weirong Chen
Xue Jiang
Xiaopeng Yan
Huabin Zheng
Wayne Zhang
NoLa
117
13
0
12 Oct 2020
A Unified Framework for Generic, Query-Focused, Privacy Preserving and Update Summarization using Submodular Information Measures
Vishal Kaushal
Suraj Kothawade
Ganesh Ramakrishnan
J. Bilmes
Himanshu Asnani
Rishabh K. Iyer
88
6
0
12 Oct 2020
Homography Estimation with Convolutional Neural Networks Under Conditions of Variance
David Niblick
A. Kak
187
4
0
02 Oct 2020
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns
L. Ferreira
Douglas De Rizzo Meneghetti
P. Santos
71
2
0
02 Oct 2020
Asymmetric Loss For Multi-Label Classification
Emanuel Ben-Baruch
T. Ridnik
Nadav Zamir
Asaf Noy
Itamar Friedman
M. Protter
Lihi Zelnik-Manor
360
683
0
29 Sep 2020
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
Xiaowei Hu
Xi Yin
Kevin Qinghong Lin
Lijuan Wang
Guang Dai
Jianfeng Gao
Zicheng Liu
VLM
203
58
0
28 Sep 2020
MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection
European Conference on Computer Vision (ECCV), 2020
Xin Lu
Quanquan Li
Buyu Li
Junjie Yan
ObjD
155
62
0
24 Sep 2020
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
Zhiqiang Shen
Marios Savvides
253
68
0
17 Sep 2020
Knowledge Guided Learning: Towards Open Domain Egocentric Action Recognition with Zero Supervision
Sathyanarayanan N. Aakur
Sanjoy Kundu
Nikhil Gunti
EgoV
127
1
0
16 Sep 2020
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation
AAAI Conference on Artificial Intelligence (AAAI), 2020
Haisheng Su
Weihao Gan
Wei Wu
Yu Qiao
Junjie Yan
208
136
0
15 Sep 2020
Adaptive Label Smoothing
Ujwal Krothapalli
A. Lynn Abbott
187
11
0
14 Sep 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models
International Conference on Computational Linguistics (COLING), 2020
Khyathi Chandu
Piyush Sharma
Soravit Changpinyo
Ashish V. Thapliyal
Radu Soricut
DiffM, VLM
201
3
0
10 Sep 2020
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask
Jingru Tan
Qiang Chen
Hanming Deng
Changbao Wang
Lewei Lu
Quanquan Li
Jifeng Dai
144
19
0
03 Sep 2020
A Cost-Effective Person-Following System for Assistive Unmanned Vehicles with Deep Learning at the Edge
A. Boschi
Francesco Salvetti
Vittorio Mazzia
Marcello Chiaberge
180
14
0
31 Aug 2020
Soliciting Human-in-the-Loop User Feedback for Interactive Machine Learning Reduces User Trust and Impressions of Model Accuracy
AAAI Conference on Human Computation & Crowdsourcing (HCOMP), 2020
Donald R. Honeycutt
Mahsan Nourani
Eric D. Ragan
HAI
225
73
0
28 Aug 2020
Domain Adaptation Through Task Distillation
European Conference on Computer Vision (ECCV), 2020
Brady Zhou
Nimit Kalra
Philipp Krahenbuhl
OOD
144
16
0
27 Aug 2020
DeepSOCIAL: Social Distancing Monitoring and Infection Risk Assessment in COVID-19 Pandemic
medRxiv (medRxiv), 2020
Mahdi Rezaei
Mohsen Azarmi
255
159
0
26 Aug 2020
Object Detection with a Unified Label Space from Multiple Datasets
Xiangyu Zhao
S. Schulter
Gaurav Sharma
Yi-Hsuan Tsai
Manmohan Chandraker
Ying Nian Wu
ObjD
164
80
0
15 Aug 2020
What leads to generalization of object proposals?
Rui Wang
D. Mahajan
Vignesh Ramanathan
ObjD
183
12
0
13 Aug 2020
Guided Collaborative Training for Pixel-wise Semi-Supervised Learning
European Conference on Computer Vision (ECCV), 2020
Zhanghan Ke
Di Qiu
Kaican Li
Qiong Yan
Rynson W. H. Lau
237
282
0
12 Aug 2020
BREEDS: Benchmarks for Subpopulation Shift
International Conference on Learning Representations (ICLR), 2020
Shibani Santurkar
Dimitris Tsipras
Aleksander Madry
OOD
187
189
0
11 Aug 2020
Polysemy Deciphering Network for Robust Human-Object Interaction Detection
International Journal of Computer Vision (IJCV), 2020
Xubin Zhong
Changxing Ding
X. Qu
Dacheng Tao
309
62
0
07 Aug 2020
Multiple instance learning on deep features for weakly supervised object detection with extreme domain shifts
Nicolas Gonthier
Saïd Ladjal
Y. Gousseau
WSOD
387
31
0
03 Aug 2020
Spatially Aware Multimodal Transformers for TextVQA
European Conference on Computer Vision (ECCV), 2020
Yash Kant
Dhruv Batra
Peter Anderson
Alex Schwing
Devi Parikh
Jiasen Lu
Harsh Agrawal
179
93
0
23 Jul 2020
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020
Haisheng Su
Jinyuan Feng
Hao Shao
Zhenyu Jiang
Manyuan Zhang
Wei Wu
Yu Liu
Jiaming Song
Junjie Yan
136
0
0
20 Jul 2020
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
European Conference on Computer Vision (ECCV), 2020
Yuanyi Zhong
Jianfeng Wang
Jian-wei Peng
Lei Zhang
148
57
0
15 Jul 2020
COBE: Contextualized Object Embeddings from Narrated Instructional Video
Neural Information Processing Systems (NeurIPS), 2020
Gedas Bertasius
Lorenzo Torresani
185
27
0
14 Jul 2020
Deep learning for scene recognition from visual data: a survey
Alina Matei
A. Glavan
Estefanía Talavera
160
19
0
03 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image Classification
Rohan Taori
Achal Dave
Vaishaal Shankar
Nicholas Carlini
Benjamin Recht
Ludwig Schmidt
OOD
466
623
0
01 Jul 2020
Recurrent Relational Memory Network for Unsupervised Image Captioning
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Dan Guo
Yang Wang
Peipei Song
Meng Wang
GAN
168
42
0
24 Jun 2020
Large image datasets: A pyrrhic win for computer vision?
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Vinay Uday Prabhu
Abeba Birhane
300
402
0
24 Jun 2020
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks
International Conference on Machine Learning (ICML), 2020
Avi Schwarzschild
Micah Goldblum
Arjun Gupta
John P. Dickerson
Tom Goldstein
AAML, TDI
272
189
0
22 Jun 2020
UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation
Siddhesh Khandelwal
Raghav Goyal
Leonid Sigal
VLM
329
2
0
12 Jun 2020
Rethinking Pre-training and Self-training
Neural Information Processing Systems (NeurIPS), 2020
Barret Zoph
Golnaz Ghiasi
Nayeon Lee
Huayu Chen
Hanxiao Liu
E. D. Cubuk
Quoc V. Le
SSeg
273
702
0
11 Jun 2020
Privacy-Aware Activity Classification from First Person Office Videos
Partho Ghosh
Md. Abrar Istiak
Nayeeb Rashid
Ahsan Habib Akash
Ridwan Abrar
Ankan Ghosh Dastider
Asif Sushmit
Taufiq Hasan
PICV
134
2
0
11 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Alessandro Suglia
Ioannis Konstas
Andrea Vanzo
E. Bastianelli
Desmond Elliott
Stella Frank
Oliver Lemon
131
17
0
03 Jun 2020
Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Pattern Recognition Letters (Pattern Recognit. Lett.), 2020
Lluís Gómez
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Marçal Rusiñol
Ernest Valveny
Dimosthenis Karatzas
189
22
0
01 Jun 2020
Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels
Junran Peng
Xingyuan Bu
Ming Sun
Zhaoxiang Zhang
Tieniu Tan
Junjie Yan
VLM, ObjD
170
66
0
18 May 2020
Cross-media Structured Common Space for Multimedia Event Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Pengfei Yu
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
151
116
0
05 May 2020
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
Gaurav Rai
251
250
0
04 May 2020
Clue: Cross-modal Coherence Modeling for Caption Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Malihe Alikhani
Piyush Sharma
Shengjie Li
Radu Soricut
Matthew Stone
181
59
0
02 May 2020