The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD, VLM

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Ping Hu
Ximeng Sun
Stan Sclaroff
Kate Saenko
VLM
335
33
0
03 Aug 2023
ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation
ACM Multimedia (ACM MM), 2023
Bo Zhang
Jian Wang
Hui Ma
Bo Xu
Hongfei Lin
194
5
0
01 Aug 2023
Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels
International Conference on Software Engineering Research and Applications (ICSERA), 2023
Xin Zhang
Yuqi Song
Fei Zuo
Xiang Wang
281
2
0
31 Jul 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
215
5
0
28 Jul 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Scientific Reports (Sci Rep), 2023
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
199
5
0
26 Jul 2023
Towards Establishing Systematic Classification Requirements for Automated Driving
Kent Mori
Trent Brown
Steven C. Peters
224
0
0
26 Jul 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2023
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
298
197
0
21 Jul 2023
Interactive Segmentation for Diverse Gesture Types Without Context
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Josh Myers-Dean
Yifei Fan
Brian L. Price
Wilson Chan
Danna Gurari
306
5
0
20 Jul 2023
In Defense of Clip-based Video Relation Detection
IEEE Transactions on Image Processing (IEEE TIP), 2023
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
182
7
0
18 Jul 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
221
26
0
17 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
IEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
352
61
0
17 Jul 2023
DynamicFL: Balancing Communication Dynamics and Client Manipulation for Federated Learning
Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON), 2023
Bocheng Chen
Nikolay Ivanov
Guangjing Wang
Qiben Yan
207
7
0
16 Jul 2023
EmoSet: A Large-scale Visual Emotion Dataset with Rich Attributes
IEEE International Conference on Computer Vision (ICCV), 2023
Jingyuan Yang
Qirui Huang
Tingting Ding
Dani Lischinski
Daniel Cohen-Or
Hui Huang
213
90
0
16 Jul 2023
Unbiased Scene Graph Generation via Two-stage Causal Modeling
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Shuzhou Sun
Shuaifeng Zhi
Qing Liao
J. Heikkilä
Tianpeng Liu
CML
264
51
0
11 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
179
8
0
08 Jul 2023
Pollen: High-throughput Federated Learning Simulation via Resource-Aware Client Placement
Lorenzo Sani
Pedro Gusmão
Alexandru Iacob
Wanru Zhao
Xinchi Qiu
Yan Gao
Javier Fernandez-Marques
Nicholas D. Lane
242
0
0
30 Jun 2023
Transferability Metrics for Object Detection
Louis Fouquet
Simona Maggio
L. Dreyfus-Schmidt
153
1
0
27 Jun 2023
ParameterNet: Parameters Are All You Need
Computer Vision and Pattern Recognition (CVPR), 2023
Kai Han
Yunhe Wang
Jianyuan Guo
Enhua Wu
VLM, AI4CE
158
76
0
26 Jun 2023
DISCO-10M: A Large-Scale Music Dataset
Neural Information Processing Systems (NeurIPS), 2023
Luca A. Lanzendörfer
Florian Grötschla
Emil Funke
Roger Wattenhofer
125
24
0
23 Jun 2023
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Qianji Di
Wenxing Ma
Chen Ma
Tianxiang Hou
Ying Shan
Hanzi Wang
145
1
0
23 Jun 2023
Label-noise-tolerant medical image classification via self-attention and self-supervised learning
Hongyang Jiang
Mengdi Gao
Yan Hu
Qi Ren
Zhaoheng Xie
Jiang-Dong Liu
NoLa
140
5
0
16 Jun 2023
Scaling Open-Vocabulary Object Detection
Neural Information Processing Systems (NeurIPS), 2023
Matthias Minderer
A. Gritsenko
N. Houlsby
VLM, ObjD
424
315
0
16 Jun 2023
ScaleDet: A Scalable Multi-Dataset Object Detector
Computer Vision and Pattern Recognition (CVPR), 2023
Yanbei Chen
Manchen Wang
Abhay Mittal
Zhenlin Xu
Paolo Favaro
Joseph Tighe
Davide Modolo
ObjD
177
27
0
08 Jun 2023
Coarse Is Better? A New Pipeline Towards Self-Supervised Learning with Uncurated Images
Pattern Recognition (Pattern Recogn.), 2023
Ke Zhu
Yin He
Jianxin Wu
255
7
0
07 Jun 2023
The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects
Computer Vision and Pattern Recognition (CVPR), 2023
Ruohan Gao
Yiming Dou
Hao Li
Tanmay Agarwal
Jeannette Bohg
Yunzhu Li
Li Fei-Fei
Jiajun Wu
155
51
0
01 Jun 2023
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
278
0
0
31 May 2023
What Can We Learn from Unlearnable Datasets?
Neural Information Processing Systems (NeurIPS), 2023
Pedro Sandoval-Segura
Vasu Singla
Jonas Geiping
Micah Goldblum
Tom Goldstein
279
20
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
International Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD, VLM, MLLM
328
141
0
29 May 2023
Learning high-level visual representations from a child's perspective without strong inductive biases
A. Orhan
Brenden M. Lake
SSL
263
34
0
24 May 2023
NeSy4VRD: A Multifaceted Resource for Neurosymbolic AI Research using Knowledge Graphs in Visual Relationship Detection
D. Herron
Ernesto Jiménez-Ruiz
G. Tarroni
Tillman Weyde
188
2
0
22 May 2023
Relabeling Minimal Training Subset to Flip a Prediction
Findings (Findings), 2023
Jinghan Yang
Linjie Xu
Lequan Yu
305
3
0
22 May 2023
Annotation-free Audio-Visual Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jinxian Liu
Yu Wang
Chen Ju
Chaofan Ma
Ya Zhang
Weidi Xie
VOS, VLM
395
47
0
18 May 2023
Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jialing Yuan
Ye Yu
Gaurav Mittal
Matthew Hall
Sandra Sajeev
Mei Chen
215
14
0
17 May 2023
Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go
Yecong Wan
Mingzhen Shao
Yuanshuo Cheng
YueQin Liu
Zhipeng Bao
208
9
0
17 May 2023
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
VLM
189
24
0
16 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
International Conference on Computer Analysis of Images and Patterns (CAIP), 2023
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
149
4
0
08 May 2023
OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese
Information Fusion (Inf. Fusion), 2023
Nghia Hieu Nguyen
Duong T.D. Vo
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
196
27
0
07 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Neural Information Processing Systems (NeurIPS), 2023
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
276
32
0
04 May 2023
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunxin Li
Baotian Hu
Yuxin Ding
Lin Ma
Hao Fei
216
6
0
03 May 2023
An Extensible Multimodal Multi-task Object Dataset with Materials
International Conference on Learning Representations (ICLR), 2023
Trevor Scott Standley
Ruohan Gao
Dawn Chen
Jiajun Wu
Silvio Savarese
141
2
0
29 Apr 2023
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
219
6
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Computer Vision and Pattern Recognition (CVPR), 2023
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
279
24
0
25 Apr 2023
Docmarking: Real-Time Screen-Cam Robust Document Image Watermarking
A. Yakushev
Yury Markin
D. Obydenkov
A. Frolov
S. Fomin
Manuk Akopyan
A. Kozachok
Arthur Gaynov
43
2
0
25 Apr 2023
Building Multimodal AI Chatbots
Mingyu Lee
156
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Computer Vision and Pattern Recognition (CVPR), 2023
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
192
24
0
13 Apr 2023
ImageCaptioner$^2$: Image Captioner for Image Captioning Bias Amplification Assessment
AAAI Conference on Artificial Intelligence (AAAI), 2023
Eslam Mohamed Bakr
Pengzhan Sun
Erran L. Li
Mohamed Elhoseiny
200
10
0
10 Apr 2023
Knowledge Combination to Learn Rotated Detection Without Rotated Annotation
Computer Vision and Pattern Recognition (CVPR), 2023
Tianyu Zhu
Bryce Ferenczi
Pulak Purkait
Tom Drummond
Hamid Rezatofighi
Anton Van Den Hengel
238
20
0
05 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
166
9
0
04 Apr 2023
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations
Computer Vision and Pattern Recognition (CVPR), 2023
VS Vibashan
Ning Yu
Chen Xing
Can Qin
M. Gao
Juan Carlos Niebles
Vishal M. Patel
Ran Xu
VLM, ISeg
252
19
0
29 Mar 2023
Egocentric Auditory Attention Localization in Conversations
Computer Vision and Pattern Recognition (CVPR), 2023
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
229
23
0
28 Mar 2023