Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Ping Hu
Ximeng Sun
Stan Sclaroff
Kate Saenko
VLM
335
33
0
03 Aug 2023
ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation
ACM Multimedia (ACM MM), 2023
Bo Zhang
Jian Wang
Hui Ma
Bo Xu
Hongfei Lin
194
5
0
01 Aug 2023
Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels
International Conference on Software Engineering Research and Applications (ICSERA), 2023
Xin Zhang
Yuqi Song
Fei Zuo
Xiang Wang
281
2
0
31 Jul 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
215
5
0
28 Jul 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Scientific Reports (Sci Rep), 2023
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
199
5
0
26 Jul 2023
Towards Establishing Systematic Classification Requirements for Automated Driving
Kent Mori
Trent Brown
Steven C. Peters
224
0
0
26 Jul 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2023
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
298
197
0
21 Jul 2023
Interactive Segmentation for Diverse Gesture Types Without Context
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Josh Myers-Dean
Yifei Fan
Brian L. Price
Wilson Chan
Danna Gurari
306
5
0
20 Jul 2023
In Defense of Clip-based Video Relation Detection
IEEE Transactions on Image Processing (IEEE TIP), 2023
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
182
7
0
18 Jul 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
221
26
0
17 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
IEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
352
61
0
17 Jul 2023
DynamicFL: Balancing Communication Dynamics and Client Manipulation for Federated Learning
Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON), 2023
Bocheng Chen
Nikolay Ivanov
Guangjing Wang
Qiben Yan
207
7
0
16 Jul 2023
EmoSet: A Large-scale Visual Emotion Dataset with Rich Attributes
IEEE International Conference on Computer Vision (ICCV), 2023
Jingyuan Yang
Qiruin Huang
Tingting Ding
Dani Lischinski
Daniel Cohen-Or
Hui Huang
213
90
0
16 Jul 2023
Unbiased Scene Graph Generation via Two-stage Causal Modeling
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Shuzhou Sun
Shuaifeng Zhi
Qing Liao
J. Heikkilä
Tianpeng Liu
CML
264
51
0
11 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
179
8
0
08 Jul 2023
Pollen: High-throughput Federated Learning Simulation via Resource-Aware Client Placement
Lorenzo Sani
Pedro Gusmão
Alexandru Iacob
Wanru Zhao
Xinchi Qiu
Yan Gao
Javier Fernandez-Marques
Nicholas D. Lane
242
0
0
30 Jun 2023
Transferability Metrics for Object Detection
Louis Fouquet
Simona Maggio
L. Dreyfus-Schmidt
153
1
0
27 Jun 2023
ParameterNet: Parameters Are All You Need
Computer Vision and Pattern Recognition (CVPR), 2023
Kai Han
Yunhe Wang
Jianyuan Guo
Enhua Wu
VLM
AI4CE
158
76
0
26 Jun 2023
DISCO-10M: A Large-Scale Music Dataset
Neural Information Processing Systems (NeurIPS), 2023
Luca A. Lanzendörfer
Florian Grötschla
Emil Funke
Roger Wattenhofer
125
24
0
23 Jun 2023
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Qianji Di
Wenxing Ma
Chen Ma
Tianxiang Hou
Ying Shan
Hanzi Wang
145
1
0
23 Jun 2023
Label-noise-tolerant medical image classification via self-attention and self-supervised learning
Hongyang Jiang
Mengdi Gao
Yan Hu
Qi Ren
Zhaoheng Xie
Jiang-Dong Liu
NoLa
140
5
0
16 Jun 2023
Scaling Open-Vocabulary Object Detection
Neural Information Processing Systems (NeurIPS), 2023
Matthias Minderer
A. Gritsenko
N. Houlsby
VLM
ObjD
424
315
0
16 Jun 2023
ScaleDet: A Scalable Multi-Dataset Object Detector
Computer Vision and Pattern Recognition (CVPR), 2023
Yanbei Chen
Manchen Wang
Abhay Mittal
Zhenlin Xu
Paolo Favaro
Joseph Tighe
Davide Modolo
ObjD
177
27
0
08 Jun 2023
Coarse Is Better? A New Pipeline Towards Self-Supervised Learning with Uncurated Images
Pattern Recognition (Pattern Recogn.), 2023
Ke Zhu
Yin He
Jianxin Wu
255
7
0
07 Jun 2023
The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects
Computer Vision and Pattern Recognition (CVPR), 2023
Ruohan Gao
Yiming Dou
Hao Li
Tanmay Agarwal
Jeannette Bohg
Yunzhu Li
Li Fei-Fei
Jiajun Wu
155
51
0
01 Jun 2023
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
278
0
0
31 May 2023
What Can We Learn from Unlearnable Datasets?
Neural Information Processing Systems (NeurIPS), 2023
Pedro Sandoval-Segura
Vasu Singla
Jonas Geiping
Micah Goldblum
Tom Goldstein
279
20
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
International Journal of Computer Vision (IJCV), 2023
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
328
141
0
29 May 2023
Learning high-level visual representations from a child's perspective without strong inductive biases
A. Orhan
Brenden M. Lake
SSL
263
34
0
24 May 2023
NeSy4VRD: A Multifaceted Resource for Neurosymbolic AI Research using Knowledge Graphs in Visual Relationship Detection
D. Herron
Ernesto Jiménez-Ruiz
G. Tarroni
Tillman Weyde
188
2
0
22 May 2023
Relabeling Minimal Training Subset to Flip a Prediction
Findings (Findings), 2023
Jinghan Yang
Linjie Xu
Lequan Yu
305
3
0
22 May 2023
Annotation-free Audio-Visual Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jinxian Liu
Yu Wang
Chen Ju
Chaofan Ma
Ya Zhang
Weidi Xie
VOS
VLM
395
47
0
18 May 2023
Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jialing Yuan
Ye Yu
Gaurav Mittal
Matthew Hall
Sandra Sajeev
Mei Chen
215
14
0
17 May 2023
Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go
Yecong Wan
Mingzhen Shao
Yuanshuo Cheng
YueQin Liu
Zhipeng Bao
208
9
0
17 May 2023
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
VLM
189
24
0
16 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
International Conference on Computer Analysis of Images and Patterns (CAIP), 2023
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
149
4
0
08 May 2023
OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese
Information Fusion (Inf. Fusion), 2023
Nghia Hieu Nguyen
Duong T.D. Vo
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
196
27
0
07 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Neural Information Processing Systems (NeurIPS), 2023
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
276
32
0
04 May 2023
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunxin Li
Baotian Hu
Yuxin Ding
Lin Ma
Hao Fei
216
6
0
03 May 2023
An Extensible Multimodal Multi-task Object Dataset with Materials
International Conference on Learning Representations (ICLR), 2023
Trevor Scott Standley
Ruohan Gao
Dawn Chen
Jiajun Wu
Silvio Savarese
141
2
0
29 Apr 2023
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
219
6
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Computer Vision and Pattern Recognition (CVPR), 2023
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
279
24
0
25 Apr 2023
Docmarking: Real-Time Screen-Cam Robust Document Image Watermarking
A. Yakushev
Yury Markin
D. Obydenkov
A. Frolov
S. Fomin
Manuk Akopyan
A. Kozachok
Arthur Gaynov
43
2
0
25 Apr 2023
Building Multimodal AI Chatbots
Mingyu Lee
156
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Computer Vision and Pattern Recognition (CVPR), 2023
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
192
24
0
13 Apr 2023
ImageCaptioner
2
^2
2
: Image Captioner for Image Captioning Bias Amplification Assessment
AAAI Conference on Artificial Intelligence (AAAI), 2023
Eslam Mohamed Bakr
Pengzhan Sun
Erran L. Li
Mohamed Elhoseiny
200
10
0
10 Apr 2023
Knowledge Combination to Learn Rotated Detection Without Rotated Annotation
Computer Vision and Pattern Recognition (CVPR), 2023
Tianyu Zhu
Bryce Ferenczi
Pulak Purkait
Tom Drummond
Hamid Rezatofighi
Anton Van Den Hengel
238
20
0
05 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
166
9
0
04 Apr 2023
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations
Computer Vision and Pattern Recognition (CVPR), 2023
VS Vibashan
Ning Yu
Chen Xing
Can Qin
M. Gao
Juan Carlos Niebles
Vishal M. Patel
Ran Xu
VLM
ISeg
252
19
0
29 Mar 2023
Egocentric Auditory Attention Localization in Conversations
Computer Vision and Pattern Recognition (CVPR), 2023
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
229
23
0
28 Mar 2023
Previous
1
2
3
4
5
...
11
12
13
Next
Page 4 of 13
Page
of 13
Go