Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Binxin Yang
Shuyang Gu
Bo Zhang
Ting Zhang
Xuejin Chen
Xiaoyan Sun
Dong Chen
Fang Wen
DiffM
282
546
0
23 Nov 2022
Plug and Play Active Learning for Object Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Chenhongyi Yang
Lichao Huang
Elliot J. Crowley
ObjD
239
30
0
21 Nov 2022
ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Zhihang Zhong
Mingxi Cheng
Zhirong Wu
Yuhui Yuan
Yinqiang Zheng
Ji Li
Han Hu
Stephen Lin
Yoichi Sato
Imari Sato
VLM
CLIP
135
8
0
21 Nov 2022
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Computer Vision and Pattern Recognition (CVPR), 2022
Mengmeng Xu
Yanghao Li
Cheng-Yang Fu
Guohao Li
Tao Xiang
Juan-Manuel Perez-Rua
225
19
0
18 Nov 2022
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding
Computer Vision and Pattern Recognition (CVPR), 2022
Zijiao Chen
Jiaxin Qing
Tiange Xiang
Wan Lin Yue
J. Zhou
DiffM
MedIm
336
200
0
13 Nov 2022
SSGVS: Semantic Scene Graph-to-Video Synthesis
Yuren Cong
Jinhui Yi
Bodo Rosenhahn
M. Yang
242
8
0
11 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Computer Vision and Pattern Recognition (CVPR), 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Jiaming Song
Xiaogang Wang
Yu Qiao
VLM
553
958
0
10 Nov 2022
High-Quality Entity Segmentation
IEEE International Conference on Computer Vision (ICCV), 2022
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe Lin
Ming-Hsuan Yang
ISeg
295
77
0
10 Nov 2022
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection
Computer Vision and Image Understanding (CVIU), 2022
Huayi Zhou
Fei Jiang
Hongtao Lu
ObjD
289
107
0
04 Nov 2022
DEArt: Dataset of European Art
Artem Reshetnikov
M. Marinescu
J. M. López
VLM
3DH
191
13
0
02 Nov 2022
Universal Deep Image Compression via Content-Adaptive Optimization with Adapters
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Koki Tsubota
Hiroaki Akutsu
Kiyoharu Aizawa
158
22
0
02 Nov 2022
Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation
European Conference on Computer Vision (ECCV), 2022
Simone Rossetti
Damiano Zappia
Marta Sanzari
M. Schaerf
F. Pirri
ViT
323
80
0
31 Oct 2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Social Science Research Network (SSRN), 2022
Ç. S. Çoban
Oguzhan Keskin
Jordi Pont-Tuset
Fatma Guney
VOS
213
0
0
30 Oct 2022
A Survey on Causal Representation Learning and Future Work for Medical Image Analysis
Chang-Tien Lu
OOD
BDL
CML
MedIm
255
0
0
28 Oct 2022
Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Mitja Nikolaus
Emmanuelle Salin
Stéphane Ayache
Abdellah Fourtassi
Benoit Favre
144
17
0
21 Oct 2022
Similarity of Neural Architectures using Adversarial Attack Transferability
European Conference on Computer Vision (ECCV), 2022
Ian Ryu
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
538
3
0
20 Oct 2022
VTC: Improving Video-Text Retrieval with User Comments
European Conference on Computer Vision (ECCV), 2022
Laura Hanu
James Thewlis
Yuki M. Asano
Christian Rupprecht
VGen
231
8
0
19 Oct 2022
Learning to Discover and Detect Objects
Neural Information Processing Systems (NeurIPS), 2022
V. Fomenko
Ismail Elezi
Deva Ramanan
Laura Leal-Taixé
Aljosa Osep
ObjD
263
13
0
19 Oct 2022
A Tri-Layer Plugin to Improve Occluded Detection
British Machine Vision Conference (BMVC), 2022
Guanqi Zhan
Weidi Xie
Andrew Zisserman
211
26
0
18 Oct 2022
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel Logistics
International Conference on Machine Learning and Applications (ICMLA), 2022
Alexander Naumann
Felix Hertlein
Benchun Zhou
Laura Dörr
K. Furmans
179
6
0
18 Oct 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
191
1
0
18 Oct 2022
Non-Contrastive Learning Meets Language-Image Pre-Training
Computer Vision and Pattern Recognition (CVPR), 2022
Jinghao Zhou
Li Dong
Zhe Gan
Lijuan Wang
Furu Wei
VLM
CLIP
210
33
0
17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models
International Conference on Computer Science and Artificial Intelligence (ICCSAI), 2022
Yueqin Yin
Lianghua Huang
Yu Liu
Kaiqiang Huang
DiffM
143
13
0
16 Oct 2022
Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers
Tao Tang
Changlin Li
Guangrun Wang
Kaicheng Yu
Xiaojun Chang
Xiaodan Liang
ViT
212
1
0
16 Oct 2022
Active Learning from the Web
The Web Conference (WWW), 2022
Ryoma Sato
136
0
0
15 Oct 2022
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Wenliang Dai
Zihan Liu
Ziwei Ji
Jane Polak Scowcroft
Pascale Fung
MLLM
VLM
305
75
0
14 Oct 2022
Caption supervision enables robust learners
Ben Feuer
Ameya Joshi
Chinmay Hegde
SSL
CLIP
VLM
205
3
0
13 Oct 2022
Exploring Long-Sequence Masked Autoencoders
Ronghang Hu
Shoubhik Debnath
Saining Xie
Xinlei Chen
181
23
0
13 Oct 2022
A survey of Identification and mitigation of Machine Learning algorithmic biases in Image Analysis
Laurent Risser
Agustin Picard
Lucas Hervier
Jean-Michel Loubes
FaML
217
7
0
10 Oct 2022
A Review of Uncertainty Calibration in Pretrained Object Detectors
Denis Huseljic
M. Herde
Mehmet Muejde
Bernhard Sick
UQCV
140
0
0
06 Oct 2022
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers
International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2022
S. Chintalapati
Jonathan Bragg
Lucy Lu Wang
139
32
0
27 Sep 2022
A Snapshot of the Frontiers of Client Selection in Federated Learning
Gergely Németh
M. Lozano
Novi Quadrianto
Nuria Oliver
FedML
302
17
0
27 Sep 2022
Paraphrasing Is All You Need for Novel Object Captioning
Neural Information Processing Systems (NeurIPS), 2022
Cheng Yang
Yifan Hao
Wanshu Fan
Ruslan Salakhutdinov
Louis-Philippe Morency
Yu-Chiang Frank Wang
184
6
0
25 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOS
VLM
267
74
0
25 Sep 2022
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
IEEE Transactions on Image Processing (IEEE TIP), 2022
Hao Li
Jinfa Huang
Peng Jin
Guoli Song
Qi Wu
Jie Chen
376
27
0
21 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Neural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
334
218
0
20 Sep 2022
Enhance the Visual Representation via Discrete Adversarial Training
Neural Information Processing Systems (NeurIPS), 2022
Xiaofeng Mao
YueFeng Chen
Ranjie Duan
Yao Zhu
Gege Qi
Shaokai Ye
Xiaodan Li
Rong Zhang
Hui Xue
232
43
0
16 Sep 2022
VIPHY: Probing "Visible" Physical Commonsense Knowledge
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shikhar Singh
Ehsan Qasemi
Muhao Chen
254
6
0
15 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
International Conference on Learning Representations (ICLR), 2022
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLM
VLM
709
905
0
14 Sep 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
163
22
0
14 Sep 2022
Pre-training image-language transformers for open-vocabulary tasks
A. Piergiovanni
Weicheng Kuo
A. Angelova
VLM
ViT
176
12
0
09 Sep 2022
im2nerf: Image to Neural Radiance Field in the Wild
Lu Mi
Abhijit Kundu
David A. Ross
F. Dellaert
Noah Snavely
Alireza Fathi
3DV
410
14
0
08 Sep 2022
Measuring the Interpretability of Unsupervised Representations via Quantized Reverse Probing
International Conference on Learning Representations (ICLR), 2022
Iro Laina
Yuki M. Asano
Andrea Vedaldi
SSL
165
9
0
07 Sep 2022
Scalable Regularization of Scene Graph Generation Models using Symbolic Theories
Davide Buffelli
Efthymia Tsamoura
202
2
0
06 Sep 2022
Design of the topology for contrastive visual-textual alignment
Zhun Sun
376
2
0
05 Sep 2022
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
Neural Information Processing Systems (NeurIPS), 2022
Hangjie Yuan
Jianwen Jiang
Samuel Albanie
Tao Feng
Ziyuan Huang
Dong Ni
Mingqian Tang
VLM
350
76
0
05 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
232
114
0
29 Aug 2022
Labeling of Cultural Heritage Collections on the Intersection of Visual Analytics and Digital Humanities
C. Meinecke
121
3
0
29 Aug 2022
Towards Federated Learning against Noisy Labels via Local Self-Regularization
International Conference on Information and Knowledge Management (CIKM), 2022
Xue Jiang
Sheng Sun
Yuwei Wang
Min Liu
198
47
0
25 Aug 2022
Is Medieval Distant Viewing Possible? : Extending and Enriching Annotation of Legacy Image Collections using Visual Analytics
Digital Scholarship in the Humanities (DSH), 2022
C. Meinecke
Estelle Guéville
D. Wrisley
Stefan Jänicke
210
4
0
20 Aug 2022
Previous
1
2
3
...
5
6
7
...
11
12
13
Next