Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
Can Transformers Capture Spatial Relations between Objects?
Chuan Wen
Dinesh Jayaraman
Yang Gao
ViT
178
8
0
01 Mar 2024
PLReMix: Combating Noisy Labels with Pseudo-Label Relaxed Contrastive Representation Learning
Xiaoyu Liu
Beitong Zhou
Cheng Cheng
235
6
0
27 Feb 2024
Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models
Shaeke Salman
M. Shams
Xiuwen Liu
Lingjiong Zhu
VLM
169
3
0
13 Feb 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Jiaming Song
Yu Qiao
Shiyang Feng
MLLM
512
139
0
08 Feb 2024
Locally-Adaptive Quantization for Streaming Vector Search
Cecilia Aguerrebere
Mark Hildebrand
Ishwar Bhati
Ted Willke
Mariano Tepper
308
17
0
03 Feb 2024
Category-wise Fine-Tuning: Resisting Incorrect Pseudo-Labels in Multi-Label Image Classification with Partial Labels
Chak Fong Chong
Xinyi Fang
Jielong Guo
Yapeng Wang
Wei Ke
C. Lam
Sio-Kei Im
221
3
0
30 Jan 2024
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding
Yatong Bai
Utsav Garg
Apaar Shanker
Haoming Zhang
Samyak Parajuli
...
Eugenia D Fomitcheva
E. Branson
Aerin Kim
Somayeh Sojoudi
Kyunghyun Cho
191
2
0
09 Jan 2024
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification
Xueling Zhu
Jian Liu
Dongqi Tang
Jiawei Ge
Weijia Liu
Bo Liu
Jiuxin Cao
VLM
207
1
0
02 Jan 2024
Amodal Completion via Progressive Mixed Context Diffusion
Katherine Xu
Lingzhi Zhang
Jianbo Shi
DiffM
245
35
0
24 Dec 2023
Bayesian Transfer Learning
Piotr M. Suder
Jason Xu
David B. Dunson
252
11
0
20 Dec 2023
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li
Guanbin Li
Hui Cheng
Zicheng Liao
Yizhou Yu
FedML
228
25
0
19 Dec 2023
Painterly Image Harmonization by Learning from Painterly Objects
AAAI Conference on Artificial Intelligence (AAAI), 2023
Li Niu
Junyan Cao
Yan Hong
Liqing Zhang
199
1
0
15 Dec 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
Neural Information Processing Systems (NeurIPS), 2023
Jinho Park
Jack Hessel
Khyathi Chandu
Paul Pu Liang
Ximing Lu
...
Youngjae Yu
Qiuyuan Huang
Jianfeng Gao
Ali Farhadi
Yejin Choi
VLM
269
13
0
08 Dec 2023
Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhipeng Du
Miaojing Shi
Jiankang Deng
ObjD
327
42
0
02 Dec 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J Taylor
268
12
0
21 Nov 2023
SniffyArt: The Dataset of Smelling Persons
Mathias Zinnen
Azhar Hussian
Hang Tran
Prathmesh Madhu
Andreas Maier
Vincent Christlein
188
11
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Computer Vision and Pattern Recognition (CVPR), 2023
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
392
383
0
10 Nov 2023
Exploring Dataset-Scale Indicators of Data Quality
Ben Feuer
Chinmay Hegde
193
1
0
07 Nov 2023
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images
International Journal of Remote Sensing (IJRS), 2023
A. Silva
H. Felix
Franscisco Paulo Magalhaes Simoes
Veronica Teichrieb
Michel Mozinho dos Santos
H. Santiago
V. Sgotti
H. B. D. T. L. Neto
367
35
0
02 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion (Inf. Fusion), 2023
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
399
71
0
01 Nov 2023
Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Minxing Zhang
Ning Yu
Rui Wen
Michael Backes
Yang Zhang
DiffM
191
30
0
30 Oct 2023
Open-Set Image Tagging with Multi-Grained Text Supervision
Xinyu Huang
Yi-Jie Huang
Youcai Zhang
Weiwei Tian
Rui Feng
Yuejie Zhang
Yanchun Xie
Yaqian Li
Lei Zhang
VLM
248
63
0
23 Oct 2023
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Chunlei Wang
Wenquan Feng
Xiangtai Li
Guangliang Cheng
Shuchang Lyu
Binghao Liu
Lijiang Chen
Qi Zhao
ObjD
VLM
269
14
0
22 Oct 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
276
9
0
20 Oct 2023
Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models
ACM Computing Surveys (ACM Comput. Surv.), 2023
Zhaozheng Chen
Qianru Sun
VLM
426
24
0
19 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
IEEE International Conference on Computer Vision (ICCV), 2023
Chengyang Zhao
Songlin Yang
Zhenfang Chen
Mingyu Ding
Chuang Gan
388
23
0
10 Oct 2023
Lightweight In-Context Tuning for Multimodal Unified Models
Yixin Chen
Shuai Zhang
Boran Han
Jiaya Jia
144
5
0
08 Oct 2023
Automatic and Efficient Customization of Neural Networks for ML Applications
Yuhan Liu
Chengcheng Wan
Kuntai Du
Henry Hoffmann
Junchen Jiang
Shan Lu
Michael Maire
133
1
0
07 Oct 2023
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation
Neural Information Processing Systems (NeurIPS), 2023
Naitik Khandelwal
Xiao Liu
Mengmi Zhang
CLL
297
1
0
02 Oct 2023
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu
Jiangtong Li
Bo Zhang
Li Niu
DiffM
237
15
0
27 Sep 2023
A Survey on Image-text Multimodal Models
Ruifeng Guo
Jingxuan Wei
Linzhuang Sun
Khai-Nguyen Nguyen
Guiyong Chang
Dawei Liu
Sibo Zhang
Zhengbing Yao
Mingjun Xu
Liping Bu
VLM
320
22
0
23 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
International Journal of Computer Vision (IJCV), 2023
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
312
45
0
22 Sep 2023
ReShader: View-Dependent Highlights for Single Image View-Synthesis
ACM Transactions on Graphics (TOG), 2023
Avinash Paliwal
Brandon Nguyen
Andrii Tsarov
N. Kalantari
346
3
0
19 Sep 2023
AdSEE: Investigating the Impact of Image Style Editing on Advertisement Attractiveness
Knowledge Discovery and Data Mining (KDD), 2023
Liyao Jiang
Chenglin Li
Haolan Chen
Xiao-Rong Gao
Xinwang Zhong
Yang Qiu
Shani Ye
Di Niu
117
0
0
15 Sep 2023
Collecting Visually-Grounded Dialogue with A Game Of Sorts
International Conference on Language Resources and Evaluation (LREC), 2023
Bram Willemsen
Dmytro Kalpakchi
Gabriel Skantze
114
2
0
10 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Computer Vision and Pattern Recognition (CVPR), 2023
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng Zhang
Han Hu
DongDong Chen
Baining Guo
DiffM
VLM
299
158
0
07 Sep 2023
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
IEEE International Conference on Computer Vision (ICCV), 2023
Ting Lei
Fabian Caba
Qingchao Chen
Hailin Jin
Yuxin Peng
Yang Liu
VLM
247
46
0
07 Sep 2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
L. Yu
Bowen Shi
Ramakanth Pasunuru
Benjamin Muller
O. Yu. Golovneva
...
Yaniv Taigman
Maryam Fazel-Zarandi
Asli Celikyilmaz
Luke Zettlemoyer
Armen Aghajanyan
MLLM
267
162
0
05 Sep 2023
FACET: Fairness in Computer Vision Evaluation Benchmark
IEEE International Conference on Computer Vision (ICCV), 2023
Laura Gustafson
Chloe Rolland
Nikhila Ravi
Quentin Duval
Aaron B. Adcock
Cheng-Yang Fu
Melissa Hall
Candace Ross
VLM
EGVM
351
59
0
31 Aug 2023
Separate and Locate: Rethink the Text in Text-based Visual Question Answering
ACM Multimedia (ACM MM), 2023
Chengyang Fang
Jiangnan Li
Liang Li
Can Ma
Dayong Hu
278
18
0
31 Aug 2023
SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ziyan Yang
Kushal Kafle
Zhe Lin
Scott D. Cohen
Zhihong Ding
Vicente Ordonez
252
1
0
24 Aug 2023
HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks
Zichao Dong
Weikun Zhang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
VLM
91
6
0
24 Aug 2023
Seeing the Intangible: Survey of Image Classification into High-Level and Abstract Categories
Delfina Sol Martinez Pandiani
Valentina Presutti
206
5
0
21 Aug 2023
ControlCom: Controllable Image Composition using Diffusion Model
Bo Zhang
Yuxuan Duan
Jun Lan
Y. Hong
Huijia Zhu
Weiqiang Wang
Li Niu
DiffM
230
47
0
19 Aug 2023
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
IEEE International Conference on Computer Vision (ICCV), 2023
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Samuel Albanie
Yining Pan
Tao Feng
Jianwen Jiang
Dong Ni
Yingya Zhang
Deli Zhao
VLM
244
61
0
18 Aug 2023
DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels
Soumadeep Saha
Utpal Garain
Arijit Ukil
A. Pal
Sundeep Khandelwal
167
1
0
09 Aug 2023
Foreground Object Search by Distilling Composite Image Feature
IEEE International Conference on Computer Vision (ICCV), 2023
Bo Zhang
Jiacheng Sui
Li Niu
251
7
0
09 Aug 2023
Which Tokens to Use? Investigating Token Reduction in Vision Transformers
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
ViT
274
63
0
09 Aug 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
Chinmay Hegde
OOD
249
2
0
07 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
ACM Multimedia (ACM MM), 2023
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
175
7
0
04 Aug 2023
Previous
1
2
3
4
5
6
...
11
12
13
Next