Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.16249
Cited By
FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding
28 September 2023
Ana Ezquerro
Carlos Gómez-Rodríguez
Kevin Dela Rosa
Derek Hao Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding"
8 / 8 papers shown
Title
HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models
Aakash Tripathi
Asim Waqas
Yasin Yilmaz
Ghulam Rasool
29
5
0
13 May 2024
Unicom: Universal and Compact Representation Learning for Image Retrieval
Xiang An
Jiankang Deng
Kaicheng Yang
Jaiwei Li
Ziyong Feng
Jia Guo
Jing Yang
Tongliang Liu
VLM
SSL
30
26
0
12 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
259
4,223
0
30 Jan 2023
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
295
5,761
0
29 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
247
36,237
0
25 Aug 2016
1