2304.05173
Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
11 April 2023
Ahmet Iscen
Alireza Fathi
Cordelia Schmid
VLM
3DV
Papers citing
"Improving Image Recognition by Retrieving from Web-Scale Image-Text Data"
23 / 23 papers shown
Title
LIFT+: Lightweight Fine-Tuning for Long-Tail Learning
Jiang-Xin Shi
Tong Wei
Yu-Feng Li
25
0
0
17 Apr 2025
Memory-Modular Classification: Learning to Generalize with Memory Replacement
Dahyun Kang
Ahmet Iscen
Eunchan Jo
Sua Choi
Minsu Cho
Cordelia Schmid
VLM
KELM
OffRL
24
0
0
08 Apr 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
37
0
0
21 Jan 2025
DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Rongqing Li
Jiaqi Yu
Changsheng Li
Wenhan Luo
Ye Yuan
Guoren Wang
MLAU
78
0
0
08 Dec 2024
Granularity Matters in Long-Tail Learning
Shizhen Zhao
Xin Wen
J. Liu
Chuofan Ma
C. Yuan
Xiaojuan Qi
29
0
0
21 Oct 2024
A Statistical Framework for Data-dependent Retrieval-Augmented Models
Soumya Basu
A. S. Rawat
Manzil Zaheer
RALM
41
0
0
27 Aug 2024
RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Varun Nagaraj Rao
Siddharth Choudhary
Aditya Deshpande
R. Satzoda
Srikar Appalaraju
RALM
VLM
45
4
0
27 Jun 2024
The Solution for the CVPR2024 NICE Image Captioning Challenge
Longfei Huang
Shupeng Zhong
Xiangyu Wu
Ruoxuan Li
19
0
0
19 Apr 2024
How to Benchmark Vision Foundation Models for Semantic Segmentation?
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
VLM
27
7
0
18 Apr 2024
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Mathilde Caron
Ahmet Iscen
Alireza Fathi
Cordelia Schmid
32
5
0
04 Mar 2024
Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary Task Integration
Mahapara Khurshid
Mayank Vatsa
Richa Singh
11
0
0
16 Feb 2024
SCoRe: Submodular Combinatorial Representation Learning
Anay Majee
Suraj Kothawade
Krishnateja Killamsetty
Rishabh K. Iyer
16
1
0
29 Sep 2023
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts
Jiang-Xin Shi
Tong Wei
Zhi-Hua Zhou
Jiejing Shao
Xin-Yan Han
Yu-Feng Li
24
26
0
18 Sep 2023
Cross-Modal Retrieval Meets Inference: Improving Zero-Shot Classification with Cross-Modal Retrieval
Seong-Hoon Eom
Namgyu Ho
Jaehoon Oh
Se-Young Yun
CLIP
VLM
26
0
0
29 Aug 2023
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool
Youyang Ng
Daisuke Miyashita
Yasuto Hoshi
Yasuhiro Morioka
Osamu Torii
Tomoya Kodama
J. Deguchi
RALM
8
9
0
08 Aug 2023
Retrieval-Enhanced Contrastive Vision-Text Models
Ahmet Iscen
Mathilde Caron
Alireza Fathi
Cordelia Schmid
CLIP
VLM
18
26
0
12 Jun 2023
The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition
Jingru Tan
Bo-wen Li
Xin Lu
Yongqiang Yao
F. Yu
Tong He
Wanli Ouyang
34
8
0
11 Oct 2022
A Memory Transformer Network for Incremental Learning
Ahmet Iscen
Thomas Bird
Mathilde Caron
Alireza Fathi
Cordelia Schmid
CLL
111
14
0
10 Oct 2022
Generalization Properties of Retrieval-based Models
Soumya Basu
A. S. Rawat
Manzil Zaheer
29
6
0
06 Oct 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
117
161
0
29 Sep 2022
Correlated Input-Dependent Label Noise in Large-Scale Image Classification
Mark Collier
Basil Mustafa
Efi Kokiopoulou
Rodolphe Jenatton
Jesse Berent
NoLa
176
53
0
19 May 2021
BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition
Boyan Zhou
Quan Cui
Xiu-Shen Wei
Zhao-Min Chen
243
782
0
05 Dec 2019
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,190
0
01 Sep 2014