2303.02151
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
3 March 2023
Renrui Zhang
Xiangfei Hu
Bohao Li
Siyuan Huang
Hanqiu Deng
Hongsheng Li
Yu Qiao
Peng Gao
VLM
MLLM
Papers citing
"Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners"
50 / 145 papers shown
Title
OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning
Cong Hua
Qianqian Xu
Zhiyong Yang
Zitai Wang
Shilong Bao
Qingming Huang
VLM
48
0
0
08 May 2025
GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Liangyu Xu
Yingxiu Zhao
J. Wang
Yingyao Wang
Bu Pi
...
Jihao Gu
X. Li
Xiaoyong Zhu
Jun Song
Bo Zheng
LRM
82
1
0
17 Apr 2025
Self-Evolving Visual Concept Library using Vision-Language Critics
Atharva Sehgal
Patrick Yuan
Ziniu Hu
Yisong Yue
Jennifer J. Sun
Swarat Chaudhuri
VLM
45
0
0
31 Mar 2025
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Fanding Huang
Jingyan Jiang
Qinting Jiang
Hebei Li
Faisal Nadeem Khan
Zhi Wang
VLM
43
0
0
30 Mar 2025
VTD-CLIP: Video-to-Text Discretization via Prompting CLIP
Wencheng Zhu
Yuexin Wang
Hongxuan Li
Pengfei Zhu
Q. Hu
CLIP
48
0
0
24 Mar 2025
FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation
Dong Zhao
Jinlong Li
Shuang Wang
Mengyao Wu
Qi Zang
N. Sebe
Zhun Zhong
60
0
0
23 Mar 2025
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance
Hui Liu
Wenya Wang
Kecheng Chen
Jie Liu
Yibing Liu
Tiexin Qin
Peisong He
Xinghao Jiang
Haoliang Li
BDL
VLM
78
0
0
20 Mar 2025
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Haoyang Li
Liang Wang
Chao Wang
Jing Jiang
Yan Peng
Guodong Long
VLM
64
1
0
17 Mar 2025
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
Hongyu Sun
Qiuhong Ke
Ming Cheng
Y. Wang
Deying Li
Chenhui Gou
Jianfei Cai
3DPC
87
0
0
15 Mar 2025
Modeling Variants of Prompts for Vision-Language Models
Ao Li
Zongfang Liu
Xinhua Li
Jinghui Zhang
Pengwei Wang
Hu Wang
VLM
45
0
0
13 Mar 2025
CAPT: Class-Aware Prompt Tuning for Federated Long-Tailed Learning with Vision-Language Model
Shihao Hou
Xinyi Shang
Shreyank N Gowda
Yang Lu
Chao-Xiang Wu
Yan Yan
Hanzi Wang
VLM
50
0
0
10 Mar 2025
Adapting OpenAI's CLIP Model for Few-Shot Image Inspection in Manufacturing Quality Control: An Expository Case Study with Multiple Application Examples
F. Megahed
Ying-Ju Chen
B. Colosimo
M. Grasso
L. Allison Jones-Farmer
S. Knoth
Hongyue Sun
I. Zwetsloot
AAML
VLM
60
0
0
22 Jan 2025
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models
Yassir Bendou
Amine Ouasfi
Vincent Gripon
A. Boukhayma
VLM
51
0
0
19 Jan 2025
Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis
Hongyu Sun
Qiuhong Ke
Y. Wang
Wang Chen
Kang Yang
Deying Li
Jianfei Cai
3DPC
67
3
0
17 Jan 2025
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Arnav M. Das
Gantavya Bhatt
Lilly Kumari
Sahil Verma
J. Bilmes
29
0
0
23 Dec 2024
Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition
Ethan Baron
Idan Tankel
Peter Tu
Guy Ben-Yosef
VLM
72
0
0
18 Dec 2024
Prompt Categories Cluster for Weakly Supervised Semantic Segmentation
Wangyu Wu
Xianglin Qiu
Siqi Song
Xiaowei Huang
Fei Ma
Jimin Xiao
VLM
62
4
0
18 Dec 2024
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels
Shizhuo Deng
Bowen Han
Jiaqi Chen
Hao Wang
Dongyue Chen
Tong Jia
VLM
NoLa
69
0
0
17 Dec 2024
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
Yayuan Li
Jintao Guo
Lei Qi
Wenbin Li
Yinghuan Shi
VLM
CLIP
74
0
0
16 Dec 2024
Prompt as Free Lunch: Enhancing Diversity in Source-Free Cross-domain Few-shot Learning through Semantic-Guided Prompting
Linhai Zhuo
Zheng Wang
Yuqian Fu
Tianwen Qian
VLM
69
1
0
01 Dec 2024
FodFoM: Fake Outlier Data by Foundation Models Creates Stronger Visual Out-of-Distribution Detector
Jiankang Chen
Ling Deng
Zhiyong Gan
Wei-Shi Zheng
Ruixuan Wang
OODD
74
0
0
22 Nov 2024
PyGen: A Collaborative Human-AI Approach to Python Package Creation
Saikat Barua
Mostafizur Rahman
Md Jafor Sadek
Rafiul Islam
Shehnaz Khaled
Md. Shohrab Hossain
44
1
0
13 Nov 2024
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization
Wanhua Li
Zibin Meng
Jiawei Zhou
D. Wei
Chuang Gan
Hanspeter Pfister
LRM
VLM
22
5
0
28 Oct 2024
Scene Graph Generation with Role-Playing Large Language Models
Guikun Chen
Jin Li
Wenguan Wang
VLM
40
5
0
20 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
21
0
0
16 Oct 2024
FLIER: Few-shot Language Image Models Embedded with Latent Representations
Zhinuo Zhou
Peng Zhou
Xiaoyong Pan
VLM
19
0
0
10 Oct 2024
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta
Shreyas Verma
Ujjwala Anantheswaran
Kevin Scaria
Mihir Parmar
Swaroop Mishra
Chitta Baral
ReLM
LRM
24
4
0
06 Oct 2024
FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification
Kexue Fu
Xiaoyuan Luo
Linhao Qu
Shuo Wang
Ying Xiong
Ilias Maglogiannis
Longxiang Gao
Manning Wang
26
1
0
29 Sep 2024
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification
Ming Li
J. Zhong
Chenxin Li
Liuzhuozheng Li
Nie Lin
Masashi Sugiyama
CLIP
VLM
18
2
0
25 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
34
1
0
19 Sep 2024
Knowledge Adaptation Network for Few-Shot Class-Incremental Learning
Ye Wang
Yaxiong Wang
Guoshuai Zhao
Xueming Qian
CLL
24
1
0
18 Sep 2024
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Yubin Wang
Xinyang Jiang
De Cheng
Wenli Sun
Dongsheng Li
Cairong Zhao
VLM
27
0
0
27 Aug 2024
DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models
Eman Ali
Sathira Silva
Muhammad Haris Khan
VLM
16
0
0
16 Aug 2024
Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
Yixin Guo
Yu Liu
Jianghao Li
Weimin Wang
Qi Jia
VLM
27
2
0
12 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Y. Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
32
2
0
05 Aug 2024
Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition
Jinfu Liu
C. L. P. Chen
Mengyuan Liu
42
11
0
22 Jul 2024
Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation
Jinda Lu
Shuo Wang
Yanbin Hao
Haifeng Liu
Xiang Wang
Meng Wang
28
2
0
19 Jul 2024
Robust Calibration of Large Vision-Language Adapters
Balamurali Murugesan
Julio Silva-Rodríguez
Ismail Ben Ayed
Jose Dolz
OODD
VLM
24
6
0
18 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
31
2
0
11 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
39
7
0
05 Jul 2024
EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
Chenxin Li
Brandon Yushan Feng
Yifan Liu
Hengyu Liu
Cheng Wang
Weihao Yu
Yixuan Yuan
3DGS
19
13
0
01 Jul 2024
Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval
Hanwen Su
G. Song
K. Huang
Jiyan Wang
Ming Yang
39
1
0
01 Jul 2024
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
Tian Liu
Huixin Zhang
Shubham Parashar
Shu Kong
19
2
0
17 Jun 2024
Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
Minho Park
S. Park
Jooyeol Yun
Jaegul Choo
VLM
22
0
0
08 Jun 2024
Generative Active Learning for Long-tailed Instance Segmentation
Muzhi Zhu
Chengxiang Fan
Hao Chen
Y. Liu
Weian Mao
Xiaogang Xu
Chunhua Shen
35
5
0
04 Jun 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
43
3
0
28 May 2024
Prompt Learning for Generalized Vehicle Routing
Fei Liu
Xi Lin
Weiduo Liao
Zhenkun Wang
Qingfu Zhang
Xialiang Tong
Mingxuan Yuan
VLM
29
0
0
20 May 2024
Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation
Sangyeop Yeo
Yoojin Jang
Jaejun Yoo
19
1
0
19 May 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
Chengxiang Fan
Muzhi Zhu
Hao Chen
Yang Liu
Weijia Wu
Huaqi Zhang
Chunhua Shen
DiffM
49
11
0
16 May 2024
A Survey of Few-Shot Learning for Biomedical Time Series
Chenqi Li
Timothy Denison
Tingting Zhu
16
1
0
03 May 2024