What Makes Good Few-shot Examples for Vision-Language Models?
