2503.04839
Cited By
v1
v2 (latest)
Advancing Multimodal In-Context Learning in Large Vision-Language Models with Task-aware Demonstrations
5 March 2025
Yanshu Li
Papers citing
"Advancing Multimodal In-Context Learning in Large Vision-Language Models with Task-aware Demonstrations"
35 / 35 papers shown
DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action
Zhen Fang
Zhuoyang Liu
Jiaming Liu
Hao Chen
Y. Zeng
Shiting Huang
Zehui Chen
L. Chen
Shanghang Zhang
Feng Zhao
LRM
151
4
0
27 Nov 2025
TACO: Enhancing Multimodal In-context Learning via Task Mapping-Guided Sequence Configuration
Yanshu Li
Tian Yun
Pinyuan Feng
Jinfa Huang
Ruixiang Tang
566
29
0
21 May 2025
Make LVLMs Focus: Context-Aware Attention Modulation for Better Multimodal In-Context Learning
Yanshu Li
JianJiang Yang
Ziteng Yang
Bozheng Li
Yi Cao
...
Ligong Han
Yingjie Victor Chen
Songlin Fei
Dongfang Liu
Ruixiang Tang
488
8
0
21 May 2025
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang
Chancharik Mitra
Assaf Arbelle
Leonid Karlinsky
Trevor Darrell
Roei Herzig
294
44
0
21 Jun 2024
ICLEval: Evaluating In-Context Learning Ability of Large Language Models
Wentong Chen
Yankai Lin
ZhenHao Zhou
HongYun Huang
Yantao Jia
Bo Zhao
Ji-Rong Wen
ELM
293
11
0
21 Jun 2024
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Zhe Chen
Weiyun Wang
Hao Tian
Shenglong Ye
Zhangwei Gao
...
Tong Lu
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
MLLM
VLM
667
1,136
0
25 Apr 2024
Visual In-Context Learning for Large Vision-Language Models
Yucheng Zhou
Xiang Li
Qianning Wang
Jianbing Shen
MLLM
276
134
0
18 Feb 2024
Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Caoyun Fan
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
338
6
0
12 Dec 2023
How to Configure Good In-Context Sequence for Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2023
Li Li
Jiawei Peng
Huiyi Chen
Chongyang Gao
Xu Yang
MLLM
303
40
0
04 Dec 2023
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model
Neural Information Processing Systems (NeurIPS), 2023
Cheng Cheng
Lin Song
Ruoyi Xue
Hang Wang
Hongbin Sun
Yixiao Ge
Ying Shan
VLM
ObjD
580
56
0
07 Nov 2023
An Early Evaluation of GPT-4V(ision)
Yang Wu
Shilong Wang
Hao Yang
Tian Zheng
Hongbo Zhang
Yanyan Zhao
Bing Qin
MLLM
ELM
219
49
0
25 Oct 2023
Multimodal Neurons in Pretrained Text-Only Transformers
Sarah Schwettmann
Neil Chowdhury
Samuel J. Klein
David Bau
Antonio Torralba
MILM
386
50
0
03 Aug 2023
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Anas Awadalla
Irena Gao
Josh Gardner
Jack Hessel
Yusuf Hanafy
...
Simon Kornblith
Pang Wei Koh
Gabriel Ilharco
Mitchell Wortsman
Ludwig Schmidt
MLLM
462
591
0
02 Aug 2023
Learning to Retrieve In-Context Examples for Large Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Liang Wang
Nan Yang
Furu Wei
RALM
276
64
0
14 Jul 2023
Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chenglei Si
Dan Friedman
Nitish Joshi
Shi Feng
Danqi Chen
He He
386
65
0
22 May 2023
What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jane Pan
Tianyu Gao
Howard Chen
Danqi Chen
243
167
0
16 May 2023
Larger language models do in-context learning differently
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
...
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
ReLM
LRM
553
461
0
07 Mar 2023
Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zhiyong Wu
Yaoxiang Wang
Jiacheng Ye
Lingpeng Kong
472
199
0
20 Dec 2022
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xinxi Lyu
Sewon Min
Iz Beltagy
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
240
82
0
19 Dec 2022
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes
Neural Information Processing Systems (NeurIPS), 2022
Shivam Garg
Dimitris Tsipras
Abigail Z. Jacobs
Gregory Valiant
777
734
0
01 Aug 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Denny Zhou
Nathanael Schärli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALM
LRM
AI4CE
929
1,672
0
21 May 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAG
LRM
680
1,949
0
25 Feb 2022
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
ACM Computing Surveys (CSUR), 2021
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
913
5,235
0
28 Jul 2021
Multimodal Few-Shot Learning with Frozen Language Models
Neural Information Processing Systems (NeurIPS), 2021
Maria Tsimpoukelli
Jacob Menick
Serkan Cabi
S. M. Ali Eslami
Oriol Vinyals
Felix Hill
MLLM
700
951
0
25 Jun 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILaw
LRM
1.2K
1,479
0
18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
1.6K
5,344
0
18 Apr 2021
What Makes Good In-Context Examples for GPT-3?
Workshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2021
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
669
1,702
0
17 Jan 2021
Making Pre-trained Language Models Better Few-shot Learners
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Tianyu Gao
Adam Fisch
Danqi Chen
945
2,248
0
31 Dec 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
2.4K
56,453
0
28 May 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
514
853
0
10 May 2020
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
Computer Vision and Pattern Recognition (CVPR), 2019
Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
854
1,493
0
31 May 2019
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
972
1,165
0
22 Feb 2018
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
1.5K
4,101
0
02 Dec 2016
CIDEr: Consensus-based Image Description Evaluation
Computer Vision and Pattern Recognition (CVPR), 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
947
5,370
0
20 Nov 2014
Microsoft COCO: Common Objects in Context
European Conference on Computer Vision (ECCV), 2014
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
27.3K
51,996
0
01 May 2014