Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
2305.14160
Cited By
v1
v2
v3
v4 (latest)
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
23 May 2023
Lean Wang
Lei Li
Damai Dai
Deli Chen
Hao Zhou
Fandong Meng
Jie Zhou
Xu Sun
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning"
50 / 102 papers shown
Rethinking Associative Memory Mechanism in Induction Head
Shuo Wang
Issei Sato
429
0
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
879
7
0
14 Dec 2024
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Feihu Che
Zengqi Wen
Jianhua Tao
Jianhua Tao
LRM
ReLM
551
34
0
27 Nov 2024
Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
Zeping Yu
Sophia Ananiadou
1.1K
8
0
17 Nov 2024
Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models
Yue Li
Zhixue Zhao
Carolina Scarton
252
1
0
24 Oct 2024
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
International Conference on Learning Representations (ICLR), 2024
Chenxi Wang
Xiang Chen
Ningyu Zhang
Bozhong Tian
Haoming Xu
Shumin Deng
Ningyu Zhang
MLLM
LRM
785
49
0
15 Oct 2024
Can In-context Learning Really Generalize to Out-of-distribution Tasks?
International Conference on Learning Representations (ICLR), 2024
Qixun Wang
Yifei Wang
Yisen Wang
Xianghua Ying
OOD
279
15
0
13 Oct 2024
MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
International Conference on Learning Representations (ICLR), 2024
Jiachun Li
Pengfei Cao
Zhuoran Jin
Yubo Chen
Kang Liu
Jun Zhao
LRM
ELM
359
13
0
12 Oct 2024
Temporal Reasoning Transfer from Text to Video
International Conference on Learning Representations (ICLR), 2024
Lei Li
Yuanxin Liu
Linli Yao
Peiyuan Zhang
Chenxin An
Lean Wang
Xu Sun
Dianbo Sui
Qi Liu
LRM
179
20
0
08 Oct 2024
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yongheng Zhang
Qiguang Chen
Jingxuan Zhou
Peng Wang
Jiasheng Si
Jin Wang
Wenpeng Lu
Libo Qin
LRM
359
14
0
06 Oct 2024
Self-Powered LLM Modality Expansion for Large Speech-Text Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tengfei Yu
Xuebo Liu
Zhiyi Hou
Liang Ding
Dacheng Tao
Min Zhang
218
5
0
04 Oct 2024
Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint
European Conference on Computer Vision (ECCV), 2024
Sixiang Chen
Tian-Chun Ye
Lucas Beerens
Zhaohu Xing
Yunlong Lin
Lei Zhu
DiffM
203
9
0
24 Sep 2024
Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding
Tianqiao Liu
Zui Chen
Zitao Liu
Mi Tian
Weiqi Luo
LRM
134
9
0
13 Sep 2024
From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning
International Conference on Machine Learning (ICML), 2024
Wei Chen
Zhen Huang
Liang Xie
Binbin Lin
Houqiang Li
...
Deng Cai
Yonggang Zhang
Wenxiao Wang
Xu Shen
Jieping Ye
336
32
0
03 Sep 2024
EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model
Feipeng Ma
Yizhou Zhou
Hebei Li
Zilong He
Siying Wu
Fengyun Rao
Siying Wu
Fengyun Rao
Yueyi Zhang
Xiaoyan Sun
458
10
0
21 Aug 2024
Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions
Chenming Tang
Zhixiang Wang
Hao Sun
Yunfang Wu
LRM
492
1
0
16 Aug 2024
Label Words as Local Task Vectors in In-Context Learning
Bowen Zheng
Ming Ma
Zhongqiao Lin
Tianming Yang
238
4
0
23 Jun 2024
Learnable In-Context Vector for Visual Question Answering
Neural Information Processing Systems (NeurIPS), 2024
Yingzhe Peng
Chenduo Hao
Xu Yang
Jiawei Peng
Xinting Hu
Xin Geng
235
5
0
19 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Neural Information Processing Systems (NeurIPS), 2024
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
421
8
0
15 Jun 2024
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhenhong Zhou
Haiyang Yu
Xinghua Zhang
Rongwu Xu
Fei Huang
Yongbin Li
371
75
0
09 Jun 2024
Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective
Xinhao Yao
Xiaolin Hu
Shenzhi Yang
Yong Liu
240
3
0
06 Jun 2024
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Zefan Cai
Yichi Zhang
Bofei Gao
Yuliang Liu
Yongqian Li
...
Wayne Xiong
Yue Dong
Baobao Chang
Junjie Hu
Wen Xiao
681
179
0
04 Jun 2024
UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation
Hanzhang Zhou
Zijian Feng
Zixiao Zhu
Junlang Qian
Kezhi Mao
276
25
0
31 May 2024
Implicit In-context Learning
International Conference on Learning Representations (ICLR), 2024
Zhuowei Li
Zihao Xu
Ligong Han
Yunhe Gao
Song Wen
Di Liu
Hao Wang
Dimitris N. Metaxas
355
8
0
23 May 2024
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
Guochao Jiang
Zepeng Ding
Yuchen Shi
Deqing Yang
309
8
0
08 May 2024
Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Tianhui Zhang
Bei Peng
Danushka Bollegala
LRM
146
16
0
25 Apr 2024
Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Shaomu Tan
Di Wu
Christof Monz
MoMe
301
21
0
17 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
391
47
0
01 Apr 2024
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yongquan He
Wenyuan Zhang
Xuancheng Huang
Peng Zhang
Lingxun Meng
Jialin Li
Wenyuan Zhang
Yifu Gao
CLL
ALM
488
7
0
15 Mar 2024
Not All Layers of LLMs Are Necessary During Inference
Siqi Fan
Xin Jiang
Xiang Li
Xuying Meng
Peng Han
Shuo Shang
Aixin Sun
Yequan Wang
Zhongyuan Wang
431
70
0
04 Mar 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Daojian Zeng
Kang Liu
Jun Zhao
LRM
274
17
0
28 Feb 2024
Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models
Ercong Nie
Shuzhou Yuan
Bolei Ma
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
ReLM
530
8
0
28 Feb 2024
Large Language Models Can Better Understand Knowledge Graphs Than We Thought
Xinbang Dai
Yuncheng Hua
Tongtong Wu
Yang Sheng
Qiu Ji
Guilin Qi
425
13
0
18 Feb 2024
Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models
Zihao Lin
Mohammad Beigi
Hongxuan Li
Jiuxiang Gu
Yuxiang Zhang
Qifan Wang
Wenpeng Yin
Lifu Huang
KELM
166
10
0
16 Feb 2024
Do LLMs Know about Hallucination? An Empirical Investigation of LLM's Hidden States
Hanyu Duan
Yi Yang
Kar Yan Tam
HILM
174
48
0
15 Feb 2024
Universal Link Predictor By In-Context Learning on Graphs
Kaiwen Dong
Haitao Mao
Zhichun Guo
Nitesh Chawla
226
6
0
12 Feb 2024
NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning
Yufeng Zhao
Yoshihiro Sakai
Naoya Inoue
286
7
0
08 Feb 2024
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zeping Yu
Sophia Ananiadou
237
17
0
05 Feb 2024
Revisiting Demonstration Selection Strategies in In-Context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Keqin Peng
Liang Ding
Yancheng Yuan
Xuebo Liu
Min Zhang
Y. Ouyang
Dacheng Tao
251
57
0
22 Jan 2024
Anchor function: a type of benchmark functions for studying language models
Zhongwang Zhang
Zhiwei Wang
Junjie Yao
Zhangchen Zhou
Xiaolong Li
E. Weinan
Z. Xu
336
9
0
16 Jan 2024
WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge
ACM Multimedia (MM), 2024
Wenbin Wang
Liang Ding
Li Shen
Yong Luo
Han Hu
Dacheng Tao
226
30
0
12 Jan 2024
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang
Shuibai Zhang
Zhuohao Yu
Guangsheng Bao
Yidong Wang
...
Ruochen Xu
Weirong Ye
Xing Xie
Weizhu Chen
Yue Zhang
388
25
0
26 Dec 2023
Neuron-Level Knowledge Attribution in Large Language Models
Zeping Yu
Sophia Ananiadou
FAtt
KELM
290
28
0
19 Dec 2023
One-Shot Learning as Instruction Data Prospector for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunshui Li
Binyuan Hui
Xiaobo Xia
Jiaxi Yang
Min Yang
...
Ling-Hao Chen
Junhao Liu
Tongliang Liu
Fei Huang
Yongbin Li
361
46
0
16 Dec 2023
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning
Xijie Huang
Li Lyna Zhang
Kwang-Ting Cheng
Fan Yang
Mao Yang
LRM
ReLM
301
16
0
14 Dec 2023
Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Yuhan Chen
Ang Lv
Ting-En Lin
Cai Chen
Yuchuan Wu
Fei Huang
Yongbin Li
Rui Yan
230
39
0
07 Dec 2023
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Computer Vision and Pattern Recognition (CVPR), 2023
Qidong Huang
Xiao-wen Dong
Pan Zhang
Sijin Yu
Conghui He
Yuan Liu
Dahua Lin
Weiming Zhang
Neng H. Yu
MLLM
447
356
0
29 Nov 2023
Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning
Kazuma Hashimoto
K. Raman
Michael Bendersky
368
2
0
16 Nov 2023
Evaluating, Understanding, and Improving Constrained Text Generation for Large Language Models
Xiang Chen
Xiaojun Wan
175
2
0
25 Oct 2023
Function Vectors in Large Language Models
International Conference on Learning Representations (ICLR), 2023
Eric Todd
Millicent Li
Arnab Sen Sharma
Aaron Mueller
Byron C. Wallace
David Bau
311
182
0
23 Oct 2023
Previous
1
2
3
Next