v1v2v3v4 (latest)

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

23 May 2023

Lei Li

Jie Zhou

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning"

50 / 102 papers shown

Rethinking Associative Memory Mechanism in Induction Head

Shuo Wang

Issei Sato

429

16 Dec 2024

Video Diffusion Transformers are In-Context Learners

879

14 Dec 2024

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

551

27 Nov 2024

Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering

Zeping Yu

Sophia Ananiadou

1.1K

17 Nov 2024

Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models

Yue Li

Zhixue Zhao

Carolina Scarton

252

24 Oct 2024

MLLM can see? Dynamic Correction Decoding for Hallucination MitigationInternational Conference on Learning Representations (ICLR), 2024

785

15 Oct 2024

Can In-context Learning Really Generalize to Out-of-distribution Tasks?International Conference on Learning Representations (ICLR), 2024

Yisen Wang

279

13 Oct 2024

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language ModelsInternational Conference on Learning Representations (ICLR), 2024

359

12 Oct 2024

Temporal Reasoning Transfer from Text to VideoInternational Conference on Learning Representations (ICLR), 2024

Lei Li

Chenxin An

Xu Sun

Qi Liu

179

08 Oct 2024

Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong InformationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Yongheng Zhang

Jingxuan Zhou

Libo Qin

359

06 Oct 2024

Self-Powered LLM Modality Expansion for Large Speech-Text ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tengfei Yu

Xuebo Liu

Zhiyi Hou

Liang Ding

Dacheng Tao

Min Zhang

218

04 Oct 2024

Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything ConstraintEuropean Conference on Computer Vision (ECCV), 2024

Sixiang Chen

Tian-Chun Ye

Lucas Beerens

Zhaohu Xing

Yunlong Lin

Lei Zhu

DiffM

203

24 Sep 2024

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

134

13 Sep 2024

From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint TuningInternational Conference on Machine Learning (ICML), 2024

Wei Chen

Zhen Huang

Liang Xie

Binbin Lin

Houqiang Li

...

Deng Cai

Yonggang Zhang

Wenxiao Wang

Xu Shen

Jieping Ye

336

03 Sep 2024

EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model

Yizhou Zhou

Siying Wu

Fengyun Rao

Yueyi Zhang

Xiaoyan Sun

458

21 Aug 2024

Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions

492

16 Aug 2024

Label Words as Local Task Vectors in In-Context Learning

238

23 Jun 2024

Learnable In-Context Vector for Visual Question AnsweringNeural Information Processing Systems (NeurIPS), 2024

Xu Yang

235

19 Jun 2024

Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language ModelsNeural Information Processing Systems (NeurIPS), 2024

421

15 Jun 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden StatesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zhenhong Zhou

Haiyang Yu

Xinghua Zhang

Rongwu Xu

Fei Huang

Yongbin Li

371

09 Jun 2024

Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective

240

06 Jun 2024

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

...

681

179

04 Jun 2024

UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation

276

31 May 2024

Implicit In-context LearningInternational Conference on Learning Representations (ICLR), 2024

Di Liu

355

23 May 2024

P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models

Guochao Jiang

Zepeng Ding

Yuchen Shi

Deqing Yang

309

08 May 2024

Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning

146

25 Apr 2024

Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation

Shaomu Tan

Di Wu

Christof Monz

MoMe

301

17 Apr 2024

Efficient Prompting Methods for Large Language Models: A Survey

Jingbo Zhu

391

01 Apr 2024

Don't Half-listen: Capturing Key-part Information in Continual Instruction TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

488

15 Mar 2024

Not All Layers of LLMs Are Necessary During Inference

Siqi Fan

Xin Jiang

Xiang Li

Yequan Wang

431

04 Mar 2024

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning

Kang Liu

Jun Zhao

LRM

274

28 Feb 2024

Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models

530

28 Feb 2024

Large Language Models Can Better Understand Knowledge Graphs Than We Thought

425

18 Feb 2024

Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models

Yuxiang Zhang

166

16 Feb 2024

Do LLMs Know about Hallucination? An Empirical Investigation of LLM's Hidden States

Hanyu Duan

Yi Yang

Kar Yan Tam

HILM

174

15 Feb 2024

Universal Link Predictor By In-Context Learning on Graphs

226

12 Feb 2024

NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning

Yufeng Zhao

Yoshihiro Sakai

Naoya Inoue

286

08 Feb 2024

How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zeping Yu

Sophia Ananiadou

237

05 Feb 2024

Revisiting Demonstration Selection Strategies in In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Liang Ding

Min Zhang

251

22 Jan 2024

Anchor function: a type of benchmark functions for studying language models

336

16 Jan 2024

WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World KnowledgeACM Multimedia (MM), 2024

Wenbin Wang

Liang Ding

Li Shen

Yong Luo

Han Hu

Dacheng Tao

226

12 Jan 2024

Supervised Knowledge Makes Large Language Models Better In-context Learners

...

Xing Xie

388

26 Dec 2023

Neuron-Level Knowledge Attribution in Large Language Models

Zeping Yu

Sophia Ananiadou

FAtt KELM

290

19 Dec 2023

One-Shot Learning as Instruction Data Prospector for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Yunshui Li

Binyuan Hui

Xiaobo Xia

Jiaxi Yang

Min Yang

...

Fei Huang

361

16 Dec 2023

Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning

Li Lyna Zhang

Fan Yang

301

14 Dec 2023

Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use

Ting-En Lin

Rui Yan

230

07 Dec 2023

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-AllocationComputer Vision and Pattern Recognition (CVPR), 2023

Conghui He

Dahua Lin

447

356

29 Nov 2023

Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning

Kazuma Hashimoto

K. Raman

Michael Bendersky

368

16 Nov 2023

Evaluating, Understanding, and Improving Constrained Text Generation for Large Language Models

Xiang Chen

Xiaojun Wan

175

25 Oct 2023

Function Vectors in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

311

182

23 Oct 2023

All Papers

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Papers citing "Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning"