
A Theory of Emergent In-Context Learning as Implicit Structure Induction
Michael Hahn, Navin Goyal
arXiv:2303.07971, 14 March 2023

Papers citing "A Theory of Emergent In-Context Learning as Implicit Structure Induction"

50 of 66 papers shown
Genomic Next-Token Predictors are In-Context Learners
Nathan Breslow, Aayush Mishra, Mahler Revsine, Michael C. Schatz, Anqi Liu, Daniel Khashabi
16 Nov 2025

Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training
Dake Bu, Wei Huang, Andi Han, Atsushi Nitanda, Hau-San Wong, Qingfu Zhang, Taiji Suzuki
10 Nov 2025

Hyperspectral data augmentation with transformer-based diffusion models
Mattia Ferrari, Lorenzo Bruzzone
09 Oct 2025

Can LLMs Reason Over Non-Text Modalities in a Training-Free Manner? A Case Study with In-Context Representation Learning
Tianle Zhang, Wanlong Fang, Jonathan Woo, Paridhi Latawa, Deepak A. Subramanian, Alvin Chan
22 Sep 2025

InSQuAD: In-Context Learning for Efficient Retrieval via Submodular Mutual Information to Enforce Quality and Diversity
Souradeep Nanda, Anay Majee, Rishabh K. Iyer
28 Aug 2025

The Other Mind: How Language Models Exhibit Human Temporal Cognition
Lingyu Li, Yang Yao, Yixu Wang, Chubo Li, Yan Teng, Y. Wang
21 Jul 2025

Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon, Eric Elmoznino, Sarthak Mittal, Tom Marty, Tejas Kasetty, Dhanya Sridhar, Guillaume Lajoie
19 Jun 2025
Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning
Chengye Li, Haiyun Liu, Yuanxi Li
13 Jun 2025

Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors
Harish Tayyar Madabushi, Melissa Torgbi, C. Bonial
29 May 2025

Mechanistic evaluation of Transformers and state space models
Aryaman Arora, Neil Rathi, Nikil Roashan Selvam, Róbert Csordás, Dan Jurafsky, Christopher Potts
21 May 2025

ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers
Zhouxiang Fang, Aayush Mishra, Muhan Gao, Anqi Liu, Daniel Khashabi
28 Apr 2025

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Rei Higuchi, Ryotaro Kawata, Naoki Nishikawa, Kazusato Oko, Shoichiro Yamaguchi, Sosuke Kobayashi, Seiya Tokui, K. Hayashi, Daisuke Okanohara, Taiji Suzuki
24 Apr 2025

Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B
Aleksandra Bakalova, Yana Veitsman, Xinting Huang, Michael Hahn
31 Mar 2025

Enough Coin Flips Can Make LLMs Act Bayesian (ACL 2025)
Ritwik Gupta, Rodolfo Corona, Jiaxin Ge, Eric Wang, Dan Klein, Trevor Darrell, David M. Chan
06 Mar 2025

Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers
Alireza Amiri, Xinting Huang, Mark Rofin, Michael Hahn
04 Feb 2025
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data? (ICLR 2025)
Yutong Yin, Zhaoran Wang
27 Jan 2025

Using Pre-trained LLMs for Multivariate Time Series Forecasting
Malcolm Wolff, Shenghao Yang, Kari Torkkola, Michael W. Mahoney
10 Jan 2025

Out-of-distribution generalization via composition: a lens through induction heads in Transformers (PNAS 2024)
Jiajun Song, Zhuoyan Xu, Yiqiao Zhong
31 Dec 2024

Conceptual In-Context Learning and Chain of Concepts: Solving Complex Conceptual Problems Using Large Language Models
Nishtha N. Vaidya, Thomas Runkler, Thomas Hubauer, Veronika Haderlein-Hoegberg, Maja Mlicic Brandt
19 Dec 2024

Bayesian scaling laws for in-context learning
Aryaman Arora, Dan Jurafsky, Christopher Potts, Noah D. Goodman
21 Oct 2024

On the Training Convergence of Transformers for In-Context Classification of Gaussian Mixtures
Wei Shen, Ruida Zhou, Jing Yang, Cong Shen
15 Oct 2024

Racing Thoughts: Explaining Contextualization Errors in Large Language Models (NAACL 2024)
Michael A. Lepori, Michael Mozer, Asma Ghandeharioun
02 Oct 2024

In-Context Learning with Representations: Contextual Generalization of Trained Transformers (NeurIPS 2024)
Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi
19 Aug 2024

Representing Rule-based Chatbots with Transformers
Dan Friedman, Abhishek Panigrahi, Danqi Chen
15 Jul 2024
Estimating the Hallucination Rate of Generative AI
Andrew Jesson, Nicolas Beltran-Velez, Quentin Chu, Sweta Karlekar, Jannik Kossen, Yarin Gal, John P. Cunningham, David M. Blei
11 Jun 2024

On Subjective Uncertainty Quantification and Calibration in Natural Language Generation (AISTATS 2024)
Ziyu Wang, Chris Holmes
07 Jun 2024

What Do Language Models Learn in Context? The Structured Task Hypothesis (ACL 2024)
Jiaoda Li, Buse Giledereli, Mrinmaya Sachan, Robert Bamler
06 Jun 2024

Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Fabian Falck, Ziyu Wang, Chris Holmes
02 Jun 2024

How In-Context Learning Emerges from Training on Unstructured Data: On the Role of Co-Occurrence, Positional Information, and Noise Structures
Kevin Christian Wibisono, Yixin Wang
31 May 2024

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Jianliang He, Siyu Chen, Fengzhuo Zhang, Zhuoran Yang
30 May 2024

Does learning the right latent variables necessarily improve in-context learning?
Sarthak Mittal, Eric Elmoznino, Léo Gagnon, Sangnie Bhardwaj, Tom Marty, Dhanya Sridhar, Guillaume Lajoie
29 May 2024

Finding Visual Task Vectors
Alberto Hojel, Yutong Bai, Trevor Darrell, Amir Globerson, Amir Bar
08 Apr 2024

Can large language models explore in-context? (NeurIPS 2024)
Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins
22 Mar 2024
Concept-aware Data Construction Improves In-context Learning of Language Models (ACL 2024)
Michal Štefánik, Marek Kadlcík, Petr Sojka
08 Mar 2024

LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz
28 Feb 2024

Visual In-Context Learning for Large Vision-Language Models
Yucheng Zhou, Xiang Li, Qianning Wang, Jianbing Shen
18 Feb 2024

Pelican Soup Framework: A Theoretical Framework for Language Model Capabilities
Ting-Rui Chiang, Dani Yogatama
16 Feb 2024

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos
06 Feb 2024

Learning Universal Predictors (ICML 2024)
Jordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Grégoire Delétang, ..., Anian Ruoss, Wenliang Kevin Li, Christopher Mattern, Matthew Aitchison, J. Veness
26 Jan 2024

Demystifying Chains, Trees, and Graphs of Thoughts (TPAMI 2024)
Maciej Besta, Florim Memedi, Zhenyu Zhang, Robert Gerstenberger, Guangyuan Piao, ..., Aleš Kubíček, H. Niewiadomski, Aidan O'Mahony, Onur Mutlu, Torsten Hoefler
25 Jan 2024

In-Context Language Learning: Architectures and Algorithms (ICML 2024)
Ekin Akyürek, Bailin Wang, Yoon Kim, Jacob Andreas
23 Jan 2024
Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning (EMNLP 2024)
Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen
11 Jan 2024

Generalization to New Sequential Decision Making Tasks with In-Context Learning
Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu
06 Dec 2023

How are Prompts Different in Terms of Sensitivity? (NAACL 2023)
Sheng Lu, Hendrik Schuff, Iryna Gurevych
13 Nov 2023

Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions (ICLR 2023)
Sachin Kumar, Chan Young Park, Yulia Tsvetkov
13 Nov 2023

The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis (EMNLP 2023)
Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
01 Nov 2023

Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection
Costas Mavromatis, Ninad Kulkarni, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis
30 Oct 2023

In-Context Learning Dynamics with Random Binary Sequences (ICLR 2023)
Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, T. Ullman
26 Oct 2023

Function Vectors in Large Language Models (ICLR 2023)
Eric Todd, Millicent Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau
23 Oct 2023

Do pretrained Transformers Learn In-Context by Gradient Descent?
Lingfeng Shen, Aayush Mishra, Daniel Khashabi
12 Oct 2023