v1v2v3v4 (latest)

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

10 February 2020

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 645 papers shown

EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations

...

181

01 Dec 2025

Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day

Nalam Venkata Abhishek

Tram Truong-Huu

Biplab Sikdar

LMTD ALM

295

28 Nov 2025

An Empirical Study on the Security Vulnerabilities of GPTs

181

28 Nov 2025

Adaptive Focus Memory for Language Models

Christopher Cruz

KELM

297

16 Nov 2025

Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

329

08 Nov 2025

Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement

Sekh Mainul Islam

Pepa Atanasova

Isabelle Augenstein

181

03 Nov 2025

LM-mixup: Text Data Augmentation via Language Model based Mixup

128

23 Oct 2025

Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks

Javier Marín

134

23 Oct 2025

KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints

262

22 Oct 2025

Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection

259

21 Oct 2025

From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering

190

21 Oct 2025

Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization

Tina Behnia

Puneesh Deora

Christos Thrampoulidis

135

17 Oct 2025

Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior

173

16 Oct 2025

On the Entity-Level Alignment in Crosslingual Consistency

196

11 Oct 2025

TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use

...

222

06 Oct 2025

SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning

135

30 Sep 2025

Pretraining with hierarchical memories: separating long-tail and common knowledge

297

29 Sep 2025

Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding

214

29 Sep 2025

Knowledge Homophily in Large Language Models

154

28 Sep 2025

Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation

216

24 Sep 2025

Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks

178

23 Sep 2025

Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference

207

23 Sep 2025

How Persuasive is Your Context?

Tu Nguyen

Kevin Du

Alexander Miserlis Hoyle

Ryan Cotterell

142

22 Sep 2025

KAHAN: Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration

Yajing Yang

Tony Deng

Min-Yen Kan

142

21 Sep 2025

Rethinking the Role of Text Complexity in Language Model Pretraining

Dan John Velasco

M. R

242

20 Sep 2025

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models

185

19 Sep 2025

Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing

Lukas Toral

Teddy Lazebnik

214

10 Sep 2025

Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms

182

10 Sep 2025

CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking

206

04 Sep 2025

Provable Benefits of In-Tool Learning for Large Language Models

172

28 Aug 2025

CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models

Anant Khandelwal

Manish Gupta

Puneet Agrawal

246

25 Aug 2025

Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective

192

23 Aug 2025

From Confidence to Collapse in LLM Factual RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

268

22 Aug 2025

Hallucinations in medical devices

208

18 Aug 2025

Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models

...

325

17 Aug 2025

Fast, Slow, and Tool-augmented Thinking for LLMs: A Review

174

17 Aug 2025

RAST: A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction

447

14 Aug 2025

Learning Facts at Scale with Active Reading

Jessy Lin

Vincent-Pierre Berges

206

13 Aug 2025

Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models

209

01 Aug 2025

A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions

Agada Joseph Oche

Ademola Glory Folashade

Tirthankar Ghosal

Arpan Biswas

3DV VLM

481

25 Jul 2025

Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to MisinformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

175

24 Jul 2025

Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

226

25 Jun 2025

From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts

148

20 Jun 2025

Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning

326

15 Jun 2025

Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

422

01 Jun 2025

TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

221

31 May 2025

How much do language models memorize?

455

30 May 2025

Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning

281

30 May 2025

Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Ishwar B Balappanawar

Vamshi Krishna Bonagiri

Anish Joishy

Manas Gaur

K. Thirunarayan

Ponnurangam Kumaraguru

ReLM LRM

325

28 May 2025

Precise In-Parameter Concept Erasure in Large Language Models

435

28 May 2025