ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08910
  4. Cited By
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
v1v2v3v4 (latest)

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 645 papers shown
EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations
EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations
Xinyun Zhou
Xinfeng Li
Yinan Peng
Ming Xu
X. Zhang
...
X. Jia
Kun Wang
Qingsong Wen
Xiaofeng Wang
Wei Dong
AAML
181
1
0
01 Dec 2025
Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day
Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day
Milad Abdollahzadeh
Abdul Raheem
Zilong Zhao
Uzair Javaid
Kevin Yee
Nalam Venkata Abhishek
Tram Truong-Huu
Biplab Sikdar
LMTDALM
295
0
0
28 Nov 2025
An Empirical Study on the Security Vulnerabilities of GPTs
An Empirical Study on the Security Vulnerabilities of GPTs
Tong Wu
Weibin Wu
Zibin Zheng
LLMAGELM
181
0
0
28 Nov 2025
Adaptive Focus Memory for Language Models
Adaptive Focus Memory for Language Models
Christopher Cruz
KELM
297
0
0
16 Nov 2025
Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
Alina Fastowski
Bardh Prenkaj
Yuxiao Li
Gjergji Kasneci
AAMLKELMHILM
329
0
0
08 Nov 2025
Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement
Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement
Sekh Mainul Islam
Pepa Atanasova
Isabelle Augenstein
181
1
0
03 Nov 2025
LM-mixup: Text Data Augmentation via Language Model based Mixup
LM-mixup: Text Data Augmentation via Language Model based Mixup
Zhijie Deng
Zhouan Shen
Ling Li
Yao Zhou
Zhaowei Zhu
Yanji He
Wei Wang
Jiaheng Wei
128
0
0
23 Oct 2025
Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks
Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks
Javier Marín
134
1
0
23 Oct 2025
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
Kailin Jiang
Hongbo Jiang
Ning Jiang
Zhi Gao
Jinhe Bi
Yuchen Ren
B. Li
Yuntao Du
L. J. Liu
Qing Li
CLLOffRLKELMVLM
262
6
0
22 Oct 2025
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection
Hongyi He
Xiao Liu
Zhenghao Lin
Mingni Tang
Y. Cheng
Jintao Wang
W. Li
Peng Cheng
Yeyun Gong
OODD
259
0
0
21 Oct 2025
From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering
From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering
Lei Li
Xiao Zhou
Y. Zhang
X. Wu
RALMMedIm
190
0
0
21 Oct 2025
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Facts in Stats: Impacts of Pretraining Diversity on Language Model Generalization
Tina Behnia
Puneesh Deora
Christos Thrampoulidis
135
0
0
17 Oct 2025
Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior
Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior
Rahul Nadkarni
Yanai Elazar
Hila Gonen
Noah A. Smith
KELM
173
0
0
16 Oct 2025
On the Entity-Level Alignment in Crosslingual Consistency
On the Entity-Level Alignment in Crosslingual Consistency
Yihong Liu
Mingyang Wang
François Yvon
Hinrich Schütze
HILM
196
1
0
11 Oct 2025
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Pengfei He
Zhenwei Dai
Bing He
Hui Liu
Xianfeng Tang
...
Subhabrata Mukherjee
Suhang Wang
Yue Xing
Shucheng Zhou
Benoit Dumoulin
LLMAG
222
3
0
06 Oct 2025
SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
Zichao Shen
Chen Gao
Jiaqi Yuan
Tianchen Zhu
Xingcheng Fu
Qingyun Sun
135
1
0
30 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
KELMMoERALMLRM
297
4
0
29 Sep 2025
Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
Bingkui Tong
Jiaer Xia
Kaiyang Zhou
MLLM
214
4
0
29 Sep 2025
Knowledge Homophily in Large Language Models
Knowledge Homophily in Large Language Models
Utkarsh Sahu
Zhisheng Qi
M. Halappanavar
Nedim Lipka
Ryan Rossi
Franck Dernoncourt
Yu Zhang
Yao Ma
Yu Wang
154
0
0
28 Sep 2025
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie
Jun Zhou
G. Wang
Shisong Wud
Zichen Wang
216
0
0
24 Sep 2025
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
Dongjun Kim
Gyuho Shim
YongChan Chun
Minhyuk Kim
Chanjun Park
Heuiseok Lim
178
1
0
23 Sep 2025
Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference
Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference
Ben Finkelshtein
Silviu Cucerzan
S. Jauhar
Ryen W. White
207
0
0
23 Sep 2025
How Persuasive is Your Context?
How Persuasive is Your Context?
Tu Nguyen
Kevin Du
Alexander Miserlis Hoyle
Ryan Cotterell
142
0
0
22 Sep 2025
KAHAN: Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
KAHAN: Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
Yajing Yang
Tony Deng
Min-Yen Kan
142
0
0
21 Sep 2025
Rethinking the Role of Text Complexity in Language Model Pretraining
Rethinking the Role of Text Complexity in Language Model Pretraining
Dan John Velasco
M. R
242
2
0
20 Sep 2025
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
Kangtao Lv
Haibin Chen
Yujin Yuan
Langming Liu
Shilei Liu
Yongwei Wang
Yuchi Xu
B. Zheng
KELM
185
0
0
19 Sep 2025
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Lukas Toral
Teddy Lazebnik
214
0
0
10 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELMHILM
182
3
0
10 Sep 2025
CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Ruiling Guo
Xinwei Yang
Chen Huang
Tong Zhang
Yong Hu
206
0
0
04 Sep 2025
Provable Benefits of In-Tool Learning for Large Language Models
Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston
Ambroise Odonnat
Charles Arnal
Vivien A. Cabannes
RALM
172
2
0
28 Aug 2025
CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models
CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models
Anant Khandelwal
Manish Gupta
Puneet Agrawal
246
2
0
25 Aug 2025
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
Yunxiao Zhao
Hao Xu
Zhiqiang Wang
Xiaoli Li
Jiye Liang
Ru Li
LRM
192
1
0
23 Aug 2025
From Confidence to Collapse in LLM Factual Robustness
From Confidence to Collapse in LLM Factual RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Alina Fastowski
Bardh Prenkaj
Gjergji Kasneci
HILMAAML
268
3
0
22 Aug 2025
Hallucinations in medical devices
Hallucinations in medical devices
Jason Granstedt
Prabhat Kc
Rucha Deshpande
Victor Garcia
Aldo Badano
208
6
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
...
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
Junhao Song
ELM
325
6
0
17 Aug 2025
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
Xinda Jia
Jinpeng Li
Zezhong Wang
Jingjing Li
Xingshan Zeng
Yasheng Wang
Weinan Zhang
Yong Yu
Weiwen Liu
LRM
174
1
0
17 Aug 2025
RAST: A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
RAST: A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
Weilin Ruan
Xilin Dang
Ziyu Zhou
Sisuo Lyu
Yuxuan Liang
AI4TS
447
1
0
14 Aug 2025
Learning Facts at Scale with Active Reading
Learning Facts at Scale with Active Reading
Jessy Lin
Vincent-Pierre Berges
Xilun Chen
Anuj Kumar
Gargi Ghosh
Barlas Oğuz
RALMKELM
206
6
0
13 Aug 2025
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Xin Liu
Qiyang Song
Shaowen Xu
Kerou Zhou
Wenbo Jiang
Xiaoqi Jia
Weijuan Zhang
Heqing Huang
Yakai Li
KELM
209
0
0
01 Aug 2025
A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions
A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions
Agada Joseph Oche
Ademola Glory Folashade
Tirthankar Ghosal
Arpan Biswas
3DVVLM
481
32
0
25 Jul 2025
Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to Misinformation
Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to MisinformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Kyubeen Han
Junseo Jang
Hongjin Kim
Geunyeong Jeong
Harksoo Kim
175
3
0
24 Jul 2025
Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Todd Nief
David Reber
Sean Richardson
Ari Holtzman
KELM
226
0
0
25 Jun 2025
From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts
From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts
Daniel Christoph
Max Ploner
Patrick Haller
Alan Akbik
KELM
148
1
0
20 Jun 2025
Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
Khurram Yamin
Gaurav R. Ghosal
Bryan Wilder
LRM
326
3
0
15 Jun 2025
Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Hongru Wang
Cheng Qian
Pengfei Yu
Jiahao Qiu
Boyang Xue
Mengdi Wang
Heng Ji
Kam-Fai Wong
Kam-Fai Wong
422
10
0
01 Jun 2025
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Boyi Zhang
Zhuo Liu
Hangfeng He
LRM
221
0
0
31 May 2025
How much do language models memorize?
How much do language models memorize?
John X. Morris
Chawin Sitawarin
Chuan Guo
Narine Kokhlikyan
G. E. Suh
Alexander M. Rush
Kamalika Chaudhuri
Saeed Mahloujifar
KELMELM
455
35
0
30 May 2025
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
Vasilije Markovic
Lazar Obradovic
Laszlo Hajdu
Jovan Pavlovic
281
9
0
30 May 2025
Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds
Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds
Ishwar B Balappanawar
Vamshi Krishna Bonagiri
Anish Joishy
Manas Gaur
K. Thirunarayan
Ponnurangam Kumaraguru
ReLMLRM
325
0
0
28 May 2025
Precise In-Parameter Concept Erasure in Large Language Models
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELMMU
435
7
0
28 May 2025
1234...111213
Next
Page 1 of 13
Pageof 13