ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08910
  4. Cited By
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
v1v2v3v4 (latest)

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM
ArXiv (abs)PDFHTML

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 627 papers shown
Title
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Pengfei He
Zhenwei Dai
Bing He
Hui Liu
Xianfeng Tang
...
Subhabrata Mukherjee
Suhang Wang
Yue Xing
Jiliang Tang
Benoit Dumoulin
LLMAG
8
0
0
06 Oct 2025
SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning
Zichao Shen
Chen Gao
Jiaqi Yuan
Tianchen Zhu
Xingcheng Fu
Qingyun Sun
0
0
0
30 Sep 2025
Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
Bingkui Tong
Jiaer Xia
Kaiyang Zhou
MLLM
8
0
0
29 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
RALMKELM
48
0
0
29 Sep 2025
Knowledge Homophily in Large Language Models
Knowledge Homophily in Large Language Models
Utkarsh Sahu
Zhisheng Qi
M. Halappanavar
Nedim Lipka
Ryan Rossi
Franck Dernoncourt
Yu Zhang
Yao Ma
Yu Wang
0
0
0
28 Sep 2025
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie
Jun Zhou
G. Wang
Shisong Wud
Zichen Wang
12
0
0
24 Sep 2025
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
Dongjun Kim
Gyuho Shim
YongChan Chun
Minhyuk Kim
Chanjun Park
Heuiseok Lim
0
0
0
23 Sep 2025
Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference
Actions Speak Louder than Prompts: A Large-Scale Study of LLMs for Graph Inference
Ben Finkelshtein
Silviu Cucerzan
S. Jauhar
Ryen W. White
20
0
0
23 Sep 2025
How Persuasive is Your Context?
How Persuasive is Your Context?
Tu Nguyen
Kevin Du
Alexander Miserlis Hoyle
Ryan Cotterell
28
0
0
22 Sep 2025
KAHAN: Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
KAHAN: Knowledge-Augmented Hierarchical Analysis and Narration for Financial Data Narration
Yajing Yang
Tony Deng
Min-Yen Kan
16
0
0
21 Sep 2025
Rethinking the Role of Text Complexity in Language Model Pretraining
Rethinking the Role of Text Complexity in Language Model Pretraining
Dan John Velasco
M. R
8
1
0
20 Sep 2025
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
Kangtao Lv
Haibin Chen
Yujin Yuan
Langming Liu
Shilei Liu
Yongwei Wang
Yuchi Xu
B. Zheng
KELM
22
0
0
19 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELMHILM
14
0
0
10 Sep 2025
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing
Lukas Toral
Teddy Lazebnik
40
0
0
10 Sep 2025
CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
CANDY: Benchmarking LLMs' Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Ruiling Guo
Xinwei Yang
Chen Huang
Tong Zhang
Yong Hu
32
0
0
04 Sep 2025
Provable Benefits of In-Tool Learning for Large Language Models
Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston
Ambroise Odonnat
Charles Arnal
Vivien A. Cabannes
RALM
52
1
0
28 Aug 2025
CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models
CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models
Anant Khandelwal
Manish Gupta
Puneet Agrawal
38
0
0
25 Aug 2025
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
Yunxiao Zhao
Hao Xu
Zhiqiang Wang
Xiaoli Li
Jiye Liang
Ru Li
LRM
20
1
0
23 Aug 2025
From Confidence to Collapse in LLM Factual Robustness
From Confidence to Collapse in LLM Factual Robustness
Alina Fastowski
Bardh Prenkaj
Gjergji Kasneci
HILMAAML
56
0
0
22 Aug 2025
Hallucinations in medical devices
Hallucinations in medical devices
Jason Granstedt
Prabhat Kc
Rucha Deshpande
Victor Garcia
Aldo Badano
36
0
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Tianyang Wang
...
Lu Chen
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
ELM
32
0
0
17 Aug 2025
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
Xinda Jia
Jinpeng Li
Zezhong Wang
Jingjing Li
Xingshan Zeng
Yasheng Wang
Weinan Zhang
Yong Yu
Weiwen Liu
LRM
44
0
0
17 Aug 2025
A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
A Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
Weilin Ruan
Xilin Dang
Ziyu Zhou
Sisuo Lyu
Yuxuan Liang
AI4TS
32
0
0
14 Aug 2025
Learning Facts at Scale with Active Reading
Learning Facts at Scale with Active Reading
Jessy Lin
Vincent-Pierre Berges
Xilun Chen
Anuj Kumar
Gargi Ghosh
Barlas Oğuz
RALMKELM
52
1
0
13 Aug 2025
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models
Xin Liu
Qiyang Song
Shaowen Xu
Kerou Zhou
Wenbo Jiang
Xiaoqi Jia
Weijuan Zhang
Heqing Huang
Yakai Li
KELM
52
0
0
01 Aug 2025
A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions
A Systematic Review of Key Retrieval-Augmented Generation (RAG) Systems: Progress, Gaps, and Future Directions
Agada Joseph Oche
Ademola Glory Folashade
Tirthankar Ghosal
Arpan Biswas
3DVVLM
90
2
0
25 Jul 2025
Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to Misinformation
Exploring the Impact of Instruction-Tuning on LLM's Susceptibility to Misinformation
Kyubeen Han
Junseo Jang
Hongjin Kim
Geunyeong Jeong
Harksoo Kim
61
0
0
24 Jul 2025
From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts
From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts
Daniel Christoph
Max Ploner
Patrick Haller
Alan Akbik
KELM
50
0
0
20 Jun 2025
LLMs Struggle to Perform Counterfactual Reasoning with Parametric Knowledge
LLMs Struggle to Perform Counterfactual Reasoning with Parametric Knowledge
Khurram Yamin
Gaurav R. Ghosal
Bryan Wilder
LRM
83
3
0
15 Jun 2025
Toward a Theory of Agents as Tool-Use Decision-Makers
Toward a Theory of Agents as Tool-Use Decision-Makers
Hongru Wang
Cheng Qian
Pengfei Yu
Jiahao Qiu
Boyang Xue
Mengdi Wang
Heng Ji
Kam-Fai Wong
114
6
0
01 Jun 2025
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
Boyi Zhang
Zhuo Liu
Hangfeng He
LRM
74
0
0
31 May 2025
How much do language models memorize?
How much do language models memorize?
John X. Morris
Chawin Sitawarin
Chuan Guo
Narine Kokhlikyan
G. E. Suh
Alexander M. Rush
Kamalika Chaudhuri
Saeed Mahloujifar
KELMELM
164
11
0
30 May 2025
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning
Vasilije Markovic
Lazar Obradovic
Laszlo Hajdu
Jovan Pavlovic
98
2
0
30 May 2025
If Pigs Could Fly... Can LLMs Logically Reason Through Counterfactuals?
If Pigs Could Fly... Can LLMs Logically Reason Through Counterfactuals?
Ishwar B Balappanawar
Vamshi Krishna Bonagiri
Anish Joishy
Manas Gaur
K. Thirunarayan
Ponnurangam Kumaraguru
ReLMLRM
116
0
0
28 May 2025
Precise In-Parameter Concept Erasure in Large Language Models
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELMMU
158
0
0
28 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
135
1
0
27 May 2025
InFact: Informativeness Alignment for Improved LLM Factuality
InFact: Informativeness Alignment for Improved LLM Factuality
Roi Cohen
Russa Biswas
Gerard de Melo
77
0
0
26 May 2025
A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models
A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models
Utkarsh Sahu
Zhisheng Qi
Y. Lei
Ryan Rossi
Franck Dernoncourt
Nesreen K. Ahmed
M. Halappanavar
Yao Ma
Yu Wang
134
0
0
25 May 2025
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
Xinran Gu
Kaifeng Lyu
Jiazheng Li
Jingzhao Zhang
154
1
0
23 May 2025
Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization
Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization
Aliakbar Nafar
Kristen Brent Venable
Zijun Cui
Parisa Kordjamshidi
BDL
100
1
0
21 May 2025
Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs
Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs
Federico Ranaldi
Andrea Zugarini
Leonardo Ranaldi
Fabio Massimo Zanzotto
71
0
0
21 May 2025
EAMET: Robust Massive Model Editing via Embedding Alignment Optimization
EAMET: Robust Massive Model Editing via Embedding Alignment Optimization
Yanbo Dai
Zhenlan Ji
Zongjie Li
Shuai Wang
KELM
140
0
0
17 May 2025
DACL-RAG: Data Augmentation Strategy with Curriculum Learning for Retrieval-Augmented Generation
DACL-RAG: Data Augmentation Strategy with Curriculum Learning for Retrieval-Augmented Generation
S. Wang
Li Zhang
Zheren Fu
Zhendong Mao
Yongdong Zhang
84
0
0
15 May 2025
IterKey: Iterative Keyword Generation with LLMs for Enhanced Retrieval Augmented Generation
IterKey: Iterative Keyword Generation with LLMs for Enhanced Retrieval Augmented Generation
Kazuki Hayashi
Hidetaka Kamigaito
Shinya Kouda
Taro Watanabe
RALM
171
3
0
13 May 2025
DeltaEdit: Enhancing Sequential Editing in Large Language Models by Controlling Superimposed Noise
DeltaEdit: Enhancing Sequential Editing in Large Language Models by Controlling Superimposed Noise
Ding Cao
Yuchen Cai
Rongxi Guo
Xiaoxiao He
Guiquan Liu
KELM
229
0
0
12 May 2025
CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code
CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code
Tasnim Ahmed
Salimur Choudhury
86
1
0
02 May 2025
EnronQA: Towards Personalized RAG over Private Documents
EnronQA: Towards Personalized RAG over Private Documents
Michael J. Ryan
Danmei Xu
Chris Nivera
Daniel Campos
SILM
191
4
0
01 May 2025
ConSens: Assessing context grounding in open-book question answering
ConSens: Assessing context grounding in open-book question answeringInternational Conference on Artificial Neural Networks (ICANN), 2025
Ivan Vankov
Matyo Ivanov
Adriana Correia
Victor Botev
ELM
250
0
0
30 Apr 2025
Functional Abstraction of Knowledge Recall in Large Language Models
Functional Abstraction of Knowledge Recall in Large Language Models
Zijian Wang
Chang Xu
KELM
99
1
0
20 Apr 2025
Replicating ReLM Results: Validating Large Language Models with ReLM
Replicating ReLM Results: Validating Large Language Models with ReLM
Reece Adamson
Erin Song
54
0
0
16 Apr 2025
1234...111213
Next