arXiv: 2002.08910
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
KELM
Papers citing
"How Much Knowledge Can You Pack Into the Parameters of a Language Model?"
50 / 627 papers shown
Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Yu Fu
Haz Sameen Shahgir
Hui Liu
Xianfeng Tang
Qi He
Yue Dong
KELM
228
1
0
11 Apr 2025
ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models
Joel Barmettler
Abraham Bernstein
Luca Rossetto
KELM
3DV
95
1
0
10 Apr 2025
Saliency-driven Dynamic Token Pruning for Large Language Models
Yao Tao
Yehui Tang
Yun Wang
Mingjian Zhu
Hailin Hu
Yunhe Wang
208
2
0
06 Apr 2025
On the Connection Between Diffusion Models and Molecular Dynamics
Liam Harcombe
Timothy T. Duignan
DiffM
189
0
0
04 Apr 2025
Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
Xinyu Wang
Linrui Ma
Jerry Huang
Peng Lu
Prasanna Parthasarathi
Xiao-Wen Chang
Boxing Chen
Yufei Cui
KELM
195
3
0
28 Mar 2025
Leveraging Language Models for Analyzing Longitudinal Experiential Data in Education
Ahatsham Hayat
Bilal Khan
Mohammad Hasan
AI4Ed
115
0
0
27 Mar 2025
Resolving UnderEdit & OverEdit with Iterative & Neighbor-Assisted Model Editing
Bhiman Kumar Baghel
Scott M. Jordan
Zheyuan Ryan Shi
Xiang Lorraine Li
KELM
150
0
0
14 Mar 2025
Taming Knowledge Conflicts in Language Models
Gaotang Li
Yuzhong Chen
Hanghang Tong
KELM
197
4
0
14 Mar 2025
ROGRAG: A Robustly Optimized GraphRAG Framework
Huanjun Kong
Zhefan Wang
Chenyang Wang
Zhe Ma
Nanqing Dong
130
0
0
09 Mar 2025
Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data
Henrik Nolte
Michèle Finck
Kristof Meding
AILaw
PILM
226
2
0
03 Mar 2025
Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks
Umar Ali Khan
Ekram Khan
Fiza Khan
A. A. Moinuddin
165
0
0
02 Mar 2025
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
Hongzhan Lin
Yang Deng
Yuxuan Gu
Wenxuan Zhang
Jing Ma
See-Kiong Ng
Tat-Seng Chua
LLMAG
KELM
HILM
210
6
0
25 Feb 2025
Revealing and Mitigating Over-Attention in Knowledge Editing
Pinzheng Wang
Zecheng Tang
Keyan Zhou
Junlin Li
Qiaoming Zhu
Hao Fei
KELM
245
4
0
21 Feb 2025
OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities
Michael Kouremetis
Marissa Dotter
Alex Byrne
Dan Martin
Ethan Michalak
Gianpaolo Russo
Michael Threet
Guido Zarrella
ELM
167
13
0
18 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Khyati Khandelwal
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Jun Yao
OffRL
198
2
0
17 Feb 2025
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
Wanqi Yang
Yongqian Li
Meng Fang
Lawrence Yunliang Chen
207
1
0
09 Feb 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
199
4
0
20 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
224
9
0
08 Jan 2025
SMARTCAL: An Approach to Self-Aware Tool-Use Evaluation and Calibration
Yuanhao Shen
Xiaodan Zhu
Lei Chen
216
6
0
11 Dec 2024
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
Roi Cohen
Konstantin Dobler
Eden Biran
Gerard de Melo
261
14
0
09 Dec 2024
What can LLM tell us about cities?
Zhuoheng Li
Yaochen Wang
Zhixue Song
Yuqi Huang
Rui Bao
Guanjie Zheng
Z. Li
141
4
0
25 Nov 2024
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Moran Yanuka
Assaf Ben-Kish
Yonatan Bitton
Idan Szpektor
Raja Giryes
VLM
266
3
0
13 Nov 2024
Controllable Context Sensitivity and the Knob Behind It
Julian Minder
Kevin Du
Niklas Stoehr
Giovanni Monea
Chris Wendler
Robert West
Robert Bamler
KELM
246
14
0
11 Nov 2024
Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation
Yu-Liang Zhan
Zhong-Yi Lu
Hao Sun
Ze-Feng Gao
122
0
0
10 Nov 2024
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
Shruti Singh
Nandan Sarkar
Arman Cohan
132
5
0
08 Nov 2024
Gradient Localization Improves Lifelong Pretraining of Language Models
Jared Fernandez
Yonatan Bisk
Emma Strubell
KELM
129
3
0
07 Nov 2024
Enabling LLM Knowledge Analysis via Extensive Materialization
Yujia Hu
Tuan-Phong Nguyen
Shrestha Ghosh
Simon Razniewski
KELM
157
4
0
07 Nov 2024
Code-Switching Curriculum Learning for Multilingual Transfer in LLMs
Haneul Yoo
Cheonbok Park
Sangdoo Yun
Alice Oh
Hwaran Lee
157
9
0
04 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
359
6
0
01 Nov 2024
HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
Yucheng Zhang
Qinfeng Li
Tianyu Du
Xuhong Zhang
Xinkui Zhao
Zhengwen Feng
Yuxiang Cai
AAML
SILM
129
10
0
30 Oct 2024
A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models
Elena Kardanova
Alina Ivanova
Ksenia Tarasova
Taras Pashchenko
Aleksei Tikhoniuk
Elen Yusupova
Anatoly Kasprzhak
Yaroslav Kuzminov
Ekaterina Kruchinskaia
Irina Brun
170
1
0
29 Oct 2024
Learning and Unlearning of Fabricated Knowledge in Language Models
Chen Sun
Nolan Miller
A. Zhmoginov
Max Vladymyrov
Mark Sandler
KELM
MU
110
2
0
29 Oct 2024
All Entities are Not Created Equal: Examining the Long Tail for Ultra-Fine Entity Typing
Advait Deshmukh
Ashwin Umadi
Dananjay Srinivas
Maria Leonor Pacheco
86
0
0
22 Oct 2024
Solving Sparse & High-Dimensional-Output Regression via Compression
Renyuan Li
Zhehui Chen
Guanyi Wang
68
0
0
21 Oct 2024
NetSafe: Exploring the Topological Safety of Multi-agent Networks
Miao Yu
Shilong Wang
Guibin Zhang
Junyuan Mao
Chenlong Yin
Qijiong Liu
Qingsong Wen
Kun Wang
Yang Wang
113
19
0
21 Oct 2024
Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Shailaja Keyur Sampat
Maitreya Patel
Yezhou Yang
Chitta Baral
64
0
0
17 Oct 2024
Enhancing Fact Retrieval in PLMs through Truthfulness
Paul Youssef
Jorg Schlotterer
C. Seifert
KELM
HILM
107
0
0
17 Oct 2024
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
Ahmed Oumar El-Shangiti
Tatsuya Hiraoka
Hilal AlQuabeh
Benjamin Heinzerling
Kentaro Inui
206
3
0
17 Oct 2024
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Jiatao Li
Xinyu Hu
Xunjian Yin
Xiaojun Wan
RALM
262
0
0
17 Oct 2024
Telco-DPR: A Hybrid Dataset for Evaluating Retrieval Models of 3GPP Technical Specifications
Thaina Saraiva
Marco Sousa
Pedro Vieira
António Rodrigues
145
3
0
15 Oct 2024
ACER: Automatic Language Model Context Extension via Retrieval
Luyu Gao
Yunyi Zhang
Jamie Callan
RALM
97
0
0
11 Oct 2024
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Bozhou Li
Hao Liang
Yang Li
Fangcheng Fu
Hongzhi Yin
Conghui He
Wentao Zhang
KELM
CLL
131
0
0
08 Oct 2024
Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models
M. Farahani
Richard Johansson
RALM
136
5
0
07 Oct 2024
Neuron-Level Sequential Editing for Large Language Models
Houcheng Jiang
Cunchun Li
Tianyu Zhang
An Zhang
Ruipeng Wang
Tao Liang
Xiang Wang
KELM
149
8
0
05 Oct 2024
Defining Knowledge: Bridging Epistemology and Large Language Models
Constanza Fierro
Ruchira Dhar
Filippos Stamatiou
Nicolas Garneau
Anders Søgaard
KELM
161
9
0
03 Oct 2024
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization
Mingyang Wang
Lukas Lange
Heike Adel
Jannik Strötgen
Hinrich Schütze
KELM
123
3
0
03 Oct 2024
Large Language Models as Markov Chains
Oussama Zekri
Ambroise Odonnat
Khyati Khandelwal
Linus Bleistein
Nicolas Boullé
I. Redko
190
23
0
03 Oct 2024
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Cunchun Li
Houcheng Jiang
Kun Wang
Yunshan Ma
Shi Jie
Xiangnan He
Tat-Seng Chua
KELM
303
101
0
03 Oct 2024
Adaptively Private Next-Token Prediction of Large Language Models
James Flemings
Meisam Razaviyayn
Murali Annavaram
192
2
0
02 Oct 2024
Recursive Abstractive Processing for Retrieval in Dynamic Datasets
Charbel Chucri
Rami Azouz
Joachim Ott
84
0
0
02 Oct 2024