How Much Knowledge Can You Pack Into the Parameters of a Language Model?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 627 papers shown
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
77
0
0
25 Sep 2024
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
Wanqi Yang
Yanda Li
Meng Fang
Ling Chen
129
9
0
25 Sep 2024
Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering
M. Lysyuk
Mikhail Salnikov
Pavel Braslavski
Sergey Petrakov
88
2
0
24 Sep 2024
GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation
B. Rappazzo
Yingheng Wang
Aaron Ferber
Daniel Schwalbe-Koda
VLM
101
1
0
23 Sep 2024
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time
David Herel
Vojtech Bartek
Jiri Jirak
Tomas Mikolov
215
4
0
20 Sep 2024
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning
Santosh Kumar Radha
Yasamin Nouri Jelyani
Ara Ghukasyan
Oktay Goktas
LLMAG, LM&Ro, LRM
185
10
0
19 Sep 2024
RAG-Modulo: Solving Sequential Tasks using Experience, Critics, and Language Models
Abhinav Jain
Chris Jermaine
Vaibhav Unhelkar
KELM, LLMAG
106
2
0
18 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
129
11
0
15 Sep 2024
Understanding Knowledge Drift in LLMs through Misinformation
Alina Fastowski
Gjergji Kasneci
KELM
104
2
0
11 Sep 2024
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
224
15
0
11 Sep 2024
A Fresh Take on Stale Embeddings: Improving Dense Retriever Training with Corrector Networks
Nicholas Monath
Will Grathwohl
Michael Boratko
Rob Fergus
Andrew McCallum
Manzil Zaheer
114
0
0
03 Sep 2024
Explicit Inductive Inference using Large Language Models
Tianyang Liu
Tianyi Li
Liang Cheng
Mark Steedman
108
0
0
26 Aug 2024
Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter
Junhao Chen
Bowen Wang
Zhouqiang Jiang
Yuta Nakashima
147
3
0
20 Aug 2024
FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection
Yufei Huang
Xu Han
Maosong Sun
115
2
0
12 Aug 2024
Urban Region Pre-training and Prompting: A Graph-based Approach
Jiahui Jin
Yifan Song
Dong Kan
Haojia Zhu
Xiangguo Sun
Zhicheng Li
Xigang Sun
Jinghui Zhang
AI4TS, AI4CE
257
5
0
12 Aug 2024
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
Yuchen Dong
Xiaoxiang Fang
Yuchen Hu
Renshuang Jiang
Zhe Jiang
140
0
0
07 Aug 2024
Entity Retrieval for Answering Entity-Centric Questions
Hassan S. Shavarani
Anoop Sarkar
RALM
82
6
0
05 Aug 2024
Can LLMs predict the convergence of Stochastic Gradient Descent?
Hiroki Sakaji
Khyati Khandelwal
Wataru Kuramoto
LRM
135
2
0
03 Aug 2024
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Ori Yoran
S. Amouyal
Chaitanya Malaviya
Ben Bogin
Ofir Press
Jonathan Berant
LLMAG
182
61
0
22 Jul 2024
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Meng Wang
Yunzhi Yao
Ziwen Xu
Shuofei Qiao
Shumin Deng
...
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
Ningyu Zhang
201
49
0
22 Jul 2024
Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base
Zhiyu An
Xianzhong Ding
Yen-Chun Fu
Cheng-Chung Chu
Yan Li
Wan Du
RALM
87
9
0
20 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
129
14
0
19 Jul 2024
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILM, ELM, AILaw
111
20
0
10 Jul 2024
Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval
Kazuaki Furumai
Roberto Legaspi
Julio Vizcarra
Yudai Yamazaki
Yasutaka Nishimura
Sina J. Semnani
Kazushi Ikeda
Weiyan Shi
Monica S. Lam
136
10
0
04 Jul 2024
Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?
Peter Hase
Thomas Hofweber
Xiang Zhou
Elias Stengel-Eskin
Joey Tianyi Zhou
KELM, LRM
135
19
0
27 Jun 2024
Mental Modeling of Reinforcement Learning Agents by Language Models
Wenhao Lu
Xufeng Zhao
Josua Spisak
Jae Hee Lee
Stefan Wermter
LLMAG, LRM, LM&Ro
105
3
0
26 Jun 2024
It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
Sagi Shaier
Lawrence E Hunter
Katharina von der Wense
136
4
0
24 Jun 2024
One Thousand and One Pairs: A "novel" challenge for long-context language models
Marzena Karpinska
Katherine Thai
Kyle Lo
Tanya Goyal
Mohit Iyyer
LRM
204
62
0
24 Jun 2024
Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models
Christopher Burger
Yifan Hu
Thai Le
KELM
94
0
0
22 Jun 2024
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen
Jiatong Han
Muhammed Razzak
Lisa Schut
Shreshth A. Malik
Yarin Gal
HILM
175
84
0
22 Jun 2024
Understanding Finetuning for Factual Knowledge Extraction
Gaurav R. Ghosal
Tatsunori Hashimoto
Aditi Raghunathan
112
23
0
20 Jun 2024
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM Pipelines
Wenbo Sun
Jiaqi Wang
Qiming Guo
Ziyu Li
Wenlu Wang
Rihan Hai
98
11
0
20 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
119
1
0
19 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
Daniela Gottesman
Mor Geva
143
21
0
18 Jun 2024
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
Hoyeon Chang
Jinho Park
Seonghyeon Ye
Sohee Yang
Youngkyung Seo
Du-Seong Chang
Minjoon Seo
KELM
154
65
0
17 Jun 2024
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
Shengkang Wang
Hongzhan Lin
Ziyang Luo
Zhen Ye
Guang Chen
Jing Ma
251
5
0
17 Jun 2024
DIEKAE: Difference Injection for Efficient Knowledge Augmentation and Editing of Large Language Models
Alessio Galatolo
Meriem Beloucif
Katie Winkle
85
0
0
15 Jun 2024
DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs
Haishuo Fang
Xiaodan Zhu
Iryna Gurevych
AI4CE
77
3
0
11 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELM, ALM, LM&MA
251
58
0
09 Jun 2024
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
Dongkyu Lee
Chandana Satya Prakash
Jack G. M. FitzGerald
Jens Lehmann
RALM
136
4
0
07 Jun 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
Shuyang Jiang
Yusheng Liao
Ya Zhang
Yu Wang
Yanfeng Wang
109
5
0
30 May 2024
Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback
Jingwei Sun
Zhixu Du
Yiran Chen
KELM
90
3
0
30 May 2024
SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation
Abhishek Divekar
Greg Durrett
187
12
0
16 May 2024
SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluation
Yuwei Wan
Yixuan Liu
Aswathy Ajith
Clara Grazian
B. Hoex
Wenjie Zhang
Chunyu Kit
Tong Xie
Ian Foster
122
16
0
16 May 2024
Self-Improving Customer Review Response Generation Based on LLMs
Guy Azov
Tatiana Pelc
Adi Fledel Alon
Gila Kamhi
103
3
0
06 May 2024
Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding
Zheng Zhao
Emilio Monti
Jens Lehmann
H. Assem
131
40
0
04 May 2024
Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents
Sneha Singhania
Simon Razniewski
Gerhard Weikum
RALM
183
1
0
04 May 2024
Multi-hop Question Answering over Knowledge Graphs using Large Language Models
Abir Chakraborty
KELM, RALM
131
8
0
30 Apr 2024
Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Jiaqing Yuan
Lin Pan
Chung-Wei Hang
Jiang Guo
Jiarong Jiang
Bonan Min
Patrick Ng
Zhiguo Wang
HILM, ELM
106
4
0
24 Apr 2024
Return of EM: Entity-driven Answer Set Expansion for QA Evaluation
Dongryeol Lee
Minwoo Lee
Kyungmin Min
Joonsuk Park
Kyomin Jung
149
2
0
24 Apr 2024