Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2002.08910
Cited By
v1
v2
v3
v4 (latest)
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"How Much Knowledge Can You Pack Into the Parameters of a Language Model?"
50 / 627 papers shown
Title
TopiOCQA: Open-domain Conversational Question Answering with Topic Switching
Vaibhav Adlakha
Shehzaad Dhuliawala
Kaheer Suleman
H. D. Vries
Siva Reddy
BDL
198
104
0
02 Oct 2021
A Survey of Knowledge Enhanced Pre-trained Models
Jian Yang
Xinyu Hu
Gang Xiao
Yulong Shen
KELM
291
7
0
01 Oct 2021
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
KELM
LRM
351
65
0
29 Sep 2021
More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering
Yang Bai
D. Wang
205
11
0
25 Sep 2021
RETRONLU: Retrieval Augmented Task-Oriented Semantic Parsing
Vivek Gupta
Akshat Shrivastava
Adithya Sagar
Armen Aghajanyan
Denis Savenkov
RALM
99
23
0
21 Sep 2021
Distilling Relation Embeddings from Pre-trained Language Models
Asahi Ushio
Jose Camacho-Collados
Steven Schockaert
91
24
0
21 Sep 2021
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
...
Xinxian Huang
Xin Tian
Xinchao Xu
Yingzhan Lin
Zhengyu Niu
VLM
ALM
101
66
0
20 Sep 2021
Towards Zero-Label Language Learning
Zirui Wang
Adams Wei Yu
Orhan Firat
Yuan Cao
SyDa
285
107
0
19 Sep 2021
Do Language Models Know the Way to Rome?
Bastien Liétard
Mostafa Abdou
Anders Søgaard
139
22
0
16 Sep 2021
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
Michael J.Q. Zhang
Eunsol Choi
RALM
141
162
0
13 Sep 2021
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre
Kartik Perisetla
Anthony Chen
Nikhil Ramesh
Chris DuBois
Sameer Singh
HILM
448
292
0
10 Sep 2021
R2-D2: A Modular Baseline for Open-Domain Question Answering
Martin Fajcik
Martin Docekal
Karel Ondrej
Pavel Smrz
102
48
0
08 Sep 2021
General-Purpose Question-Answering with Macaw
Oyvind Tafjord
Peter Clark
SyDa
ELM
MLLM
101
62
0
06 Sep 2021
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
192
24
0
01 Sep 2021
Robust Retrieval Augmented Generation for Zero-shot Slot Filling
Michael R. Glass
Gaetano Rossiello
Md. Faisal Mahbub Chowdhury
A. Gliozzo
RALM
126
33
0
31 Aug 2021
Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning
Cunxiang Wang
Boyuan Zheng
Y. Niu
Yue Zhang
LRM
140
23
0
15 Aug 2021
How Optimal is Greedy Decoding for Extractive Question Answering?
Or Castel
Ori Ram
Avia Efrat
Omer Levy
138
4
0
12 Aug 2021
Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering
Alexander Hanbo Li
Patrick Ng
Peng Xu
Henghui Zhu
Zhiguo Wang
Bing Xiang
LMTD
251
33
0
05 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
142
392
0
04 Aug 2021
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
106
16
0
04 Aug 2021
Automatic Claim Review for Climate Science via Explanation Generation
Shraey Bhatia
Jey Han Lau
Timothy Baldwin
56
5
0
30 Jul 2021
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oğuz
Kushal Lakhotia
Anchit Gupta
Patrick Lewis
Vladimir Karpukhin
...
Xilun Chen
Sebastian Riedel
Anuj Kumar
Sonal Gupta
Yashar Mehdad
RALM
111
69
0
28 Jul 2021
One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval
Akari Asai
Xinyan Velocity Yu
Jungo Kasai
Hannaneh Hajishirzi
RALM
LRM
159
77
0
26 Jul 2021
Time-Aware Language Models as Temporal Knowledge Bases
Bhuwan Dhingra
Jeremy R. Cole
Julian Martin Eisenschlos
D. Gillick
Jacob Eisenstein
William W. Cohen
KELM
233
304
0
29 Jun 2021
Multimodal Few-Shot Learning with Frozen Language Models
Maria Tsimpoukelli
Jacob Menick
Serkan Cabi
S. M. Ali Eslami
Oriol Vinyals
Felix Hill
MLLM
363
845
0
25 Jun 2021
Biomedical Interpretable Entity Representations
Diego Garcia-Olano
Yasumasa Onoe
Ioana Baldini
Joydeep Ghosh
Byron C. Wallace
Kush R. Varshney
AI4CE
77
3
0
17 Jun 2021
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
Lingyong Yan
M. Liao
Tong Xue
Jin Xu
121
142
0
17 Jun 2021
Probing Pre-Trained Language Models for Disease Knowledge
Israa Alghanmi
Luis Espinosa-Anke
Steven Schockaert
LM&MA
ELM
109
13
0
14 Jun 2021
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
226
925
0
14 Jun 2021
Memory-efficient Transformers via Top-
k
k
k
Attention
Ankit Gupta
Guy Dar
Shaya Goodman
David Ciprut
Jonathan Berant
MQ
153
66
0
13 Jun 2021
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
Bhargavi Paranjape
Julian Michael
Marjan Ghazvininejad
Luke Zettlemoyer
Hannaneh Hajishirzi
ReLM
LRM
96
69
0
12 Jun 2021
End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Devendra Singh Sachan
Siva Reddy
William L. Hamilton
Chris Dyer
Dani Yogatama
OOD
RALM
169
174
0
09 Jun 2021
Translate, then Parse! A strong baseline for Cross-Lingual AMR Parsing
S. Uhrig
Yoalli Rezepka Garcia
Juri Opitz
Anette Frank
138
27
0
08 Jun 2021
BERTnesia: Investigating the capture and forgetting of knowledge in BERT
Jonas Wallat
Jaspreet Singh
Avishek Anand
CLL
KELM
222
62
0
05 Jun 2021
Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?
Cunxiang Wang
Pai Liu
Yue Zhang
RALM
139
87
0
03 Jun 2021
Answer Generation for Retrieval-based Question Answering Systems
Chao-Chun Hsu
Eric Lind
Luca Soldaini
Alessandro Moschitti
97
27
0
02 Jun 2021
Implicit Representations of Meaning in Neural Language Models
Belinda Z. Li
Maxwell Nye
Jacob Andreas
NAI
MILM
169
194
0
01 Jun 2021
On the Interplay Between Fine-tuning and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
110
14
0
31 May 2021
Automatic Fake News Detection: Are Models Learning to Reason?
Casper Hansen
Christian B. Hansen
Lucas Chaves Lima
100
13
0
17 May 2021
Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters
Yan Xu
Etsuko Ishii
Samuel Cahyawijaya
Zihan Liu
Genta Indra Winata
Andrea Madotto
Jane Polak Scowcroft
Pascale Fung
RALM
91
46
0
13 May 2021
Efficient Retrieval Optimized Multi-task Learning
He Fun
S. Gandhi
Sujith Ravi
RALM
94
6
0
20 Apr 2021
On the Influence of Masking Policies in Intermediate Pre-training
Qinyuan Ye
Belinda Z. Li
Sinong Wang
Benjamin Bolte
Hao Ma
Anuj Kumar
Xiang Ren
Madian Khabsa
122
12
0
18 Apr 2021
GooAQ: Open Question Answering with Diverse Answer Types
Daniel Khashabi
Amos Ng
Tushar Khot
Ashish Sabharwal
Hannaneh Hajishirzi
Chris Callison-Burch
115
61
0
18 Apr 2021
Simple and Efficient ways to Improve REALM
Vidhisha Balachandran
Ashish Vaswani
Yulia Tsvetkov
Niki Parmar
VLM
82
5
0
18 Apr 2021
Knowledge Neurons in Pretrained Transformers
Damai Dai
Li Dong
Y. Hao
Zhifang Sui
Baobao Chang
Furu Wei
KELM
MU
321
516
0
18 Apr 2021
Zero-shot Slot Filling with DPR and RAG
Michael R. Glass
Gaetano Rossiello
A. Gliozzo
59
1
0
17 Apr 2021
Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding
Nouha Dziri
Andrea Madotto
Osmar Zaiane
A. Bose
HILM
106
145
0
17 Apr 2021
Enriching a Model's Notion of Belief using a Persistent Memory
Nora Kassner
Oyvind Tafjord
Hinrich Schütze
Peter Clark
CLL
RALM
KELM
97
6
0
16 Apr 2021
Editing Factual Knowledge in Language Models
Nicola De Cao
Wilker Aziz
Ivan Titov
KELM
225
551
0
16 Apr 2021
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Eric P. Lehman
Sarthak Jain
Karl Pichotta
Yoav Goldberg
Byron C. Wallace
OOD
MIACV
128
128
0
15 Apr 2021
Previous
1
2
3
...
10
11
12
13
Next