2112.04426
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory
Lei Liu
Xiaoyan Yang
Yue Shen
Binbin Hu
Qing Cui
Jinjie Gu
Guannan Zhang
LRM
LLMAG
KELM
271
41
0
15 Nov 2023
Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ruyao Xu
Taolin Zhang
Chengyu Wang
Zhongjie Duan
Cen Chen
Minghui Qiu
Dawei Cheng
Xiaofeng He
Weining Qian
179
1
0
12 Nov 2023
Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Zhangyin Feng
Weitao Ma
Weijiang Yu
Lei Huang
Haotian Wang
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
KELM
287
49
0
10 Nov 2023
AI-native Interconnect Framework for Integration of Large Language Model Technologies in 6G Systems
Sasu Tarkoma
Roberto Morabito
Jaakko Sauvola
357
32
0
10 Nov 2023
Evaluating Generative Ad Hoc Information Retrieval
Lukas Gienapp
Harrisen Scells
Niklas Deckers
Janek Bevendorff
Shuai Wang
...
Maik Fröbe
Guido Zuccon
Benno Stein
Matthias Hagen
Martin Potthast
RALM
419
23
0
08 Nov 2023
Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning
Sai Munikoti
Anurag Acharya
S. Wagle
Sameera Horawalavithana
LRM
137
10
0
07 Nov 2023
A Survey of Large Language Models Attribution
Dongfang Li
Zetian Sun
Xinshuo Hu
Zhenyu Liu
Ziyang Chen
Baotian Hu
Aiguo Wu
Min Zhang
HILM
284
76
0
07 Nov 2023
Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lang Cao
265
30
0
02 Nov 2023
Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
124
9
0
01 Nov 2023
ChipNeMo: Domain-Adapted LLMs for Chip Design
Mingjie Liu
Teodor-Dumitru Ene
Robert M. Kirby
Chris Cheng
N. Pinckney
...
Pratik P Suthar
Varun Tej
Walker J. Turner
Kaizhe Xu
Haoxing Ren
746
229
0
31 Oct 2023
Defining a New NLP Playground
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sha Li
Chi Han
Pengfei Yu
Carl Edwards
...
Yi R. Fung
Charles Yu
Joel R. Tetreault
Eduard H. Hovy
Heng Ji
380
5
0
31 Oct 2023
General-Purpose Retrieval-Enhanced Medical Prediction Model Using Near-Infinite History
Machine Learning in Health Care (MLHC), 2023
Junu Kim
Chaeeun Shim
Bosco Seong Kyu Yang
Chami Im
Sung Yoon Lim
Han-Gil Jeong
Edward Choi
364
10
0
31 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
248
14
0
29 Oct 2023
Knowledge Corpus Error in Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yejoon Lee
Philhoon Oh
Hyunjung Shim
149
2
0
27 Oct 2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models
Science China Information Sciences (Sci China Inf Sci), 2023
Xinglong Mao
Chaoyou Fu
Zhengye Zhang
Tong Xu
Hao Wang
Dianbo Sui
Chunjiang Ge
Ke Li
Xingguo Sun
Enhong Chen
VLM
MLLM
335
197
0
24 Oct 2023
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
LRM
KELM
227
25
0
23 Oct 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoyan Yang
Zhitao Li
Yong Zhang
Jianzong Wang
Ning Cheng
Ming Li
Jing Xiao
RALM
207
41
0
23 Oct 2023
The Law and NLP: Bridging Disciplinary Disconnects
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Robert Mahari
Dominik Stammbach
Elliott Ash
Alex Pentland
AILaw
217
12
0
22 Oct 2023
Knowledge-Augmented Language Model Verification
Jinheon Baek
Soyeong Jeong
Minki Kang
Jong C. Park
Sung Ju Hwang
RALM
167
19
0
19 Oct 2023
Reliable Academic Conference Question Answering: A Study Based on Large Language Model
Zhiwei Huang
Long Jin
Junjie Wang
Mingchen Tu
Yin Hua
Zhiqiang Liu
Jiawei Meng
Hua-zeng Chen
Wen Zhang
197
1
0
19 Oct 2023
Emptying the Ocean with a Spoon: Should We Edit Models?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuval Pinter
Michael Elhadad
KELM
233
29
0
18 Oct 2023
If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in History
Workshop on Computational Humanities Research (CHR), 2023
Giselle Gonzalez Garcia
Christian D. Weilbach
61
9
0
16 Oct 2023
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jingcheng Deng
Liang Pang
Huawei Shen
Xueqi Cheng
RALM
272
14
0
16 Oct 2023
Farzi Data: Autoregressive Data Distillation
Noveen Sachdeva
Zexue He
Wang-Cheng Kang
Jianmo Ni
D. Cheng
Julian McAuley
DD
249
4
0
15 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
409
41
0
15 Oct 2023
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
Md. Rony
Christian Suess
Sinchana Ramakanth Bhat
Viju Sudhi
Julia Schneider
Maximilian Vogel
Roman Teucher
Ken E. Friedl
S. Sahoo
223
15
0
14 Oct 2023
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sehyun Choi
Tianqing Fang
Zhaowei Wang
Yangqiu Song
224
53
0
13 Oct 2023
MemGPT: Towards LLMs as Operating Systems
Charles Packer
Sarah Wooders
Kevin Lin
Vivian Fang
Shishir G. Patil
Ion Stoica
Joseph E. Gonzalez
RALM
1.7K
321
0
12 Oct 2023
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
International Conference on Machine Learning (ICML), 2023
Wei Ping
Ming-Yu Liu
Lawrence C. McAfee
Peng Xu
Bo Li
Mohammad Shoeybi
Bryan Catanzaro
RALM
466
69
0
11 Oct 2023
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Luiza Amador Pozzobon
Beyza Ermis
Patrick Lewis
Sara Hooker
296
27
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Yongfeng Zhang
Xing Xie
Zheng Zhang
Yue Zhang
HILM
KELM
450
258
0
11 Oct 2023
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zihan Zhang
Meng Fang
Lingxi Chen
Mohammad-Reza Namazi-Rad
Jun Wang
KELM
235
39
0
11 Oct 2023
CacheGen: KV Cache Compression and Streaming for Fast Language Model Serving
Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023
Yuhan Liu
Hanchen Li
Yihua Cheng
Siddhant Ray
Yuyang Huang
...
Ganesh Ananthanarayanan
Michael Maire
Henry Hoffmann
Ari Holtzman
Junchen Jiang
566
141
0
11 Oct 2023
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
Kexun Zhang
Hongqiao Chen
Lei Li
Wenjie Wang
272
7
0
10 Oct 2023
Text Embeddings Reveal (Almost) As Much As Text
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
John X. Morris
Volodymyr Kuleshov
Vitaly Shmatikov
Alexander M. Rush
RALM
294
165
0
10 Oct 2023
SALMON: Self-Alignment with Instructable Reward Models
International Conference on Learning Representations (ICLR), 2023
Zhiqing Sun
Songlin Yang
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
ALM
SyDa
353
53
0
09 Oct 2023
What do larger image classifiers memorise?
Michal Lukasik
Vaishnavh Nagarajan
A. S. Rawat
A. Menon
Sanjiv Kumar
258
5
0
09 Oct 2023
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
LRM
216
27
0
08 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALM
KELM
241
80
0
08 Oct 2023
Prompt-augmented Temporal Point Process for Streaming Event Sequence
Neural Information Processing Systems (NeurIPS), 2023
Siqiao Xue
Yan Wang
Zhixuan Chu
Xiaoming Shi
Caigao Jiang
Hongyan Hao
Gangwei Jiang
Xiaoyun Feng
James Y. Zhang
Junqing Zhou
AI4TS
270
30
0
08 Oct 2023
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Tian Jin
Nolan Clement
Xin Dong
Vaishnavh Nagarajan
Michael Carbin
Jonathan Ragan-Kelley
Gintare Karolina Dziugaite
LRM
331
5
0
07 Oct 2023
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation
Fangyuan Xu
Weijia Shi
Eunsol Choi
RALM
341
221
0
06 Oct 2023
Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models
International Conference on Learning Representations (ICLR), 2023
Junchi Yu
Xiao-Yu Zhang
Rex Ying
LRM
454
39
0
06 Oct 2023
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhen Wan
Yating Zhang
Yexiang Wang
Fei Cheng
Sadao Kurohashi
CLL
AILaw
246
14
0
05 Oct 2023
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tu Vu
Mohit Iyyer
Xuezhi Wang
Noah Constant
Jerry W. Wei
...
Chris Tar
Yun-hsuan Sung
Denny Zhou
Quoc Le
Thang Luong
KELM
HILM
LRM
535
300
0
05 Oct 2023
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Educational Data Mining (EDM), 2023
Zachary Levonian
Chenglu Li
Wangda Zhu
Anoushka Gade
Owen Henkel
Millie-Ellen Postle
Wanli Xing
AI4Ed
RALM
231
54
0
04 Oct 2023
Retrieval meets Long Context Large Language Models
International Conference on Learning Representations (ICLR), 2023
Peng Xu
Ming-Yu Liu
Xianchao Wu
Lawrence C. McAfee
Chen Zhu
Zihan Liu
Sandeep Subramanian
Evelina Bakhturina
Mohammad Shoeybi
Bryan Catanzaro
RALM
LRM
458
111
0
04 Oct 2023
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
International Conference on Learning Representations (ICLR), 2023
Xi Lin
Xilun Chen
Mingda Chen
Weijia Shi
Maria Lomeli
...
Jacob Kahn
Gergely Szilvasy
Mike Lewis
Luke Zettlemoyer
Scott Yih
RALM
430
208
0
02 Oct 2023
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
International Conference on Learning Representations (ICLR), 2023
Qingqing Cao
Sewon Min
Yizhong Wang
Hannaneh Hajishirzi
MQ
RALM
224
7
0
02 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine Translation
International Conference on Learning Representations (ICLR), 2023
Gabriele Sarti
Grzegorz Chrupala
Malvina Nissim
Arianna Bisazza
292
5
0
02 Oct 2023