ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.04426
  4. Cited By
Improving language models by retrieving from trillions of tokens
v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
    KELMRALM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown
Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term
  Memory
Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory
Lei Liu
Xiaoyan Yang
Yue Shen
Binbin Hu
Qing Cui
Jinjie Gu
Guannan Zhang
LRMLLMAGKELM
271
41
0
15 Nov 2023
Learning Knowledge-Enhanced Contextual Language Representations for
  Domain Natural Language Understanding
Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ruyao Xu
Taolin Zhang
Chengyu Wang
Zhongjie Duan
Cen Chen
Minghui Qiu
Dawei Cheng
Xiaofeng He
Weining Qian
179
1
0
12 Nov 2023
Trends in Integration of Knowledge and Large Language Models: A Survey
  and Taxonomy of Methods, Benchmarks, and Applications
Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Zhangyin Feng
Weitao Ma
Weijiang Yu
Lei Huang
Haotian Wang
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
KELM
287
49
0
10 Nov 2023
AI-native Interconnect Framework for Integration of Large Language Model
  Technologies in 6G Systems
AI-native Interconnect Framework for Integration of Large Language Model Technologies in 6G Systems
Sasu Tarkoma
Roberto Morabito
Jaakko Sauvola
357
32
0
10 Nov 2023
Evaluating Generative Ad Hoc Information Retrieval
Evaluating Generative Ad Hoc Information Retrieval
Lukas Gienapp
Harrisen Scells
Niklas Deckers
Janek Bevendorff
Shuai Wang
...
Maik Fröbe
Guide Zucoon
Benno Stein
Matthias Hagen
Martin Potthast
RALM
419
23
0
08 Nov 2023
Evaluating the Effectiveness of Retrieval-Augmented Large Language
  Models in Scientific Document Reasoning
Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning
Sai Munikoti
Anurag Acharya
S. Wagle
Sameera Horawalavithana
LRM
137
10
0
07 Nov 2023
A Survey of Large Language Models Attribution
A Survey of Large Language Models Attribution
Dongfang Li
Zetian Sun
Xinshuo Hu
Zhenyu Liu
Ziyang Chen
Baotian Hu
Aiguo Wu
Min Zhang
HILM
284
76
0
07 Nov 2023
Learn to Refuse: Making Large Language Models More Controllable and
  Reliable through Knowledge Scope Limitation and Refusal Mechanism
Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal MechanismConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lang Cao
265
30
0
02 Nov 2023
Attention Alignment and Flexible Positional Embeddings Improve
  Transformer Length Extrapolation
Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
124
9
0
01 Nov 2023
ChipNeMo: Domain-Adapted LLMs for Chip Design
ChipNeMo: Domain-Adapted LLMs for Chip Design
Mingjie Liu
Teodor-Dumitru Ene
Robert M. Kirby
Chris Cheng
N. Pinckney
...
Pratik P Suthar
Varun Tej
Walker J. Turner
Kaizhe Xu
Haoxin Ren
746
229
0
31 Oct 2023
Defining a New NLP Playground
Defining a New NLP PlaygroundConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sha Li
Chi Han
Pengfei Yu
Carl Edwards
Pengfei Yu
...
Yi R. Fung
Charles Yu
Joel R. Tetreault
Eduard H. Hovy
Heng Ji
380
5
0
31 Oct 2023
General-Purpose Retrieval-Enhanced Medical Prediction Model Using
  Near-Infinite History
General-Purpose Retrieval-Enhanced Medical Prediction Model Using Near-Infinite HistoryMachine Learning in Health Care (MLHC), 2023
Junu Kim
Chaeeun Shim
Bosco Seong Kyu Yang
Chami Im
Sung Yoon Lim
Han-Gil Jeong
Edward Choi
364
10
0
31 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
  Modeling Likewise
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
248
14
0
29 Oct 2023
Knowledge Corpus Error in Question Answering
Knowledge Corpus Error in Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yejoon Lee
Philhoon Oh
Hyunjung Shim
149
2
0
27 Oct 2023
Woodpecker: Hallucination Correction for Multimodal Large Language
  Models
Woodpecker: Hallucination Correction for Multimodal Large Language ModelsScience China Information Sciences (Sci China Inf Sci), 2023
Xinglong Mao
Chaoyou Fu
Zhengye Zhang
Tong Xu
Hao Wang
Dianbo Sui
Chunjiang Ge
Ke Li
Xingguo Sun
Enhong Chen
VLMMLLM
335
197
0
24 Oct 2023
Large Search Model: Redefining Search Stack in the Era of LLMs
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
LRMKELM
227
25
0
23 Oct 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question
  Answering via Pluggable Reward-Driven Contextual Adapter
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual AdapterConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haoyan Yang
Zhitao Li
Yong Zhang
Jianzong Wang
Ning Cheng
Ming Li
Jing Xiao
RALM
207
41
0
23 Oct 2023
The Law and NLP: Bridging Disciplinary Disconnects
The Law and NLP: Bridging Disciplinary DisconnectsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Robert Mahari
Dominik Stammbach
Elliott Ash
Alex Pentland
AILaw
217
12
0
22 Oct 2023
Knowledge-Augmented Language Model Verification
Knowledge-Augmented Language Model Verification
Jinheon Baek
Soyeong Jeong
Minki Kang
Jong C. Park
Sung Ju Hwang
RALM
167
19
0
19 Oct 2023
Reliable Academic Conference Question Answering: A Study Based on Large
  Language Model
Reliable Academic Conference Question Answering: A Study Based on Large Language Model
Zhiwei Huang
Long Jin
Junjie Wang
Mingchen Tu
Yin Hua
Zhiqiang Liu
Jiawei Meng
Hua-zeng Chen
Wen Zhang
197
1
0
19 Oct 2023
Emptying the Ocean with a Spoon: Should We Edit Models?
Emptying the Ocean with a Spoon: Should We Edit Models?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuval Pinter
Michael Elhadad
KELM
233
29
0
18 Oct 2023
If the Sources Could Talk: Evaluating Large Language Models for Research
  Assistance in History
If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in HistoryWorkshop on Computational Humanities Research (CHR), 2023
Giselle Gonzalez Garcia
Christian D. Weilbach
61
9
0
16 Oct 2023
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder
  for Language Modeling
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jingcheng Deng
Liang Pang
Huawei Shen
Xueqi Cheng
RALM
272
14
0
16 Oct 2023
Farzi Data: Autoregressive Data Distillation
Farzi Data: Autoregressive Data Distillation
Noveen Sachdeva
Zexue He
Wang-Cheng Kang
Jianmo Ni
D. Cheng
Julian McAuley
DD
249
4
0
15 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
409
41
0
15 Oct 2023
CarExpert: Leveraging Large Language Models for In-Car Conversational
  Question Answering
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
Md. Rony
Christian Suess
Sinchana Ramakanth Bhat
Viju Sudhi
Julia Schneider
Maximilian Vogel
Roman Teucher
Ken E. Friedl
S. Sahoo
223
15
0
14 Oct 2023
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level
  Hallucination Detection
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination DetectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sehyun Choi
Tianqing Fang
Zhaowei Wang
Yangqiu Song
224
53
0
13 Oct 2023
MemGPT: Towards LLMs as Operating Systems
MemGPT: Towards LLMs as Operating Systems
Charles Packer
Sarah Wooders
Kevin Lin
Vivian Fang
Shishir G. Patil
Ion Stoica
Alfons Kemper
RALM
1.7K
321
0
12 Oct 2023
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
InstructRetro: Instruction Tuning post Retrieval-Augmented PretrainingInternational Conference on Machine Learning (ICML), 2023
Wei Ping
Ming-Yu Liu
Lawrence C. McAfee
Peng Xu
Bo Li
Mohammad Shoeybi
Bryan Catanzaro
RALM
466
69
0
11 Oct 2023
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented
  Models
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Luiza Amador Pozzobon
Beyza Ermis
Patrick Lewis
Sara Hooker
296
27
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and
  Domain-Specificity
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Yongfeng Zhang
Xing Xie
Zheng Zhang
Yue Zhang
HILMKELM
450
258
0
11 Oct 2023
How Do Large Language Models Capture the Ever-changing World Knowledge?
  A Review of Recent Advances
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent AdvancesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zihan Zhang
Meng Fang
Lingxi Chen
Mohammad-Reza Namazi-Rad
Jun Wang
KELM
235
39
0
11 Oct 2023
CacheGen: KV Cache Compression and Streaming for Fast Language Model
  Serving
CacheGen: KV Cache Compression and Streaming for Fast Language Model ServingConference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023
Yuhan Liu
Hanchen Li
Yihua Cheng
Siddhant Ray
Yuyang Huang
...
Ganesh Ananthanarayanan
Michael Maire
Henry Hoffmann
Ari Holtzman
Junchen Jiang
566
141
0
11 Oct 2023
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained
  Decoding
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
Kexun Zhang
Hongqiao Chen
Lei Li
Wenjie Wang
272
7
0
10 Oct 2023
Text Embeddings Reveal (Almost) As Much As Text
Text Embeddings Reveal (Almost) As Much As TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
John X. Morris
Volodymyr Kuleshov
Vitaly Shmatikov
Alexander M. Rush
RALM
294
165
0
10 Oct 2023
SALMON: Self-Alignment with Instructable Reward Models
SALMON: Self-Alignment with Instructable Reward ModelsInternational Conference on Learning Representations (ICLR), 2023
Zhiqing Sun
Songlin Yang
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
ALMSyDa
353
53
0
09 Oct 2023
What do larger image classifiers memorise?
What do larger image classifiers memorise?
Michal Lukasik
Vaishnavh Nagarajan
A. S. Rawat
A. Menon
Sanjiv Kumar
258
5
0
09 Oct 2023
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on
  Open-Source Model
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source ModelNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
LRM
216
27
0
08 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Self-Knowledge Guided Retrieval Augmentation for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALMKELM
241
80
0
08 Oct 2023
Prompt-augmented Temporal Point Process for Streaming Event Sequence
Prompt-augmented Temporal Point Process for Streaming Event SequenceNeural Information Processing Systems (NeurIPS), 2023
Siqiao Xue
Yan Wang
Zhixuan Chu
Xiaoming Shi
Caigao Jiang
Hongyan Hao
Gangwei Jiang
Xiaoyun Feng
James Y. Zhang
Junqing Zhou
AI4TS
270
30
0
08 Oct 2023
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates
  before In-Context Learning
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
Tian Jin
Nolan Clement
Xin Dong
Vaishnavh Nagarajan
Michael Carbin
Jonathan Ragan-Kelley
Gintare Karolina Dziugaite
LRM
331
5
0
07 Oct 2023
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective
  Augmentation
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation
Fangyuan Xu
Weijia Shi
Eunsol Choi
RALM
341
221
0
06 Oct 2023
Thought Propagation: An Analogical Approach to Complex Reasoning with
  Large Language Models
Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Junchi Yu
Xiao-Yu Zhang
Rex Ying
LRM
454
39
0
06 Oct 2023
Reformulating Domain Adaptation of Large Language Models as
  Adapt-Retrieve-Revise
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-ReviseAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhen Wan
Yating Zhang
Yexiang Wang
Fei Cheng
Sadao Kurohashi
CLLAILaw
246
14
0
05 Oct 2023
FreshLLMs: Refreshing Large Language Models with Search Engine
  Augmentation
FreshLLMs: Refreshing Large Language Models with Search Engine AugmentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Tu Vu
Mohit Iyyer
Xuezhi Wang
Noah Constant
Jerry W. Wei
...
Chris Tar
Yun-hsuan Sung
Denny Zhou
Quoc Le
Thang Luong
KELMHILMLRM
535
300
0
05 Oct 2023
Retrieval-augmented Generation to Improve Math Question-Answering:
  Trade-offs Between Groundedness and Human Preference
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human PreferenceEducational Data Mining (EDM), 2023
Zachary Levonian
Chenglu Li
Wangda Zhu
Anoushka Gade
Owen Henkel
Millie-Ellen Postle
Wanli Xing
AI4EdRALM
231
54
0
04 Oct 2023
Retrieval meets Long Context Large Language Models
Retrieval meets Long Context Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Peng Xu
Ming-Yu Liu
Xianchao Wu
Lawrence C. McAfee
Chen Zhu
Zihan Liu
Sandeep Subramanian
Evelina Bakhturina
Mohammad Shoeybi
Bryan Catanzaro
RALMLRM
458
111
0
04 Oct 2023
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
RA-DIT: Retrieval-Augmented Dual Instruction TuningInternational Conference on Learning Representations (ICLR), 2023
Xi Lin
Xilun Chen
Mingda Chen
Weijia Shi
Maria Lomeli
...
Jacob Kahn
Gergely Szilvasy
Mike Lewis
Luke Zettlemoyer
Scott Yih
RALM
430
208
0
02 Oct 2023
BTR: Binary Token Representations for Efficient Retrieval Augmented
  Language Models
BTR: Binary Token Representations for Efficient Retrieval Augmented Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Qingqing Cao
Sewon Min
Yizhong Wang
Hannaneh Hajishirzi
MQRALM
224
7
0
02 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine
  Translation
Quantifying the Plausibility of Context Reliance in Neural Machine TranslationInternational Conference on Learning Representations (ICLR), 2023
Gabriele Sarti
Grzegorz Chrupala
Malvina Nissim
Arianna Bisazza
292
5
0
02 Oct 2023
Previous
123...111213...161718
Next