Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.04426
Cited By
v1
v2
v3 (latest)
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
CiteME: Can Language Models Accurately Cite Scientific Claims?
Ori Press
Andreas Hochlehnert
Christian Schroeder de Witt
Vishaal Udandarao
Ofir Press
Matthias Bethge
292
29
0
10 Jul 2024
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Rulin Shao
Jacqueline He
Akari Asai
Weijia Shi
Tim Dettmers
Sewon Min
Luke Zettlemoyer
Pang Wei Koh
RALM
326
52
0
09 Jul 2024
Mixture of A Million Experts
Xu Owen He
MoE
375
50
0
04 Jul 2024
DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation
Taeho Hwang
Soyeong Jeong
Sukmin Cho
SeungYoon Han
Jong C. Park
RALM
379
3
0
04 Jul 2024
The Structure of Financial Equity Research Reports -- Identification of the Most Frequently Asked Questions in Financial Analyst Reports to Automate Equity Research Using Llama 3 and GPT-4
Adria Pop
Jan Spörer
128
0
0
04 Jul 2024
Neurocache: Efficient Vector Retrieval for Long-range Language Modeling
Ali Safaya
Deniz Yuret
207
0
0
02 Jul 2024
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Yue Yu
Ming-Yu Liu
Zihan Liu
Wei Ping
Jiaxuan You
Chao Zhang
Mohammad Shoeybi
Bryan Catanzaro
ALM
RALM
369
164
0
02 Jul 2024
Memory
3
\text{Memory}^3
Memory
3
: Language Modeling with Explicit Memory
Hongkang Yang
Peng Liu
Wenjin Wang
Huayi Lai
Zhiyu Li
...
Yu Yu
Kai Chen
Feiyu Xiong
Linpeng Tang
Weinan E
235
33
0
01 Jul 2024
SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest
Christoforus Yoga Haryanto
Minh Hieu Vu
Trung Duc Nguyen
Emily Lomempow
Yulia Nurliana
Sona Taheri
215
3
0
01 Jul 2024
BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering
Zheng Chu
Jingchang Chen
Qianglong Chen
Haotian Wang
Kun Zhu
Xiyuan Du
Weijiang Yu
Ming Liu
Bing Qin
LRM
252
14
0
28 Jun 2024
RAVEN: Multitask Retrieval Augmented Vision-Language Learning
Varun Nagaraj Rao
Siddharth Choudhary
Aditya Deshpande
R. Satzoda
Srikar Appalaraju
RALM
VLM
282
7
0
27 Jun 2024
Banishing LLM Hallucinations Requires Rethinking Generalization
Johnny Li
Saksham Consul
Eda Zhou
James Wong
Naila Farooqui
...
Zhuxiaona Wei
Tian Wu
Ben Echols
Sharon Zhou
Gregory Diamos
LRM
307
20
0
25 Jun 2024
Entropy-Based Decoding for Retrieval-Augmented Large Language Models
Zexuan Qiu
Chinmay Pani
Bin Wu
Jingjing Li
Aiwei Liu
Irwin King
KELM
RALM
409
12
0
25 Jun 2024
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track
Ronak Pradeep
Nandan Thakur
Sahel Sharifymoghaddam
Eric Zhang
Ryan Nguyen
Daniel Campos
Nick Craswell
Jimmy Lin
289
31
0
24 Jun 2024
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
Cheng-Yu Hsieh
Yung-Sung Chuang
Chun-Liang Li
Zifeng Wang
Long T. Le
...
James R. Glass
Alexander Ratner
Zifeng Wang
Ranjay Krishna
Tomas Pfister
350
74
0
23 Jun 2024
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Ziyan Jiang
Xueguang Ma
Wenhu Chen
RALM
395
103
0
21 Jun 2024
Data-Centric AI in the Age of Large Language Models
Xinyi Xu
Zhaoxuan Wu
Rui Qiao
Arun Verma
Yao Shu
...
Xiaoqiang Lin
Wenyang Hu
Zhongxiang Dai
Pang Wei Koh
Bryan Kian Hsiang Low
ALM
360
4
0
20 Jun 2024
Augmenting Query and Passage for Retrieval-Augmented Generation using LLMs for Open-Domain Question Answering
Minsang Kim
Cheoneum Park
Seungjun Baek
RALM
170
0
0
20 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
290
17
0
19 Jun 2024
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Di Wu
Jia-Chen Gu
Fan Yin
Nanyun Peng
Kai-Wei Chang
HILM
153
5
0
19 Jun 2024
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jirui Qi
Gabriele Sarti
Raquel Fernández
Arianna Bisazza
RALM
275
12
0
19 Jun 2024
In-Context Former: Lightning-fast Compressing Context for Large Language Model
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiangfeng Wang
Zaiyi Chen
Zheyong Xie
Tong Xu
Yongyi He
Enhong Chen
202
9
0
19 Jun 2024
In-Context In-Context Learning with Transformer Neural Processes
Symposium on Advances in Approximate Bayesian Inference (AABI), 2024
Matthew Ashman
Cristiana-Diana Diaconu
Adrian Weller
Richard E. Turner
230
4
0
19 Jun 2024
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
International Conference on Learning Representations (ICLR), 2024
Zhepei Wei
Wei-Lin Chen
Yu Meng
RALM
604
12
0
19 Jun 2024
RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation
International Conference on Computational Linguistics (COLING), 2024
Shuting Wang
Xin Yu
Mang Wang
Weipeng Chen
Yutao Zhu
Zhicheng Dou
RALM
222
16
0
18 Jun 2024
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers
Myeonghwa Lee
Seonho An
Min-Soo Kim
3DV
RALM
200
34
0
18 Jun 2024
What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling
Yutong Hu
Quzhe Huang
Kangcheng Luo
Yansong Feng
137
2
0
17 Jun 2024
Iterative Utility Judgment Framework via LLMs Inspired by Relevance in Philosophy
Hengran Zhang
Keping Bi
Jiafeng Guo
Xueqi Cheng
273
4
0
17 Jun 2024
SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention
Qianchao Zhu
Jiangfei Duan
Chang Chen
Siran Liu
Xiuhong Li
Xin Lv
Xiao Chuanfu
Huanqi Cao
Chao Yang
337
27
0
17 Jun 2024
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Neural Information Processing Systems (NeurIPS), 2024
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM
ALM
LRM
ReLM
ELM
275
142
0
14 Jun 2024
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
Changan Chen
Puyuan Peng
Ami Baid
Zihui Xue
Wei-Ning Hsu
David Harwath
Kristen Grauman
VGen
254
19
0
13 Jun 2024
Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling
Zile Qiao
Wei Ye
Yong Jiang
Tong Mo
Pengjun Xie
Weiping Li
Fei Huang
Shikun Zhang
KELM
176
5
0
12 Jun 2024
Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives
Da Xu
Danqing Zhang
Guangyu Yang
Bo Yang
Shuyuan Xu
Lingling Zheng
Cindy Liang
150
4
0
10 Jun 2024
Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue
Simone Alghisi
Massimo Rizzoli
Gabriel Roccabruna
Seyed Mahed Mousavi
Giuseppe Riccardi
OffRL
511
16
0
10 Jun 2024
Retrieval & Fine-Tuning for In-Context Tabular Models
Neural Information Processing Systems (NeurIPS), 2024
Valentin Thomas
Junwei Ma
Rasa Hosseinzadeh
Keyvan Golestan
Guangwei Yu
Anthony L. Caterini
M. Volkovs
249
33
0
07 Jun 2024
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yueze Wang
Zheng Liu
Shitao Xiao
Bo Zhao
Yongping Xiong
269
54
0
06 Jun 2024
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Ling Yang
Zhaochen Yu
Tianjun Zhang
Shiyi Cao
Minkai Xu
Wentao Zhang
Joseph E. Gonzalez
Bin Cui
LLMAG
LM&Ro
LRM
KELM
333
79
0
06 Jun 2024
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
Faisal Tareque Shohan
Mir Tafseer Nayeem
Samsul Islam
Abu Ubaida Akash
Shafiq Joty
223
6
0
06 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
181
2
0
05 Jun 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Kenneth Enevoldsen
Márton Kardos
Niklas Muennighoff
Kristoffer Nielbo
253
20
0
04 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
375
0
0
04 Jun 2024
EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs
Zixuan Dong
Baoyun Peng
Yufei Wang
Jia Fu
Xiaodong Wang
Yongxue Shan
Xin Zhou
308
13
0
03 Jun 2024
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
Cheng Tan
Jingxuan Wei
Linzhuang Sun
Zhangyang Gao
Siyuan Li
Bihui Yu
Ruifeng Guo
Stan Z. Li
ReLM
LRM
3DV
282
14
0
31 May 2024
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu
Zhaoheng Huang
Zhicheng Dou
Ji-Rong Wen
RALM
265
8
0
30 May 2024
Toward Conversational Agents with Context and Time Sensitive Long-term Memory
Nick Alonso
Tomás Figliolia
A. Ndirango
Beren Millidge
RALM
3DV
416
6
0
29 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong Liu
Ruiming Tang
KELM
246
7
0
29 May 2024
CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control
Huanshuo Liu
Hao Zhang
Zhijiang Guo
Kuicai Dong
Xiangyang Li
Yi Quan Lee
Cong Zhang
Yong Liu
3DV
270
6
0
29 May 2024
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
Minghan Li
Xilun Chen
Ari Holtzman
Beidi Chen
Jimmy Lin
Anuj Kumar
Xi Lin
RALM
BDL
730
21
0
29 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
244
5
0
26 May 2024
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection
Yun Zhu
Jia-Chen Gu
Caitlin Sikora
Ho Ko
Yinxiao Liu
...
Lei Shu
Liangchen Luo
Lei Meng
Bang Liu
Jindong Chen
RALM
247
24
0
25 May 2024
Previous
1
2
3
...
7
8
9
...
16
17
18
Next