Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.04426
Cited By
v1
v2
v3 (latest)
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
Contextual Document Embeddings
International Conference on Learning Representations (ICLR), 2024
John X. Morris
Alexander M. Rush
516
16
0
03 Oct 2024
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining
Vinayak Arannil
Neha Narwal
Sourav Sanjukta Bhabesh
Sai Nikhil Thirandas
Darren Yow-Bang Wang
Graham Horwood
Alex Anto Chirayath
Gouri Pandeshwar
299
1
0
30 Sep 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
International Conference on Learning Representations (ICLR), 2024
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
HILM
631
44
0
30 Sep 2024
Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems
International Conference on Computational Linguistics (COLING), 2024
Xuyang Wu
Shuowei Li
Hsin-Tai Wu
Zhiqiang Tao
Yi Fang
512
25
0
29 Sep 2024
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Pritika Ramu
Koustava Goswami
Apoorv Saxena
Balaji Vasan Srinivavsan
290
10
0
25 Sep 2024
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Lu Chen
Ruqing Zhang
Jiafeng Guo
Yixing Fan
Xueqi Cheng
150
9
0
24 Sep 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely
Siyun Zhao
Yuqing Yang
Zilong Wang
Zhiyuan He
Luna Qiu
Lili Qiu
SyDa
RALM
3DV
322
92
0
23 Sep 2024
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines
Dongzhi Jiang
Renrui Zhang
Ziyu Guo
Yanmin Wu
Jiayi Lei
...
Guanglu Song
Peng Gao
Yu Liu
Chunyuan Li
Hongsheng Li
MLLM
287
48
0
19 Sep 2024
FoodPuzzle: Developing Large Language Model Agents as Flavor Scientists
Tenghao Huang
Donghee Lee
John Sweeney
Jiatong Shi
Emily Steliotes
Matthew Lange
Jonathan May
Muhao Chen
350
4
0
19 Sep 2024
RAG-Modulo: Solving Sequential Tasks using Experience, Critics, and Language Models
Abhinav Jain
Chris Jermaine
Vaibhav Unhelkar
KELM
LLMAG
206
2
0
18 Sep 2024
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Yujia Zhou
Yan Liu
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Zheng Liu
Chaozhuo Li
Zhicheng Dou
Tsung-Yi Ho
Philip S. Yu
3DV
RALM
281
80
0
16 Sep 2024
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization
European Conference on Artificial Intelligence (ECAI), 2024
Gentiana Rashiti
G. Karunaratne
Mrinmaya Sachan
Abu Sebastian
Abbas Rahimi
RALM
543
1
0
12 Sep 2024
On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains
Xun Xian
Ganghua Wang
Xuan Bi
Jayanth Srinivasa
Jayanth Srinivasa
Charles Fleming
Mingyi Hong
Jie Ding
SILM
280
4
0
12 Sep 2024
Column Vocabulary Association (CVA): semantic interpretation of dataless tables
Margherita Martorana
Xueli Pan
Benno Kruit
Tobias Kuhn
Jacco van Ossenbruggen
177
1
0
06 Sep 2024
You Only Use Reactive Attention Slice For Long Context Retrieval
Yun Joon Soh
Hanxian Huang
Yuandong Tian
Jishen Zhao
RALM
214
1
0
03 Sep 2024
A Learnable Agent Collaboration Network Framework for Personalized Multimodal AI Search Engine
Yunxiao Shi
Min Xu
Haimin Zhang
Xing Zi
Qiang Wu
LLMAG
209
6
0
01 Sep 2024
Retrieval-Augmented Natural Language Reasoning for Explainable Visual Question Answering
Su Hyeon Lim
Minkuk Kim
Hyeon Bae Kim
Seong Tae Kim
ReLM
LRM
197
0
0
30 Aug 2024
Enhancing and Accelerating Large Language Models via Instruction-Aware Contextual Compression
Haowen Hou
Fei Ma
Binwen Bai
Xinxin Zhu
Fei Yu
200
3
0
28 Aug 2024
A Statistical Framework for Data-dependent Retrieval-Augmented Models
International Conference on Machine Learning (ICML), 2024
Soumya Basu
A. S. Rawat
Manzil Zaheer
RALM
282
2
0
27 Aug 2024
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy
Priyanka Mandikal
RALM
VLM
211
1
0
21 Aug 2024
Great Memory, Shallow Reasoning: Limits of
k
k
k
NN-LMs
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Shangyi Geng
Wenting Zhao
Alexander M. Rush
RALM
ReLM
LRM
235
6
0
21 Aug 2024
Large Language Model Driven Recommendation
Anton Korikov
Scott Sanner
Yashar Deldjoo
Zhankui He
Julian McAuley
...
René Vidal
M. Sathiamoorthy
Atoosa Kasrizadeh
Silvia Milano
Francesco Ricci
321
0
0
20 Aug 2024
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Jiri Hron
Laura J. Culp
Gamaleldin F. Elsayed
Rosanne Liu
Ben Adlam
...
T. Warkentin
Lechao Xiao
Kelvin Xu
Jasper Snoek
Simon Kornblith
161
3
0
14 Aug 2024
WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs
Weijian Xie
Xuefeng Liang
Yuhui Liu
Kaihua Ni
Hong Cheng
Zetian Hu
3DV
RALM
305
8
0
14 Aug 2024
Optimizing RAG Techniques for Automotive Industry PDF Chatbots: A Case Study with Locally Deployed Ollama Models
Fei Liu
Zejun Kang
Xing Han
150
25
0
12 Aug 2024
Retrieval-augmented code completion for local projects using large language models
Expert systems with applications (ESWA), 2024
Marko Hostnik
Marko Robnik-Sikonja
RALM
266
3
0
09 Aug 2024
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ziyuan Zhuang
Zhiyang Zhang
Sitao Cheng
Fangkai Yang
Jia Liu
Xin Huang
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
RALM
237
23
0
08 Aug 2024
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
Yuchen Dong
Xiaoxiang Fang
Yuchen Hu
Renshuang Jiang
Zhe Jiang
232
0
0
07 Aug 2024
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer
Moshe Berchansky
Moshe Wasserblat
Peter Izsak
3DV
293
8
0
05 Aug 2024
QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression
China Conference on Information Retrieval (CIR), 2024
Zhaohong Liu
Yihang Wang
Yixing Fan
Huaming Liao
Peng Dong
249
5
0
01 Aug 2024
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
Shanbo Cheng
Zhichao Huang
Tom Ko
Hang Li
Ningxin Peng
Lu Xu
Qini Zhang
296
11
0
31 Jul 2024
MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training
Rivik Setty
Chengjin Xu
Vinay Setty
Jian Guo
272
28
0
31 Jul 2024
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale
Ali AhmadiTeshnizi
Wenzhi Gao
Herman Brunborg
Shayan Talaei
Connor Lawless
Madeleine Udell
413
23
0
29 Jul 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
International Conference on Learning Representations (ICLR), 2024
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
384
44
0
29 Jul 2024
Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications
Till Speicher
Mohammad Aflah Khan
Qinyuan Wu
Vedant Nanda
Soumi Das
Bishwamittra Ghosh
Krishna P. Gummadi
Evimaria Terzi
255
7
0
27 Jul 2024
Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks
Yunfan Gao
Yun Xiong
Meng Wang
Haofen Wang
334
45
0
26 Jul 2024
Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach
Zhuowan Li
Cheng-rong Li
Mingyang Zhang
Qiaozhu Mei
Michael Bendersky
3DV
RALM
250
96
0
23 Jul 2024
MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation
Marco Simoni
Andrea Saracino
Vinod Puthuvath
Maurco Conti
303
12
0
22 Jul 2024
An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Yuetong Zhao
Hongyu Cao
Xianyu Zhao
Zhijian Ou
RALM
LRM
195
7
0
22 Jul 2024
Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors
Qizhi Pei
Lijun Wu
Zhenyu He
Jinhua Zhu
Ziheng Lu
Shufang Xie
Rui Yan
186
3
0
21 Jul 2024
Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation
Liwen Sun
James Zhao
Megan Han
Chenyan Xiong
MedIm
412
23
0
21 Jul 2024
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Peng Xu
Ming-Yu Liu
Xianchao Wu
Zihan Liu
Mohammad Shoeybi
Mohammad Shoeybi
Bryan Catanzaro
RALM
477
32
0
19 Jul 2024
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark
Tsung-Han Wu
Giscard Biamby
Jerome Quenum
Ritwik Gupta
Joseph E. Gonzalez
Trevor Darrell
David M. Chan
VLM
285
0
0
18 Jul 2024
Retrieval-Augmented Generation for Natural Language Processing: A Survey
Shangyu Wu
Ying Xiong
Yufei Cui
Haolun Wu
Can Chen
...
Lianming Huang
Xue Liu
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
3DV
RALM
455
91
0
18 Jul 2024
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim
Alireza Salemi
Andrew Drozdov
Fernando Diaz
Hamed Zamani
367
10
0
17 Jul 2024
R+X: Retrieval and Execution from Everyday Human Videos
Georgios Papagiannis
Norman Di Palo
Pietro Vitiello
Edward Johns
448
33
0
17 Jul 2024
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
Youngsun Lim
Hyunjung Shim
DiffM
HILM
MQ
186
7
0
15 Jul 2024
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems
Yunxiao Shi
Xing Zi
Zijing Shi
Haimin Zhang
Qiang Wu
Min Xu
302
20
0
15 Jul 2024
ChatLogic: Integrating Logic Programming with Large Language Models for Multi-Step Reasoning
Zhongsheng Wang
Jiamou Liu
Qiming Bao
Hongfei Rong
Jingfeng Zhang
KELM
LRM
235
12
0
14 Jul 2024
Mitigating Entity-Level Hallucination in Large Language Models
Weihang Su
Yichen Tang
Jiaxin Mao
Changyue Wang
Zhijing Wu
Yiqun Liu
HILM
241
21
0
12 Jul 2024
Previous
1
2
3
...
6
7
8
...
16
17
18
Next