Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.04426
Cited By
v1
v2
v3 (latest)
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction
Sai Qian Zhang
Ziyun Li
Chuan Guo
Saeed Mahloujifar
Deeksha Dangwal
Edward Suh
B. D. Salvo
Chiao Liu
DiffM
310
2
0
11 Dec 2024
A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Zhepeng Wang
Runxue Bao
Yawen Wu
Guodong Liu
Lei Yang
Chen Tang
Feng Zheng
Weiwen Jiang
Jinming Duan
390
2
0
09 Dec 2024
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
The Web Conference (WWW), 2024
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
371
1
0
01 Dec 2024
On the Effectiveness of Incremental Training of Large Language Models
Miles Q. Li
Benjamin C. M. Fung
Shih-Chia Huang
CLL
AIFin
168
1
0
27 Nov 2024
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Amy Xin
Jinxin Liu
Zijun Yao
Zhicheng Li
S. Cao
Lei Hou
Juanzi Li
LRM
428
8
0
25 Nov 2024
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Ming Hu
Kun Yuan
Yaling Shen
Feilong Tang
Xiaohao Xu
...
Jin Ye
N. Padoy
Nassir Navab
Junjun He
Zongyuan Ge
VLM
CLIP
441
23
0
23 Nov 2024
RAG-Thief: Scalable Extraction of Private Data from Retrieval-Augmented Generation Applications with Agent-based Attacks
Changyue Jiang
Xudong Pan
Geng Hong
Chenfu Bao
Min Yang
SILM
294
23
0
21 Nov 2024
Molecule Generation with Fragment Retrieval Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Seul Lee
Karsten Kreis
Srimukh Prasad Veccham
Meng Liu
Danny Reidenbach
Saee Paliwal
Arash Vahdat
Weili Nie
VLM
269
18
0
18 Nov 2024
SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models
IEEE International Conference on Robotics and Automation (ICRA), 2024
M. Ginting
Dong-Ki Kim
Sung-Kyun Kim
Bandi Jai Krishna
Mykel J. Kochenderfer
Shayegan Omidshafiei
Ali-akbar Agha-mohammadi
LM&Ro
318
4
0
18 Nov 2024
Task-Aligned Tool Recommendation for Large Language Models
Hang Gao
Yongfeng Zhang
KELM
281
1
0
14 Nov 2024
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant
Neural Information Processing Systems (NeurIPS), 2024
Yujia Zhou
Zheng Liu
Zhicheng Dou
AIFin
LRM
RALM
151
5
0
11 Nov 2024
Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models
Guangzhi Xiong
Eric Xie
Amir Hassan Shariatmadari
Sikun Guo
Stefan Bekiranov
Aidong Zhang
LRM
HILM
201
21
0
04 Nov 2024
RuAG: Learned-rule-augmented Generation for Large Language Models
International Conference on Learning Representations (ICLR), 2024
Yudi Zhang
Pei Xiao
Lu Wang
Chen Zhang
Meng Fang
...
Qingwei Lin
Mykola Pechenizkiy
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
LRM
221
1
0
04 Nov 2024
Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
European Conference on Computer Vision (ECCV), 2024
Seongsu Ha
Chaeyun Kim
Donghwa Kim
Junho Lee
Sangho Lee
Joonseok Lee
258
6
0
03 Nov 2024
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Aliyah R. Hsu
James Zhu
Zhichao Wang
Bin Bi
Shubham Mehrotra
...
Sougata Chaudhuri
Regunathan Radhakrishnan
S. Asur
Claire Na Cheng
Bin Yu
ALM
LRM
692
1
0
03 Nov 2024
E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2024
Yun Jiang
Zilong Xie
Wei Zhang
Yun Fang
Shuai Pan
RALM
962
0
0
01 Nov 2024
LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models
Hieu Tran
Junda Wang
Yujan Ting
Weijing Huang
Terrence Chen
HILM
KELM
157
1
0
31 Oct 2024
Interpretable Next-token Prediction via the Generalized Induction Head
Eunji Kim
Sriya Mantena
Weiwei Yang
Chandan Singh
Sungroh Yoon
Jianfeng Gao
371
1
0
31 Oct 2024
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications
Monica Riedler
Stefan Langer
VLM
277
30
0
29 Oct 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Zhixin Zhang
Yiyuan Zhang
Xiaohan Ding
Xiangyu Yue
227
12
0
28 Oct 2024
Graph-based Uncertainty Metrics for Long-form Language Model Outputs
Mingjian Jiang
Yangjun Ruan
Prasanna Sattigeri
Salim Roukos
Tatsunori Hashimoto
207
11
0
28 Oct 2024
GCoder: Improving Large Language Model for Generalized Graph Problem Solving
Qifan Zhang
Xiaobin Hong
Jianheng Tang
Polydoros Giannouris
Yuhan Li
Wenzhong Li
Jing Tang
Jia Li
OffRL
AI4CE
LRM
229
9
0
24 Oct 2024
LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search
Neural Information Processing Systems (NeurIPS), 2024
Elias Jääsaari
Ville Hyvönen
Teemu Roos
229
4
0
24 Oct 2024
Retrieval-Augmented Diffusion Models for Time Series Forecasting
Neural Information Processing Systems (NeurIPS), 2024
Jingwei Liu
Ling Yang
Hongyan Li
Shenda Hong
DiffM
AI4TS
175
20
0
24 Oct 2024
TabDPT: Scaling Tabular Foundation Models on Real Data
Junwei Ma
Valentin Thomas
Rasa Hosseinzadeh
Hamidreza Kamkari
Alex Labach
Jesse C. Cresswell
Keyvan Golestan
Guangwei Yu
Anthony L. Caterini
M. Volkovs
LMTD
492
8
0
23 Oct 2024
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
International Conference on Machine Learning (ICML), 2024
Zongmeng Zhang
Yufeng Shi
Jinhua Zhu
Wengang Zhou
Xiang Qi
Peng Zhang
Haoyang Li
RALM
HILM
176
2
0
22 Oct 2024
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems
The Web Conference (WWW), 2024
Krishna Sayana
Raghavendra Vasudeva
Yuri Vasilevski
Kun Su
Liam Hebert
H. Pham
Ambarish Jash
Sukhdeep S. Sodhi
3DV
281
5
0
22 Oct 2024
ProveRAG: Provenance-Driven Vulnerability Analysis with Automated Retrieval-Augmented LLMs
Reza Fayyazi
Stella Hoyos Trueba
Michael Zuzak
S. Yang
260
5
0
22 Oct 2024
Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xin Zhou
Ping Nie
Yiwen Guo
Haojie Wei
Zhanqiu Zhang
Pasquale Minervini
Ruotian Ma
Tao Gui
Tao Gui
Xuanjing Huang
MoE
182
1
0
20 Oct 2024
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
IEEE transactions on multimedia (IEEE TMM), 2024
Muhe Ding
Yang Ma
Pengda Qin
Yue Yu
Yuhong Li
Liqiang Nie
249
4
0
18 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLM
RALM
465
13
0
17 Oct 2024
Open Domain Question Answering with Conflicting Contexts
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Siyi Liu
Qiang Ning
Kishaloy Halder
Wei Xiao
Zheng Qi
...
Yi Zhang
Neha Anna John
Bonan Min
Yassine Benajiba
Dan Roth
LLMAG
538
11
0
16 Oct 2024
RuleRAG: Rule-Guided Retrieval-Augmented Generation with Language Models for Question Answering
Zhongwu Chen
Chengjin Xu
Dingmin Wang
Zhen Huang
Yong Dou
Xuhui Jiang
Jian Guo
RALM
993
8
0
15 Oct 2024
FunnelRAG: A Coarse-to-Fine Progressive Retrieval Paradigm for RAG
X. Zhao
Yan Zhong
Zetian Sun
Xinshuo Hu
Zhenyu Liu
Dongfang Li
Baotian Hu
Min Zhang
530
18
0
14 Oct 2024
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
Yicheng Fu
R. Anantha
Jianpeng Cheng
LRM
LLMAG
216
6
0
12 Oct 2024
ACER: Automatic Language Model Context Extension via Retrieval
Luyu Gao
Yunyi Zhang
Jamie Callan
RALM
174
0
0
11 Oct 2024
Generation with Dynamic Vocabulary
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yanting Liu
Changzhi Sun
Changzhi Sun
Yuanbin Wu
Xiaoling Wang
167
3
0
11 Oct 2024
Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks
Mathis Pink
Vy A. Vo
Qinyuan Wu
Jianing Mu
Javier S. Turek
Uri Hasson
K. A. Norman
Sebastian Michelmann
Alexander G. Huth
Mariya Toneva
246
8
0
10 Oct 2024
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Songshuo Lu
Hua Wang
Yutian Rong
Zhi Chen
Yaohua Tang
VLM
288
37
0
10 Oct 2024
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
International Conference on Learning Representations (ICLR), 2024
Zirui Zhao
Hanze Dong
Amrita Saha
Caiming Xiong
Doyen Sahoo
LRM
349
13
0
10 Oct 2024
Retrieval Replace Reduction: An effective visual token reduction method via semantic match
Yingen Liu
Fan Wu
Ruihui Li
Zhuo Tang
KenLi Li
VLM
135
0
0
09 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
479
13
0
09 Oct 2024
Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Fei Wang
Xingchen Wan
Ruoxi Sun
Jiefeng Chen
Sercan Ö. Arık
RALM
320
36
0
09 Oct 2024
LeanAgent: Lifelong Learning for Formal Theorem Proving
International Conference on Learning Representations (ICLR), 2024
Adarsh Kumarappan
Mo Tiwari
Peiyang Song
Robert Joseph George
Chaowei Xiao
Anima Anandkumar
CLL
LLMAG
LRM
535
12
0
08 Oct 2024
Driving with Regulation: Trustworthy and Interpretable Decision-Making for Autonomous Driving with Retrieval-Augmented Reasoning
Tianhui Cai
Yifan Liu
Zewei Zhou
Haoxuan Ma
Seth Z. Zhao
Zhiwen Wu
Xu Han
Zhiyu Huang
Jiaqi Ma
404
20
0
07 Oct 2024
Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law
Yongming Chen
Miner Chen
Ye Zhu
Juan Pei
Siyu Chen
Yu Zhou
Yi Wang
Yifan Zhou
Hao Li
Songan Zhang
AILaw
ELM
254
3
0
07 Oct 2024
PECAN: LLM-Guided Dynamic Progress Control with Attention-Guided Hierarchical Weighted Graph for Long-Document QA
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xinyu Wang
Yanzheng Xiang
Lin Gui
Yulan He
372
2
0
07 Oct 2024
Accelerating Inference of Networks in the Frequency Domain
ACM Multimedia Asia (MMAsia), 2024
Chenqiu Zhao
Guanfang Dong
Anup Basu
313
51
0
06 Oct 2024
Misinformation with Legal Consequences (MisLC): A New Task Towards Harnessing Societal Harm of Misinformation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chu Fei Luo
Radin Shayanfar
R. Bhambhoria
Samuel Dahan
Xiaodan Zhu
AILaw
208
2
0
04 Oct 2024
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Thang Nguyen
Peter Chin
Yu-Wing Tai
RALM
311
6
0
03 Oct 2024
Previous
1
2
3
...
5
6
7
...
16
17
18
Next