Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.04426
Cited By
v1
v2
v3 (latest)
Improving language models by retrieving from trillions of tokens
8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Improving language models by retrieving from trillions of tokens"
50 / 893 papers shown
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Haolong Yan
Kaijun Tan
Yeqing Shen
Xin Huang
Zheng Ge
Xiangyu Zhang
Si Li
Daxin Jiang
VLM
218
0
0
27 Mar 2025
The cell as a token: high-dimensional geometry in language models and cell embeddings
Bioinformatics (Bioinformatics), 2025
William Gilpin
404
1
0
26 Mar 2025
ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses
Esmail Gumaan
MoE
314
2
0
23 Mar 2025
OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery
Vignesh Prabhakar
Md Amirul Islam
Adam Atanas
Longji Xu
J. N. Han
...
Rucha Apte
Robert Clark
Kang Xu
Zihan Wang
Kai Liu
LRM
555
15
0
22 Mar 2025
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Weihang Su
Baoqing Yue
Jiaxin Mao
Yiran Hu
Jiaqi Li
C. Wang
Kaiyuan Zhang
Yueyue Wu
Yixiao Liu
AILaw
ELM
379
14
0
18 Mar 2025
HDLCoRe: A Training-Free Framework for Mitigating Hallucinations in LLM-Generated HDL
Heng Ping
Shixuan Li
Peiyu Zhang
Anzhe Cheng
Shukai Duan
...
Xiongye Xiao
Wei Yang
Shahin Nazarian
Andrei Irimia
Paul Bogdan
248
12
0
18 Mar 2025
OSCAR: Online Soft Compression And Reranking
Maxime Louis
Thibault Formal
Hervé Déjean
Stéphane Clinchant
287
0
0
17 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Tong Xu
3DV
367
35
0
11 Mar 2025
LLM-based Corroborating and Refuting Evidence Retrieval for Scientific Claim Verification
Siyuan Wang
James R. Foulds
Md Osman Gani
Shimei Pan
190
2
0
11 Mar 2025
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation
Shai Bergman
Zhang Ji
Anne-Marie Kermarrec
Diana Petrescu
Rafael Pires
Mathis Randl
M. Vos
301
3
0
07 Mar 2025
Language modelling techniques for analysing the impact of human genetic variation
Bioinformatics and Biology Insights (BBI), 2025
Megha Hegde
Jean-Christophe Nebel
Farzana Rahman
138
2
0
07 Mar 2025
Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning
Giulio Corallo
Orion Weller
Fabio Petroni
Paolo Papotti
MQ
VLM
302
3
0
06 Mar 2025
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
International Conference on Learning Representations (ICLR), 2025
Shangzhe Di
Zhelun Yu
Guanghao Zhang
Haoyuan Li
Tao Zhong
Hao Cheng
Bolin Li
Wanggui He
Fangxun Shu
Hao Jiang
210
38
0
01 Mar 2025
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin
Keisuke Kamahori
Yiyu Liu
Xiaoxiang Shi
Madhav Kashyap
...
Stephanie Wang
Arvind Krishnamurthy
Rohan Kadekodi
Luis Ceze
Baris Kasikci
3DV
VLM
973
6
0
28 Feb 2025
RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings
Computer Vision and Pattern Recognition (CVPR), 2025
Aayush Dhakal
Srikumar Sastry
Subash Khanal
Adeel Ahmad
Eric Xing
Nathan Jacobs
412
6
0
27 Feb 2025
Do Retrieval-Augmented Language Models Adapt to Varying User Needs?
Peilin Wu
Xinlu Zhang
Wenhao Yu
Xingyu Liu
Xinya Du
Zhiyu Zoey Chen
RALM
413
1
0
27 Feb 2025
From Retrieval to Generation: Comparing Different Approaches
Abdelrahman Abdallah
Jamshid Mozafari
Bhawna Piryani
Mohammed Ali
Adam Jatowt
RALM
278
3
0
27 Feb 2025
PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation
Albert Gong
Kamilė Stankevičiūtė
Chao-gang Wan
Anmol Kabra
Raphael Thesmar
Johann Lee
Julius Klenke
Daniel Schwalbe-Koda
Kilian Q. Weinberger
LRM
RALM
326
4
0
27 Feb 2025
Bián: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation
Zhouyu Jiang
Mengshu Sun
Qing Cui
Lei Liang
RALM
3DV
697
1
0
26 Feb 2025
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
Hyeonjeong Ha
Qiusi Zhan
Jeonghwan Kim
Dimitrios Bralios
Saikrishna Sanniboina
Nanyun Peng
Kai-Wei Chang
Daniel Kang
Heng Ji
KELM
AAML
390
10
0
25 Feb 2025
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Viktor Moskvoretskii
M. Lysyuk
Mikhail Salnikov
Nikolay Ivanov
Sergey Pletenev
Daria Galimzianova
Nikita Krayko
Vasily Konovalov
Irina Nikishina
Sergey Petrakov
RALM
355
13
0
24 Feb 2025
Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid
Yunfeng Li
Jiqun Zhang
Guofu Liao
Xue Shi
Junhong Liu
217
0
0
24 Feb 2025
Forecasting Rare Language Model Behaviors
Erik Jones
Meg Tong
Jesse Mu
Mohammed Mahfoud
Jan Leike
Roger C. Grosse
Jared Kaplan
William Fithian
Ethan Perez
Mrinank Sharma
301
5
0
24 Feb 2025
PICASO: Permutation-Invariant Context Composition with State Space Models
International Conference on Learning Representations (ICLR), 2025
Tian Yu Liu
Alessandro Achille
Matthew Trager
Aditya Golatkar
Luca Zancato
Stefano Soatto
LRM
501
0
0
24 Feb 2025
Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals
Linda Zeng
Rithwik Gupta
Divij Motwani
Diji Yang
Yi Zhang
AAML
487
12
0
22 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
3DV
KELM
578
17
0
20 Feb 2025
A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems
Lew Lefton
Kexin Rong
Chinar Dankhara
Lila Ghemri
Firdous Kausar
A. Hannibal Hamdallahi
193
1
0
20 Feb 2025
Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiachen Zhu
Congmin Zheng
Jianghao Lin
Kounianhua Du
Ying Wen
Yong Yu
Jun Wang
Weinan Zhang
LRM
ReLM
174
15
0
20 Feb 2025
Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Qitao Qin
Yucong Luo
Yihang Lu
Zhibo Chu
Xiaoman Liu
Xianwei Meng
199
2
0
19 Feb 2025
A-MEM: Agentic Memory for LLM Agents
Wujiang Xu
Zujie Liang
Kai Mei
Hang Gao
Juntao Tan
Yongfeng Zhang
LLMAG
KELM
RALM
1.0K
185
0
17 Feb 2025
Associative Recurrent Memory Transformer
Ivan Rodkin
Yuri Kuratov
Aydar Bulatov
Andrey Kravchenko
298
12
0
17 Feb 2025
CiteCheck: Towards Accurate Citation Faithfulness Detection
Ziyao Xu
Shaohang Wei
Zhuoheng Han
Jing Jin
Zhiyong Yang
Xiaoguang Li
Haochen Tan
Zhijiang Guo
Houfeng Wang
178
1
0
15 Feb 2025
KIMAs: A Configurable Knowledge Integrated Multi-Agent System
Zitao Li
Fei Wei
Yuexiang Xie
Dawei Gao
Weirui Kuang
Zhijian Ma
Bingchen Qian
Yaliang Li
Bolin Ding
402
1
0
13 Feb 2025
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
L. Yang
Zhaochen Yu
Tengjiao Wang
Mengdi Wang
ReLM
LRM
AI4CE
586
43
0
10 Feb 2025
Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy
Rishabh Uapadhyay
Marco Viviani
432
16
0
07 Feb 2025
Efficient Knowledge Feeding to Language Models: A Novel Integrated Encoder-Decoder Architecture
Sachin Kumar
Rishi Gottimukkala
Supriya Devidutta
K. Spindler
RALM
KELM
3DV
257
0
0
07 Feb 2025
RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
The Web Conference (WWW), 2025
Can Jin
Hongwu Peng
Anxiang Zhang
Nuo Chen
Jiahui Zhao
...
Keqin Li
Shuya Feng
Kai Zhong
Caiwen Ding
Dimitris N. Metaxas
670
4
0
02 Feb 2025
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Yiteng Tu
Weihang Su
Yujia Zhou
Wenshu Fan
Jiaxin Mao
RALM
631
16
0
30 Jan 2025
SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Ran Xu
Hui Liu
Jiapeng Liu
Zhenwei Dai
Yaochen Xie
...
Chen Luo
Yang Li
Joyce C. Ho
Carl Yang
Qi He
RALM
529
25
0
28 Jan 2025
Parametric Retrieval Augmented Generation
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Weihang Su
Yichen Tang
Jiaxin Mao
Junxi Yan
C. Wang
Hongning Wang
Ziyi Ye
Yujia Zhou
Wenshu Fan
242
17
0
28 Jan 2025
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
International Conference on Computational Linguistics (COLING), 2024
Zijie Zhong
Hanwen Liu
Xiaoya Cui
Xiaofan Zhang
Zengchang Qin
318
24
0
28 Jan 2025
On Storage Neural Network Augmented Approximate Nearest Neighbor Search
Taiga Ikeda
Daisuke Miyashita
J. Deguchi
191
0
0
23 Jan 2025
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models
Qinggang Zhang
Shengyuan Chen
Yuanchen Bei
Zheng Yuan
Huachi Zhou
...
Hao-Heng Chen
Chuang Zhou
Yi-Ju Chang
Yi-Ju Chang
Xiao Huang
3DV
550
70
0
21 Jan 2025
SteLLA: A Structured Grading System Using LLMs with RAG
BigData Congress [Services Society] (BSS), 2024
Hefei Qiu
Brian White
Ashley Ding
Reinaldo Costa
Ali Hachem
Wei Ding
Ping Chen
AI4Ed
405
6
0
17 Jan 2025
Parallel Key-Value Cache Fusion for Position Invariant RAG
Philhoon Oh
Jinwoo Shin
Hyunjung Shim
3DV
1.0K
0
0
13 Jan 2025
Foundations of GenIR
Jiaxin Mao
Jingtao Zhan
Wenshu Fan
266
0
0
06 Jan 2025
A Unified Framework for Context-Aware IoT Management and State-of-the-Art IoT Traffic Anomaly Detection
Daniel Adu Worae
Athar Sheikh
Spyridon Mastorakis
258
3
0
31 Dec 2024
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval
Aakash Mahalingam
Vinesh Kumar Gande
Vasu Sharma
Vinija Jain
Divya Chaudhary
250
1
0
19 Dec 2024
RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yihang Cheng
Lan Zhang
Junyang Wang
Mu Yuan
Yunhao Yao
217
7
0
17 Dec 2024
IGR: Improving Diffusion Model for Garment Restoration from Person Image
Le Shen
Rong Huang
Zhijie Wang
DiffM
349
3
0
16 Dec 2024
Previous
1
2
3
4
5
6
...
16
17
18
Next