arXiv: 2112.04426
Improving language models by retrieving from trillions of tokens

8 December 2021
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
Katie Millican
George van den Driessche
Jean-Baptiste Lespiau
Bogdan Damoc
Aidan Clark
Diego de Las Casas
Aurelia Guy
Jacob Menick
Roman Ring
Tom Hennigan
Saffron Huang
Lorenzo Maggiore
Chris Jones
Albin Cassirer
Andy Brock
Michela Paganini
G. Irving
Oriol Vinyals
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM, RALM

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown
Latent learning: episodic memory complements parametric learning by enabling flexible reuse of experiences
Andrew Kyle Lampinen
Martin Engelcke
Yuxuan Li
Arslan Chaudhry
James L. McClelland
CLL, BDL
486
4
0
24 Dec 2025
Retrieval-Augmented Memory for Online Learning
Wenzhang Du
RALM, KELM
527
0
0
02 Dec 2025
Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2025
Shuqi Liu
Han Wu
Guanzhi Deng
Jianshu Chen
Xiaoyang Wang
Linqi Song
116
0
0
28 Nov 2025
Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval
Anup Roy
Rishabh Gyanendra Upadhyay
Animesh Rameshbhai Panara
Robin Mills
Aidan Millar
VLM
225
0
0
26 Nov 2025
Learning Plug-and-play Memory for Guiding Video Diffusion Models
Selena Song
Ziming Xu
Zijun Zhang
Kun Zhou
Jiaxian Guo
Lianhui Qin
Biwei Huang
VGen
284
0
0
24 Nov 2025
Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters
Zhan Su
Fengran Mo
Jian-yun Nie
92
0
0
21 Nov 2025
ARK: Answer-Centric Retriever Tuning via KG-augmented Curriculum Learning
Jiawei Zhou
Hang Ding
Haiyun Jiang
RALM
128
0
0
20 Nov 2025
Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
Changhun Oh
Seongryong Oh
Jinwoo Hwang
Yoonsung Kim
Hardik Sharma
Jongse Park
3DGS
209
0
0
17 Nov 2025
Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search
Dong Liu
Yanxuan Yu
66
0
0
12 Nov 2025
Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models
Teqi Hao
Xiaoyu Tan
Shaojie Shi
Yinghui Xu
Xihe Qiu
217
0
0
07 Nov 2025
Search Is Not Retrieval: Decoupling Semantic Matching from Contextual Assembly in RAG
Harshit Nainwani
Hediyeh Baban
AI4TS
264
0
0
07 Nov 2025
BudgetMem: Learning Selective Memory Policies for Cost-Efficient Long-Context Processing in Language Models
Chandra Vamsi Krishna Alla
Harish Naidu Gaddam
Manohar Kommi
RALM
285
0
0
07 Nov 2025
DMA: Online RAG Alignment with Human Feedback
Yu Bai
Yukai Miao
Dawei Wang
Li Chen
Fei Long
...
Yanyu Ren
Tianfeng Liu
Hongtao Xie
Ce Yang
Xuhui Cai
158
0
0
06 Nov 2025
Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs
A. Cattaneo
Carlo Luschi
Daniel Justus
RALM
283
0
0
06 Nov 2025
Continual Learning, Not Training: Online Adaptation For Agents
Aman Jaglan
Jarrod Barnes
CLL
192
0
0
02 Nov 2025
Hydra: Dual Exponentiated Memory for Multivariate Time Series Analysis
Asal Meskin
Alireza Mirrokni
Ali Najar
Ali Behrouz
AI4TS
163
0
0
02 Nov 2025
Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge
Qi Luo
X. Li
Junqi Dai
Shuang Cheng
Xipeng Qiu
RALM
363
1
0
01 Nov 2025
MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval
Qi Luo
X. Li
Yuxin Wang
Tingshuo Fan
Yuan Li
Xinchi Chen
Xipeng Qiu
RALM, KELM, LRM
189
1
0
31 Oct 2025
RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding
Yinglu Li
Zhiying Lu
Zhihang Liu
Chuanbin Liu
Hongtao Xie
VLM
309
1
0
31 Oct 2025
Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning
Qi Luo
Xiaonan Li
Tingshuo Fan
Xinchi Chen
Xipeng Qiu
RALM, 3DV, LRM
591
0
0
30 Oct 2025
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
Yuhua Jiang
Shuang Cheng
Yihao Liu
Ermo Hua
Che Jiang
Weigao Sun
Yu Cheng
Feifei Gao
Biqing Qi
Bowen Zhou
93
0
0
30 Oct 2025
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data
Haoran Deng
Yingyu Lin
Zhenghao Lin
Xiao Liu
Yizhou Sun
Yi-An Ma
Yeyun Gong
143
0
0
29 Oct 2025
Optimizing Retrieval for RAG via Reinforcement Learning
Jiawei Zhou
Lei Chen
139
1
0
28 Oct 2025
Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering
William Christian
Daniel Adamlu
Adrian Yu
Derwin Suhartono
RALM
186
0
0
24 Oct 2025
NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge
Hanyu Zhu
Lance Fiondella
Jiawei Yuan
K. Zeng
Long Jiao
SILM, AAML, KELM
277
0
0
24 Oct 2025
Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks
Javier Marín
86
0
0
23 Oct 2025
From Masks to Worlds: A Hitchhiker's Guide to World Models
Jinbin Bai
Yu Lei
H. Wu
Yuchen Zhu
Shufan Li
Yi Xin
Xiangtai Li
Molei Tao
Aditya Grover
Ming-Hsuan Yang
VGen, SyDa
185
2
0
23 Oct 2025
Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures
Rahul Raja
A. Vats
163
1
0
23 Oct 2025
Investigating LLM Capabilities on Long Context Comprehension for Medical Question Answering
Feras AlMannaa
Talia Tseriotou
Jenny Chim
Maria Liakata
ELM
192
0
0
21 Oct 2025
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
Ji Du
Xin Wang
Fangwei Hao
Mingyang Yu
Chunyuan Chen
Jiesheng Wu
Bin Wang
Jing Xu
Ping Li
203
1
0
21 Oct 2025
Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval
Dong Yun
Marco Schouten
Dim P. Papadopoulos
RALM
165
0
0
21 Oct 2025
DVAGen: Dynamic Vocabulary Augmented Generation
Wei Du
Nuowei Liu
Jie Wang
Jiahao Kuang
Tao Ji
X. Wang
Y. Wu
80
0
0
20 Oct 2025
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
Qiusi Zhan
Angeline Budiman-Chan
Abdelrahman Zayed
Xingzhi Guo
Daniel Kang
Joo-Kyung Kim
LLMAG, KELM, AI4TS, ELM
257
0
0
19 Oct 2025
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Minhua Lin
Zongyu Wu
Zhichao Xu
Hui Liu
Xianfeng Tang
Qi He
Charu C. Aggarwal
Hui Liu
Xiang Zhang
Suhang Wang
AI4TS, LRM
564
2
0
19 Oct 2025
Stop-RAG: Value-Based Retrieval Control for Iterative RAG
Jaewan Park
Solbee Cho
Jay-Yoon Lee
114
1
0
16 Oct 2025
An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation
Daniel Adu Worae
Spyridon Mastorakis
LLMAG
77
0
0
15 Oct 2025
Document Intelligence in the Era of Large Language Models: A Survey
Weishi Wang
Hengchang Hu
Zhijie Zhang
Zhaochen Li
Hongxin Shao
Daniel Dahlmeier
AI4TS
190
1
0
15 Oct 2025
BitNet Distillation
Xun Wu
Shaohan Huang
Wenhui Wang
Ting Song
Li Dong
Yan Xia
Furu Wei
MQ
175
0
0
15 Oct 2025
Grounding Long-Context Reasoning with Contextual Normalization for Retrieval-Augmented Generation
Jiamin Chen
Yuchen Li
Xinyu Ma
X. Chen
Xiaokun Zhang
Shuaiqiang Wang
Chen Ma
D. Yin
RALM, LRM
196
0
0
15 Oct 2025
Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response
Yiheng Chen
Lingyao Li
Zihui Ma
Qikai Hu
Yilun Zhu
Min Deng
Runlong Yu
AI4CE
79
1
0
14 Oct 2025
Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation
Linfeng Gao
Baolong Bi
Zheng Yuan
Le Wang
Zerui Chen
Zhimin Wei
Shenghua Liu
Qinggang Zhang
Jinsong Su
RALM
200
0
0
14 Oct 2025
Investigating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries
Gabrielle Kaili-May Liu
Bryan Li
Arman Cohan
William Walden
Eugene Yang
RALM
267
0
0
13 Oct 2025
BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices
Euhid Aman
Esteban Carlin
Hsing-Kuo Pao
Giovanni Beltrame
Ghaluh Indah Permata Sari
Yie-Tarng Chen
125
1
0
12 Oct 2025
Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs
SuYang Xi
Chenxi Yang
Hong Ding
Yiqing Ni
Catherine C. Liu
Yunhao Liu
Chengqi Zhang
LRM
120
0
0
12 Oct 2025
LinearRAG: Linear Graph Retrieval Augmented Generation on Large-scale Corpora
Luyao Zhuang
Shengyuan Chen
Yilin Xiao
Huachi Zhou
Y. Zhang
Hao Chen
Qinggang Zhang
Xiao Shi Huang
AI4TS
305
4
0
11 Oct 2025
KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance
Kuangshi Ai
Jonathan A. Karr Jr.
Meng Jiang
Nitesh Chawla
Chaoli Wang
168
0
0
07 Oct 2025
Anytime-Valid Answer Sufficiency Certificates for LLM Generation via Sequential Information Lift
Sanjeda Akter
Ibne Farabi Shihab
Anuj Sharma
138
0
0
07 Oct 2025
Domain-Shift-Aware Conformal Prediction for Large Language Models
Zhexiao Lin
Yuanyuan Li
Neeraj Sarna
Yuanyuan Gao
Michael von Gablenz
137
2
0
07 Oct 2025
FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering
Yitao Long
Tiansheng Hu
Yilun Zhao
Arman Cohan
Chen Zhao
HILM, AIFin
193
0
0
07 Oct 2025
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Qizheng Zhang
Changran Hu
Shubhangi Upasani
Boyuan Ma
Fenglu Hong
...
Mengmeng Ji
Hanchen Li
Urmish Thakker
James Zou
Kunle Olukotun
LLMAG, KELM
223
29
0
06 Oct 2025