Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2112.04426
Cited By

Improving language models by retrieving from trillions of tokens

v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021

Sebastian Borgeaud

Jordan Hoffmann

Eliza Rutherford

George van den Driessche

Jean-Baptiste Lespiau

Diego de Las Casas

Saffron Huang

Lorenzo Maggiore

Michela Paganini

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown

Latent learning: episodic memory complements parametric learning by enabling flexible reuse of experiences

Latent learning: episodic memory complements parametric learning by enabling flexible reuse of experiences

Andrew Kyle Lampinen

Martin Engelcke

Arslan Chaudhry

James L. McClelland

486

4

0

24 Dec 2025

Retrieval-Augmented Memory for Online Learning

Retrieval-Augmented Memory for Online Learning

527

0

0

02 Dec 2025

Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery ApproachIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2025

116

0

0

28 Nov 2025

Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval

Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval

Rishabh Gyanendra Upadhyay

Animesh Rameshbhai Panara

Aidan Millar

225

0

0

26 Nov 2025

Learning Plug-and-play Memory for Guiding Video Diffusion Models

Learning Plug-and-play Memory for Guiding Video Diffusion Models

284

0

0

24 Nov 2025

Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters

Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters

92

0

0

21 Nov 2025

ARK: Answer-Centric Retriever Tuning via KG-augmented Curriculum Learning

128

0

0

20 Nov 2025

Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration

Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration

209

0

0

17 Nov 2025

Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

66

0

0

12 Nov 2025

Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models

Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models

217

0

0

07 Nov 2025

Search Is Not Retrieval: Decoupling Semantic Matching from Contextual Assembly in RAG

Search Is Not Retrieval: Decoupling Semantic Matching from Contextual Assembly in RAG

Harshit Nainwani

264

0

0

07 Nov 2025

BudgetMem: Learning Selective Memory Policies for Cost-Efficient Long-Context Processing in Language Models

BudgetMem: Learning Selective Memory Policies for Cost-Efficient Long-Context Processing in Language Models

Chandra Vamsi Krishna Alla

Harish Naidu Gaddam

285

0

0

07 Nov 2025

DMA: Online RAG Alignment with Human Feedback

DMA: Online RAG Alignment with Human Feedback

...

158

0

0

06 Nov 2025

Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs

Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs

283

0

0

06 Nov 2025

Continual Learning, Not Training: Online Adaptation For Agents

Continual Learning, Not Training: Online Adaptation For Agents

192

0

0

02 Nov 2025

Hydra: Dual Exponentiated Memory for Multivariate Time Series Analysis

Hydra: Dual Exponentiated Memory for Multivariate Time Series Analysis

Alireza Mirrokni

163

0

0

02 Nov 2025

Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge

Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge

363

1

0

01 Nov 2025

MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval

MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval

189

1

0

31 Oct 2025

RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding

RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding

Hongtao Xie

309

1

0

31 Oct 2025

Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning

Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning

591

0

0

30 Oct 2025

Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism

Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism

93

0

0

30 Oct 2025

Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data

Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data

143

0

0

29 Oct 2025

Optimizing Retrieval for RAG via Reinforcement Learning

Optimizing Retrieval for RAG via Reinforcement Learning

139

1

0

28 Oct 2025

Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering

Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering

William Christian

Derwin Suhartono

186

0

0

24 Oct 2025

NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge

NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge

Lance Fiondella

277

0

0

24 Oct 2025

Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks

Capability Ceilings in Autoregressive Language Models: Empirical Evidence from Knowledge-Intensive Tasks

86

0

0

23 Oct 2025

From Masks to Worlds: A Hitchhiker's Guide to World Models

From Masks to Worlds: A Hitchhiker's Guide to World Models

Ming-Hsuan Yang

185

2

0

23 Oct 2025

Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures

Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures

163

1

0

23 Oct 2025

Investigating LLM Capabilities on Long Context Comprehension for Medical Question Answering

Investigating LLM Capabilities on Long Context Comprehension for Medical Question Answering

Talia Tseriotou

192

0

0

21 Oct 2025

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

203

1

0

21 Oct 2025

Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval

Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval

Dim P. Papadopoulos

165

0

0

21 Oct 2025

DVAGen: Dynamic Vocabulary Augmented Generation

DVAGen: Dynamic Vocabulary Augmented Generation

80

0

0

20 Oct 2025

SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents

SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents

Angeline Budiman-Chan

Abdelrahman Zayed

LLMAG KELM AI4TS ELM

257

0

0

19 Oct 2025

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

Charu C. Aggarwal

564

2

0

19 Oct 2025

Stop-RAG: Value-Based Retrieval Control for Iterative RAG

Stop-RAG: Value-Based Retrieval Control for Iterative RAG

114

1

0

16 Oct 2025

An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation

An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation

Daniel Adu Worae

Spyridon Mastorakis

77

0

0

15 Oct 2025

Document Intelligence in the Era of Large Language Models: A Survey

Document Intelligence in the Era of Large Language Models: A Survey

Daniel Dahlmeier

190

1

0

15 Oct 2025

BitNet Distillation

BitNet Distillation

175

0

0

15 Oct 2025

Grounding Long-Context Reasoning with Contextual Normalization for Retrieval-Augmented Generation

Grounding Long-Context Reasoning with Contextual Normalization for Retrieval-Augmented Generation

Shuaiqiang Wang

196

0

0

15 Oct 2025

Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response

Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response

79

1

0

14 Oct 2025

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

200

0

0

14 Oct 2025

Investigating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries

Investigating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries

Gabrielle Kaili-May Liu

267

0

0

13 Oct 2025

BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices

BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices

Giovanni Beltrame

Ghaluh Indah Permata Sari

125

1

0

12 Oct 2025

Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs

Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs

Catherine C. Liu

120

0

0

12 Oct 2025

LinearRAG: Linear Graph Retrieval Augmented Generation on Large-scale Corpora

LinearRAG: Linear Graph Retrieval Augmented Generation on Large-scale Corpora

305

4

0

11 Oct 2025

KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance

KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance

Jonathan A. Karr Jr.

168

0

0

07 Oct 2025

Anytime-Valid Answer Sufficiency Certificates for LLM Generation via Sequential Information Lift

Anytime-Valid Answer Sufficiency Certificates for LLM Generation via Sequential Information Lift

Ibne Farabi Shihab

138

0

0

07 Oct 2025

Domain-Shift-Aware Conformal Prediction for Large Language Models

Domain-Shift-Aware Conformal Prediction for Large Language Models

Michael von Gablenz

137

2

0

07 Oct 2025

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering

193

0

0

07 Oct 2025

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Shubhangi Upasani

...

223

29

0

06 Oct 2025

1 2 3 4...16 17 18