Improving language models by retrieving from trillions of tokens

8 December 2021

George van den Driessche

Jean-Baptiste Lespiau

Saffron Huang

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown

M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?

219

27 Mar 2025

The cell as a token: high-dimensional geometry in language models and cell embeddings

Bioinformatics (Bioinformatics), 2025

William Gilpin

416

26 Mar 2025

ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses

Esmail Gumaan

MoE

317

23 Mar 2025

OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery

...

572

22 Mar 2025

JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

385

18 Mar 2025

HDLCoRe: A Training-Free Framework for Mitigating Hallucinations in LLM-Generated HDL

...

252

18 Mar 2025

OSCAR: Online Soft Compression And Reranking

291

17 Mar 2025

A Survey on Knowledge-Oriented Retrieval-Augmented Generation

...

374

11 Mar 2025

LLM-based Corroborating and Refuting Evidence Retrieval for Scientific Claim Verification

192

11 Mar 2025

Leveraging Approximate Caching for Faster Retrieval-Augmented Generation

307

07 Mar 2025

Language modelling techniques for analysing the impact of human genetic variation

Bioinformatics and Biology Insights (BBI), 2025

Megha Hegde

Jean-Christophe Nebel

Farzana Rahman

141

07 Mar 2025

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

302

06 Mar 2025

Streaming Video Question-Answering with In-context Video KV-Cache Retrieval

International Conference on Learning Representations (ICLR), 2025

212

01 Mar 2025

TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

...

994

28 Feb 2025

RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings

Computer Vision and Pattern Recognition (CVPR), 2025

415

27 Feb 2025

Do Retrieval-Augmented Language Models Adapt to Varying User Needs?

423

27 Feb 2025

From Retrieval to Generation: Comparing Different Approaches

279

27 Feb 2025

PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation

330

27 Feb 2025

Bián: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation

697

26 Feb 2025

MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks

Saikrishna Sanniboina

391

25 Feb 2025

Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

355

24 Feb 2025

Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid

218

24 Feb 2025

Forecasting Rare Language Model Behaviors

304

24 Feb 2025

PICASO: Permutation-Invariant Context Composition with State Space Models

International Conference on Learning Representations (ICLR), 2025

506

24 Feb 2025

Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals

495

22 Feb 2025

A Survey of Model Architectures in Information Retrieval

585

20 Feb 2025

A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems

A. Hannibal Hamdallahi

194

20 Feb 2025

Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

178

20 Feb 2025

Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

203

19 Feb 2025

A-MEM: Agentic Memory for LLM Agents

1.0K

195

17 Feb 2025

Associative Recurrent Memory Transformer

302

17 Feb 2025

CiteCheck: Towards Accurate Citation Faithfulness Detection

181

15 Feb 2025

KIMAs: A Configurable Knowledge Integrated Multi-Agent System

402

13 Feb 2025

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

586

10 Feb 2025

Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy

Rishabh Upadhyay

Marco Viviani

433

07 Feb 2025

Efficient Knowledge Feeding to Language Models: A Novel Integrated Encoder-Decoder Architecture

259

07 Feb 2025

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models

The Web Conference (WWW), 2025

...

675

02 Feb 2025

RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

633

30 Jan 2025

SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

...

529

28 Jan 2025

Parametric Retrieval Augmented Generation

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

250

28 Jan 2025

Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation

International Conference on Computational Linguistics (COLING), 2024

318

28 Jan 2025

On Storage Neural Network Augmented Approximate Nearest Neighbor Search

Taiga Ikeda

Daisuke Miyashita

J. Deguchi

191

23 Jan 2025

A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models

...

555

21 Jan 2025

SteLLA: A Structured Grading System Using LLMs with RAG

BigData Congress [Services Society] (BSS), 2024

405

17 Jan 2025

Parallel Key-Value Cache Fusion for Position Invariant RAG

1.0K

13 Jan 2025

267

06 Jan 2025

A Unified Framework for Context-Aware IoT Management and State-of-the-Art IoT Traffic Anomaly Detection

Daniel Adu Worae

Athar Sheikh

Spyridon Mastorakis

258

31 Dec 2024

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

250

19 Dec 2024

RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

217

17 Dec 2024

IGR: Improving Diffusion Model for Garment Restoration from Person Image

356

16 Dec 2024