Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2112.04426
Cited By

Improving language models by retrieving from trillions of tokens

v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021

Sebastian Borgeaud

Jordan Hoffmann

Eliza Rutherford

George van den Driessche

Jean-Baptiste Lespiau

Diego de Las Casas

Saffron Huang

Lorenzo Maggiore

Michela Paganini

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to
the Edge of Generalization

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

379

74

0

23 May 2024

Automated Evaluation of Retrieval-Augmented Language Models with
Task-Specific Exam Generation

Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

Gauthier Guinet

Behrooz Omidvar-Tehrani

279

33

0

22 May 2024

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

Jiajie Jin

Chenghao Zhang

Tong Zhao

Zhao Yang

Zhicheng Dou

Ji-Rong Wen

414

139

0

22 May 2024

Towards Retrieval-Augmented Architectures for Image Captioning

Towards Retrieval-Augmented Architectures for Image Captioning

Marcella Cornia

Lorenzo Baraldi

Alessandro Nicolosi

241

18

0

21 May 2024

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

William Brandon

Aniruddha Nrusimha

Jonathan Ragan-Kelley

257

88

0

21 May 2024

Information Leakage from Embedding in Large Language Models

Information Leakage from Embedding in Large Language Models

250

7

0

20 May 2024

PyZoBot: A Platform for Conversational Information Extraction and
Synthesis from Curated Zotero Reference Libraries through Advanced
Retrieval-Augmented Generation

PyZoBot: A Platform for Conversational Information Extraction and Synthesis from Curated Zotero Reference Libraries through Advanced Retrieval-Augmented Generation

Walaa Abu Rukbah

128

0

0

13 May 2024

DuetRAG: Collaborative Retrieval-Augmented Generation

DuetRAG: Collaborative Retrieval-Augmented Generation

Jingsheng Huang

168

1

0

12 May 2024

Large Language Models for Education: A Survey

Large Language Models for Education: A Survey

Wensheng Gan

Philip S. Yu

320

52

0

12 May 2024

AIOS Compiler: LLM as Interpreter for Natural Language Programming and
Flow Programming of AI Agents

AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents

190

10

0

11 May 2024

Redefining Information Retrieval of Structured Database via Large
Language Models

Redefining Information Retrieval of Structured Database via Large Language Models

212

2

0

09 May 2024

FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference

FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference

346

1

0

07 May 2024

BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

406

55

0

01 May 2024

When to Retrieve: Teaching LLMs to Utilize Information Retrieval
Effectively

When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively

Tiziano Labruna

Jon Ander Campos

216

19

0

30 Apr 2024

HELPER-X: A Unified Instructable Embodied Agent to Tackle Four
Interactive Vision-Language Domains with Memory-Augmented Language Models

HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models

Gabriel H. Sarch

Michael J. Tarr

Katerina Fragkiadaki

269

6

0

29 Apr 2024

From Persona to Personalization: A Survey on Role-Playing Language
Agents

From Persona to Personalization: A Survey on Role-Playing Language Agents

...

Ziquan Fu

Yanghua Xiao

384

181

0

28 Apr 2024

Studying Large Language Model Behaviors Under Realistic Knowledge
Conflicts

Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts

Evgenii Kortukov

Alexander Rubinstein

1.2K

5

2

24 Apr 2024

Graph Machine Learning in the Era of Large Language Models (LLMs)

Graph Machine Learning in the Era of Large Language Models (LLMs)

...

429

43

0

23 Apr 2024

From Matching to Generation: A Survey on Generative Information Retrieval

From Matching to Generation: A Survey on Generative Information Retrieval

Xiaoxi Li

Jiajie Jin

Peitian Zhang

551

132

0

23 Apr 2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

Xin Jin

377

79

0

18 Apr 2024

A Survey on Retrieval-Augmented Text Generation for Large Language
Models

A Survey on Retrieval-Augmented Text Generation for Large Language Models

318

91

0

17 Apr 2024

SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA
of LLMs

SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs

322

83

0

17 Apr 2024

Vocabulary-free Image Classification and Semantic Segmentation

Vocabulary-free Image Classification and Semantic Segmentation

Alessandro Conti

221

7

0

16 Apr 2024

Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large
Language Model for Domain Question Answering

Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering

93

5

0

16 Apr 2024

Compression Represents Intelligence Linearly

Compression Represents Intelligence Linearly

216

39

0

15 Apr 2024

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually
Expanding Large Vocabularies

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

Shuyang Sun

Christian Schroeder de Witt

285

18

0

15 Apr 2024

Best Practices and Lessons Learned on Synthetic Data for Language Models

Best Practices and Lessons Learned on Synthetic Data for Language Models

Ruibo Liu

...

Diyi Yang

304

112

0

11 Apr 2024

Superposition Prompting: Improving and Accelerating Retrieval-Augmented
Generation

Superposition Prompting: Improving and Accelerating Retrieval-Augmented GenerationInternational Conference on Machine Learning (ICML), 2024

Mohammad Rastegari

358

13

0

10 Apr 2024

Privacy Preserving Prompt Engineering: A Survey

Privacy Preserving Prompt Engineering: A Survey

Kennedy Edemacu

380

37

0

09 Apr 2024

RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with
Multimodal Large Language Models

^2

: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

Michael Yu Wang

231

4

0

07 Apr 2024

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Jingyu Zhang

Benjamin Van Durme

Daniel Khashabi

582

13

0

05 Apr 2024

How Easily do Irrelevant Inputs Skew the Responses of Large Language
Models?

How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?

Yanghua Xiao

299

34

0

04 Apr 2024

Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing
Positional Bias in LLMs

Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing Positional Bias in LLMs

Fan Yang

97

9

0

01 Apr 2024

Source-Aware Training Enables Knowledge Attribution in Language Models

Source-Aware Training Enables Knowledge Attribution in Language Models

Muhammad Khalifa

Hao Peng

403

26

0

01 Apr 2024

SOAR: Improved Indexing for Approximate Nearest Neighbor Search

SOAR: Improved Indexing for Approximate Nearest Neighbor Search

Sanjiv Kumar

209

17

0

31 Mar 2024

Towards a Robust Retrieval-Based Summarization System

Towards a Robust Retrieval-Based Summarization System

Christopher G. Healey

184

13

0

29 Mar 2024

Quantum Natural Language Processing

Quantum Natural Language Processing

Dominic Widdows

Willie Aboumrad

303

8

0

28 Mar 2024

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in
Instructional Videos

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos

Hammad A. Ayyubi

219

4

0

27 Mar 2024

BLADE: Enhancing Black-box Large Language Models with Small
Domain-Specific Models

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

227

24

0

27 Mar 2024

Boosting Conversational Question Answering with Fine-Grained
Retrieval-Augmentation and Self-Check

Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check

Jie Zhou

183

31

0

27 Mar 2024

Cross-lingual Contextualized Phrase Retrieval

Cross-lingual Contextualized Phrase Retrieval

Zhi Qu

Hidetaka Kamigaito

Taro Watanabe

159

1

0

25 Mar 2024

Language Models Can Reduce Asymmetry in Information Markets

Language Models Can Reduce Asymmetry in Information Markets

Manuel Wüthrich

Bernhard Schölkopf

201

8

0

21 Mar 2024

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language
Models through Question Complexity

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

369

339

0

21 Mar 2024

FIT-RAG: Black-Box RAG with Factual Information and Token Reduction

FIT-RAG: Black-Box RAG with Factual Information and Token Reduction

196

18

0

21 Mar 2024

TAG: Guidance-free Open-Vocabulary Semantic Segmentation

TAG: Guidance-free Open-Vocabulary Semantic Segmentation

Yasufumi Kawano

Yoshimitsu Aoki

161

5

0

17 Mar 2024

DiPaCo: Distributed Path Composition

DiPaCo: Distributed Path Composition

Arthur Douillard

Rachita Chhaparia

MarcÁurelio Ranzato

235

6

0

15 Mar 2024

RAFT: Adapting Language Model to Domain Specific RAG

RAFT: Adapting Language Model to Domain Specific RAG

Tianjun Zhang

Shishir G. Patil

Matei A. Zaharia

Joseph E. Gonzalez

316

296

0

15 Mar 2024

DRAGIN: Dynamic Retrieval Augmented Generation based on the Information
Needs of Large Language Models

DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

3DV RALM AI4TS SyDa

321

49

0

15 Mar 2024

Borrowing Treasures from Neighbors: In-Context Learning for Multimodal
Learning with Missing Modalities and Data Scarcity

Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity

Adam Daneshmend

Andreas Demosthenous

Miguel R. D. Rodrigues

282

4

0

14 Mar 2024

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D
Prior

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D PriorComputer Vision and Pattern Recognition (CVPR), 2024

Cheng Chen

Chengzeng Feng

Chuan-Sheng Foo

Guosheng Lin

Fayao Liu

295

27

0

14 Mar 2024

1 2 3...8 9 10...16 17 18