v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021

George van den Driessche

Jean-Baptiste Lespiau

Saffron Huang

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown

Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors

Defu Lian

192

14 Mar 2024

Development of a Reliable and Accessible Caregiving Language Model (CaLM)

111

11 Mar 2024

PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design

Tim Kraska

227

08 Mar 2024

LLMs in the Imaginarium: Tool Learning through Simulated Trial and ErrorAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

225

07 Mar 2024

RATSF: Empowering Customer Service Volume Management through Retrieval-Augmented Time-Series Forecasting

Tianfeng Wang

Gaojie Cui

AI4TS

246

07 Mar 2024

MeaCap: Memory-Augmented Zero-shot Image Captioning

304

06 Mar 2024

Reliable, Adaptable, and Attributable Language Models with Retrieval

Akari Asai

Zexuan Zhong

Danqi Chen

Pang Wei Koh

Luke Zettlemoyer

Hanna Hajishirzi

Anuj Kumar

KELM RALM

322

05 Mar 2024

FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

183

04 Mar 2024

Retrieval-Augmented Generation for AI-Generated Content: A Survey

950

454

29 Feb 2024

RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval

Kaiyue Wen

Xingyu Dang

Kaifeng Lyu

421

28 Feb 2024

VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models

Jinyoung Yeo

161

28 Feb 2024

A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems

396

152

28 Feb 2024

Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

Ahmed Hassan Awadallah

Jennifer Neville

Nikhil Rao

235

27 Feb 2024

JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability

419

27 Feb 2024

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Zhenting Qi

269

27 Feb 2024

Retrieval is Accurate Generation

Leyang Cui

Wei Bi

400

27 Feb 2024

Long-Context Language Modeling with Parallel Context Encoding

Howard Yen

Tianyu Gao

Danqi Chen

327

26 Feb 2024

LLM Inference Unveiled: Survey and Roofline Model Insights

Zhihang Yuan

Yuzhang Shang

Yang Zhou

Zhen Dong

Zhe Zhou

...

Yong Jae Lee

Yan Yan

Beidi Chen

Guangyu Sun

Kurt Keutzer

623

149

26 Feb 2024

RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

Ran Xu

265

25 Feb 2024

The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)

...

350

138

23 Feb 2024

DEEM: Dynamic Experienced Expert Modeling for Stance Detection

Xiaolong Wang

Yile Wang

Sijie Cheng

Peng Li

Yang Liu

176

23 Feb 2024

Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models

Kang Liu

Jun Zhao

688

22 Feb 2024

OpenTab: Advancing Large Language Models as Open-domain Table Reasoners

George Karypis

328

22 Feb 2024

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Li Lyna Zhang

Fan Yang

225

260

21 Feb 2024

ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling

222

21 Feb 2024

RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian

136

20 Feb 2024

Instruction-tuned Language Models are Better Knowledge Learners

Weijia Shi

Graham Neubig

294

20 Feb 2024

Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification

156

19 Feb 2024

BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence

Jiajie Jin

306

19 Feb 2024

EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

Heng Ji

212

17 Feb 2024

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

378

16 Feb 2024

OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models

Ali AhmadiTeshnizi

Wenzhi Gao

Madeleine Udell

LLMAG

211

15 Feb 2024

Context Composing for Full Line Code Completion

Anton Semenkin

Yaroslav Sokolov

Evgeniia Vu

14 Feb 2024

Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

Mathias Kraus

235

13 Feb 2024

Nearest Neighbour Score Estimators for Diffusion Generative Models

...

182

12 Feb 2024

PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models

427

12 Feb 2024

Retrieval-Augmented Thought Process as Sequential Decision Making

114

12 Feb 2024

Prompt Perturbation in Retrieval-Augmented Generation based Large Language ModelsKnowledge Discovery and Data Mining (KDD), 2024

Liming Zhu

213

11 Feb 2024

ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation

Zuobai Zhang

Jiarui Lu

Vijil Chenthamarakshan

Aurélie C. Lozano

Payel Das

Jian Tang

163

10 Feb 2024

Large Language Models: A Survey

844

762

09 Feb 2024

Memory Consolidation Enables Long-Context Video Understanding

461

08 Feb 2024

DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton

394

06 Feb 2024

Retrieve to Explain: Evidence-driven Predictions for Explainable Drug Target IdentificationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

260

06 Feb 2024

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

...

347

06 Feb 2024

Retrieval-Augmented Score Distillation for Text-to-3D GenerationInternational Conference on Machine Learning (ICML), 2024

277

05 Feb 2024

IllusionX: An LLM-powered mixed reality personal companion

197

04 Feb 2024

Factuality of Large Language Models in the Year 2024

Yuxia Wang

Minghan Wang

Muhammad Arslan Manzoor

218

04 Feb 2024

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

518

162

02 Feb 2024

Retrieval Augmented End-to-End Spoken Dialog Models

Dian Yu

Laurent El Shafey

RALM AuLLM

214

02 Feb 2024

CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks

Xiaoxi Li

204

02 Feb 2024