v1v2v3 (latest)

Improving language models by retrieving from trillions of tokens

8 December 2021

George van den Driessche

Jean-Baptiste Lespiau

Saffron Huang

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Improving language models by retrieving from trillions of tokens"

50 / 893 papers shown

Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory

271

15 Nov 2023

Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Chengyu Wang

Cen Chen

179

12 Nov 2023

Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications

287

10 Nov 2023

AI-native Interconnect Framework for Integration of Large Language Model Technologies in 6G Systems

Sasu Tarkoma

Roberto Morabito

Jaakko Sauvola

357

10 Nov 2023

Evaluating Generative Ad Hoc Information Retrieval

...

419

08 Nov 2023

Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning

Sai Munikoti

Anurag Acharya

S. Wagle

Sameera Horawalavithana

LRM

137

07 Nov 2023

A Survey of Large Language Models Attribution

Baotian Hu

Min Zhang

284

07 Nov 2023

Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal MechanismConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Lang Cao

265

02 Nov 2023

Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation

Ta-Chung Chi

Ting-Han Fan

Alexander I. Rudnicky

124

01 Nov 2023

ChipNeMo: Domain-Adapted LLMs for Chip Design

...

746

229

31 Oct 2023

Defining a New NLP PlaygroundConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

Heng Ji

380

31 Oct 2023

General-Purpose Retrieval-Enhanced Medical Prediction Model Using Near-Infinite HistoryMachine Learning in Health Care (MLHC), 2023

Junu Kim

Edward Choi

364

31 Oct 2023

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

...

248

29 Oct 2023

Knowledge Corpus Error in Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yejoon Lee

Philhoon Oh

Hyunjung Shim

149

27 Oct 2023

Woodpecker: Hallucination Correction for Multimodal Large Language ModelsScience China Information Sciences (Sci China Inf Sci), 2023

Enhong Chen

335

197

24 Oct 2023

Large Search Model: Redefining Search Stack in the Era of LLMs

Liang Wang

227

23 Oct 2023

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual AdapterConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

207

23 Oct 2023

The Law and NLP: Bridging Disciplinary DisconnectsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

217

22 Oct 2023

Knowledge-Augmented Language Model Verification

167

19 Oct 2023

Reliable Academic Conference Question Answering: A Study Based on Large Language Model

197

19 Oct 2023

Emptying the Ocean with a Spoon: Should We Edit Models?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yuval Pinter

Michael Elhadad

KELM

233

18 Oct 2023

If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in HistoryWorkshop on Computational Humanities Research (CHR), 2023

Giselle Gonzalez Garcia

Christian D. Weilbach

16 Oct 2023

RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

272

16 Oct 2023

Farzi Data: Autoregressive Data Distillation

Julian McAuley

249

15 Oct 2023

Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models

409

15 Oct 2023

CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering

Md. Rony

Christian Suess

Sinchana Ramakanth Bhat

223

14 Oct 2023

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination DetectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

224

13 Oct 2023

MemGPT: Towards LLMs as Operating Systems

1.7K

321

12 Oct 2023

InstructRetro: Instruction Tuning post Retrieval-Augmented PretrainingInternational Conference on Machine Learning (ICML), 2023

466

11 Oct 2023

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Luiza Amador Pozzobon

Beyza Ermis

Patrick Lewis

Sara Hooker

296

11 Oct 2023

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

...

Xing Xie

450

258

11 Oct 2023

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent AdvancesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zihan Zhang

Meng Fang

Lingxi Chen

Mohammad-Reza Namazi-Rad

Jun Wang

KELM

235

11 Oct 2023

CacheGen: KV Cache Compression and Streaming for Fast Language Model ServingConference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023

...

Ganesh Ananthanarayanan

566

141

11 Oct 2023

Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding

272

10 Oct 2023

Text Embeddings Reveal (Almost) As Much As TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

294

165

10 Oct 2023

SALMON: Self-Alignment with Instructable Reward ModelsInternational Conference on Learning Representations (ICLR), 2023

Chuang Gan

353

09 Oct 2023

What do larger image classifiers memorise?

Sanjiv Kumar

258

09 Oct 2023

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source ModelNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Zhiyuan Liu

216

08 Oct 2023

Self-Knowledge Guided Retrieval Augmentation for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yile Wang

Peng Li

Maosong Sun

Yang Liu

RALM KELM

241

08 Oct 2023

Prompt-augmented Temporal Point Process for Streaming Event SequenceNeural Information Processing Systems (NeurIPS), 2023

270

08 Oct 2023

The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning

Jonathan Ragan-Kelley

Gintare Karolina Dziugaite

LRM

331

07 Oct 2023

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation

Fangyuan Xu

Weijia Shi

Eunsol Choi

RALM

341

221

06 Oct 2023

Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

454

06 Oct 2023

Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-ReviseAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Sadao Kurohashi

246

05 Oct 2023

FreshLLMs: Refreshing Large Language Models with Search Engine AugmentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

...

535

300

05 Oct 2023

Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human PreferenceEducational Data Mining (EDM), 2023

Chenglu Li

Wanli Xing

231

04 Oct 2023

Retrieval meets Long Context Large Language ModelsInternational Conference on Learning Representations (ICLR), 2023

458

111

04 Oct 2023

RA-DIT: Retrieval-Augmented Dual Instruction TuningInternational Conference on Learning Representations (ICLR), 2023

Weijia Shi

...

Luke Zettlemoyer

430

208

02 Oct 2023

BTR: Binary Token Representations for Efficient Retrieval Augmented Language ModelsInternational Conference on Learning Representations (ICLR), 2023

224

02 Oct 2023

Quantifying the Plausibility of Context Reliance in Neural Machine TranslationInternational Conference on Learning Representations (ICLR), 2023

292

02 Oct 2023