Long-range Language Modeling with Self-retrieval

Transactions of the Association for Computational Linguistics (TACL), 2023

23 June 2023

Papers citing "Long-range Language Modeling with Self-retrieval"

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

321

28 Nov 2025

CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Santosh T.Y.S.S

Youssef Tarek Elkhayat

160

07 Aug 2025

Associative Recurrent Memory Transformer

384

17 Feb 2025

Retrieval Augmented Spelling Correction for E-Commerce Applications

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

100

15 Oct 2024

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Neural Information Processing Systems (NeurIPS), 2024

Artyom Sorokin

RALM ALM LRM ReLM ELM

327

185

14 Jun 2024

Reliable, Adaptable, and Attributable Language Models with Retrieval

Akari Asai

Zexuan Zhong

Danqi Chen

Pang Wei Koh

Luke Zettlemoyer

Hanna Hajishirzi

Anuj Kumar

KELM RALM

378

05 Mar 2024

Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?

E. Razumovskaia

Ivan Vulić

Anna Korhonen

276

04 Mar 2024

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

455

16 Feb 2024

Accelerating Retrieval-Augmented Language Model Serving with Speculation

283

25 Jan 2024

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

Zezhong Wang

344

24 Jan 2024

The Faiss library

IEEE Transactions on Big Data (IEEE Trans. Big Data), 2024

Pierre-Emmanuel Mazaré

Maria Lomeli

Lucas Hosseini

Edouard Grave

956

538

16 Jan 2024

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

274

13 Oct 2023

CacheGen: KV Cache Compression and Streaming for Fast Language Model Serving

Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023

...

Ganesh Ananthanarayanan

752

173

11 Oct 2023

Making Retrieval-Augmented Language Models Robust to Irrelevant Context

International Conference on Learning Representations (ICLR), 2023

695

341

02 Oct 2023

Attention Sorting Combats Recency Bias In Long Context Language Models

A. Peysakhovich

Adam Lerer

LRM RALM

369

28 Sep 2023

Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models

692

29 Aug 2023

A Comprehensive Overview of Large Language Models

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

Saeed Anwar

Muhammad Usman

1.2K

1,456

12 Jul 2023

Lost in the Middle: How Language Models Use Long Contexts

Transactions of the Association for Computational Linguistics (TACL), 2023

707

3,198

06 Jul 2023