Long-range Language Modeling with Self-retrieval

Transactions of the Association for Computational Linguistics (TACL), 2023
23 June 2023
Ohad Rubin
Jonathan Berant
RALM, KELM

Papers citing "Long-range Language Modeling with Self-retrieval"

18 / 18 papers shown
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models
X. S. Hu
Zhanchao Zhou
Ruiqi Liang
Zehuan Li
Wei Wu
Jianguo Li
321
1
0
28 Nov 2025
CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Santosh T.Y.S.S
Youssef Tarek Elkhayat
Oana Ichim
Pranav Shetty
Dongsheng Wang
Zhiqiang Ma
Armineh Nourbakhsh
Xiaomo Liu
160
4
0
07 Aug 2025
Associative Recurrent Memory Transformer
Ivan Rodkin
Yuri Kuratov
Aydar Bulatov
Andrey Kravchenko
384
13
0
17 Feb 2025
Retrieval Augmented Spelling Correction for E-Commerce Applications
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xuan Guo
Rohit Patki
Dante Everaert
Christopher Potts
100
0
0
15 Oct 2024
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Neural Information Processing Systems (NeurIPS), 2024
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM, ALM, LRM, ReLM, ELM
327
185
0
14 Jun 2024
Reliable, Adaptable, and Attributable Language Models with Retrieval
Akari Asai
Zexuan Zhong
Danqi Chen
Pang Wei Koh
Luke Zettlemoyer
Hanna Hajishirzi
Anuj Kumar
KELM, RALM
378
83
0
05 Mar 2024
Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?
E. Razumovskaia
Ivan Vulić
Anna Korhonen
276
15
0
04 Mar 2024
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALM
455
46
0
16 Feb 2024
Accelerating Retrieval-Augmented Language Model Serving with Speculation
Zhihao Zhang
Alan Zhu
Lijie Yang
Yihua Xu
Lanting Li
P. Phothilimthana
Zhihao Jia
RALM, KELM
283
25
0
25 Jan 2024
UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems
Hongru Wang
Wenyu Huang
Yang Deng
Rui Wang
Zezhong Wang
Yufei Wang
Fei Mi
Jeff Z. Pan
Kam-Fai Wong
RALM
344
54
0
24 Jan 2024
The Faiss library
IEEE Transactions on Big Data (IEEE Trans. Big Data), 2024
Matthijs Douze
Alexandr Guzhva
Chengqi Deng
Jeff Johnson
Gergely Szilvasy
Pierre-Emmanuel Mazaré
Maria Lomeli
Lucas Hosseini
Edouard Grave
956
538
0
16 Jan 2024
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sehyun Choi
Tianqing Fang
Zhaowei Wang
Yangqiu Song
274
58
0
13 Oct 2023
CacheGen: KV Cache Compression and Streaming for Fast Language Model Serving
Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023
Yuhan Liu
Hanchen Li
Yihua Cheng
Siddhant Ray
Yuyang Huang
...
Ganesh Ananthanarayanan
Michael Maire
Henry Hoffmann
Ari Holtzman
Junchen Jiang
752
173
0
11 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
International Conference on Learning Representations (ICLR), 2023
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM, LRM
695
341
0
02 Oct 2023
Attention Sorting Combats Recency Bias In Long Context Language Models
A. Peysakhovich
Adam Lerer
LRM, RALM
369
94
0
28 Sep 2023
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
Qingyue Wang
Y. Fu
Yanan Cao
Zhiliang Tian
Dacheng Tao
LLMAG, KELM, RALM
692
56
0
29 Aug 2023
A Comprehensive Overview of Large Language Models
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Lin Wang
OffRL
1.2K
1,456
0
12 Jul 2023
Lost in the Middle: How Language Models Use Long Contexts
Transactions of the Association for Computational Linguistics (TACL), 2023
Nelson F. Liu
Kevin Lin
John Hewitt
Ashwin Paranjape
Michele Bevilacqua
Fabio Petroni
Abigail Z. Jacobs
RALM
707
3,198
0
06 Jul 2023