The NLP Task Effectiveness of Long-Range Transformers
arXiv: 2202.07856 (v2, latest)

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
16 February 2022
Guanghui Qin
Yukun Feng
Benjamin Van Durme

Papers citing "The NLP Task Effectiveness of Long-Range Transformers"

21 / 21 papers shown
Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models
Julianna Piskorz
Cristina Pinneri
Alvaro H.C. Correia
Motasem Alfarra
Risheek Garrepalli
Christos Louizos
DiffM
98
0
0
26 Nov 2025
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs
Yan Wang
Penglei Gao
Shengyuan Lin
Jaisal Patel
Jeff Zhao
...
Lingfei Qian
J. Huang
Efstathia Soufleri
Xiao-Yang Liu
J. Nie
72
0
0
10 Oct 2025
AgentFlux: Decoupled Fine-Tuning & Inference for On-Device Agentic Systems
Rohan Kadekodi
Zhan Jin
Keisuke Kamahori
Yile Gu
Sean Khatiri
Noah H. Bayindirli
Sergey Gorbunov
Baris Kasikci
124
0
0
30 Sep 2025
Lost at the Beginning of Reasoning
Baohao Liao
Xinyi Chen
Sara Rajaee
Yuhui Xu
Christian Herold
Anders Søgaard
Maarten de Rijke
Christof Monz
LRM
154
4
0
27 Jun 2025
Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer Approach
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Rochana Chaturvedi
Peyman Baghershahi
Sourav Medya
Barbara Di Eugenio
310
1
0
23 Mar 2025
Path Pooling: Training-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation
Han Wang
Yuan Feng
Xike Xie
S. Kevin Zhou
250
0
0
07 Mar 2025
Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Amey Hengle
Prasoon Bajpai
Soham Dan
Tanmoy Chakraborty
LRM
188
5
0
19 Aug 2024
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation
Abe Bohan Hou
Orion Weller
Guanghui Qin
Eugene Yang
Dawn J Lawrie
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
AILaw ELM
277
18
0
24 Jun 2024
Attention Instruction: Amplifying Attention in the Middle via Prompting
Meiru Zhang
Zaiqiao Meng
Nigel Collier
228
8
0
24 Jun 2024
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling
Yu Bai
Xiyuan Zou
Heyan Huang
Sanxing Chen
Marc-Antoine Rondeau
Yang Gao
Jackie Chi Kit Cheung
179
7
0
17 Jun 2024
Are queries and keys always relevant? A case study on Transformer wave functions
Riccardo Rende
Luciano Loris Viteritti
234
11
0
29 May 2024
THREAD: Thinking Deeper with Recursive Spawning
Philip Schroeder
Nathaniel Morgan
Hongyin Luo
James R. Glass
LRM LLMAG ReLM
248
8
0
27 May 2024
Length-Aware Multi-Kernel Transformer for Long Document Classification
Guangzeng Han
Jack Tsao
Xiaolei Huang
VLM RALM
165
8
0
11 May 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Daojian Zeng
Kang Liu
Jun Zhao
LRM
238
16
0
28 Feb 2024
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
149
17
0
03 Oct 2023
Nugget: Neural Agglomerative Embeddings of Text
International Conference on Machine Learning (ICML), 2023
Guanghui Qin
Benjamin Van Durme
155
23
0
03 Oct 2023
Lost in the Middle: How Language Models Use Long Contexts
Transactions of the Association for Computational Linguistics (TACL), 2023
Nelson F. Liu
Kevin Lin
John Hewitt
Ashwin Paranjape
Michele Bevilacqua
Fabio Petroni
Abigail Z. Jacobs
RALM
463
2,452
0
06 Jul 2023
Personality Traits in Large Language Models
Gregory Serapio-García
Mustafa Safdari
Clément Crepy
Luning Sun
Stephen Fitz
P. Romero
Marwa Abdulhai
Aleksandra Faust
Maja J. Matarić
LM&MA LLMAG
609
173
0
01 Jul 2023
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Shaoxiong Ji
Tianlin Zhang
Kailai Yang
Sophia Ananiadou
Xiaoshi Zhong
Jörg Tiedemann
AI4MH ALM
147
37
0
20 Apr 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Interspeech (Interspeech), 2022
Yukun Feng
Ming Tu
Rui Xia
Chuanzeng Huang
Yuxuan Wang
RALM
188
0
0
30 Dec 2022
UL2: Unifying Language Learning Paradigms
International Conference on Learning Representations (ICLR), 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
429
354
0
10 May 2022