v1v2 (latest)

The NLP Task Effectiveness of Long-Range Transformers

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

16 February 2022

Papers citing "The NLP Task Effectiveness of Long-Range Transformers"

21 / 21 papers shown

Title
Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models Julianna Piskorz Cristina Pinneri Alvaro H.C. Correia Motasem Alfarra Risheek Garrepalli Christos Louizos DiffM 98 0 0 26 Nov 2025
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs Yan Wang Penglei Gao Shengyuan Lin Jaisal Patel Jeff Zhao ... Lingfei Qian J. Huang Efstathia Soufleri Xiao-Yang Liu J. Nie 72 0 0 10 Oct 2025
AgentFlux: Decoupled Fine-Tuning & Inference for On-Device Agentic Systems Rohan Kadekodi Zhan Jin Keisuke Kamahori Yile Gu Sean Khatiri Noah H. Bayindirli Sergey Gorbunov Baris Kasikci 124 0 0 30 Sep 2025
Lost at the Beginning of Reasoning Baohao Liao Xinyi Chen Sara Rajaee Yuhui Xu Christian Herold Anders Søgaard Maarten de Rijke Christof Monz LRM 154 4 0 27 Jun 2025
Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer ApproachAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Rochana Chaturvedi Peyman Baghershahi Sourav Medya Barbara Di Eugenio 310 1 0 23 Mar 2025
Path Pooling: Training-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation Han Wang Yuan Feng Xike Xie S.Kevin Zhou 250 0 0 07 Mar 2025
Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Amey Hengle Prasoon Bajpai Soham Dan Tanmoy Chakraborty LRM 188 5 0 19 Aug 2024
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation Abe Bohan Hou Orion Weller Guanghui Qin Eugene Yang Dawn J Lawrie Nils Holzenberger Andrew Blair-Stanek Benjamin Van Durme AILaw ELM 277 18 0 24 Jun 2024
Attention Instruction: Amplifying Attention in the Middle via Prompting Meiru Zhang Zaiqiao Meng Nigel Collier 228 8 0 24 Jun 2024
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling Yu Bai Xiyuan Zou Heyan Huang Sanxing Chen Marc-Antoine Rondeau Yang Gao Jackie Chi Kit Cheung 179 7 0 17 Jun 2024
Are queries and keys always relevant? A case study on Transformer wave functions Riccardo Rende Luciano Loris Viteritti 234 11 0 29 May 2024
THREAD: Thinking Deeper with Recursive Spawning Philip Schroeder Nathaniel Morgan Hongyin Luo James R. Glass LRM LLMAG ReLM 248 8 0 27 May 2024
Length-Aware Multi-Kernel Transformer for Long Document Classification Guangzeng Han Jack Tsao Xiaolei Huang VLM RALM 165 8 0 11 May 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning Jiachun Li Pengfei Cao Chenhao Wang Zhuoran Jin Yubo Chen Daojian Zeng Kang Liu Jun Zhao LRM 238 16 0 28 Feb 2024
Dodo: Dynamic Contextual Compression for Decoder-only LMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Guanghui Qin Corby Rosset Ethan C. Chau Nikhil Rao Benjamin Van Durme 149 17 0 03 Oct 2023
Nugget: Neural Agglomerative Embeddings of TextInternational Conference on Machine Learning (ICML), 2023 Guanghui Qin Benjamin Van Durme 155 23 0 03 Oct 2023
Lost in the Middle: How Language Models Use Long ContextsTransactions of the Association for Computational Linguistics (TACL), 2023 Nelson F. Liu Kevin Lin John Hewitt Ashwin Paranjape Michele Bevilacqua Fabio Petroni Abigail Z. Jacobs RALM 463 2,452 0 06 Jul 2023
Personality Traits in Large Language Models Gregory Serapio-García Mustafa Safdari Clément Crepy Luning Sun Stephen Fitz P. Romero Marwa Abdulhai Aleksandra Faust Maja J. Matarić LM&MA LLMAG 609 173 0 01 Jul 2023
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health Shaoxiong Ji Tianlin Zhang Kailai Yang Sophia Ananiadou Xiaoshi Zhong Jörg Tiedemann AI4MH ALM 147 37 0 20 Apr 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech RecognitionInterspeech (Interspeech), 2022 Yukun Feng Ming Tu Rui Xia Chuanzeng Huang Yuxuan Wang RALM 188 0 0 30 Dec 2022
UL2: Unifying Language Learning ParadigmsInternational Conference on Learning Representations (ICLR), 2022 Yi Tay Mostafa Dehghani Vinh Q. Tran Xavier Garcia Jason W. Wei ... Tal Schuster H. Zheng Denny Zhou N. Houlsby Donald Metzler AI4CE 429 354 0 10 May 2022