v1v2 (latest)

Efficient Attentions for Long Document Summarization

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

5 April 2021

L. Huang

Shuyang Cao

Nikolaus Nova Parulian

Heng Ji

Lu Wang

ArXiv (abs)PDF HTML

Papers citing "Efficient Attentions for Long Document Summarization"

50 / 220 papers shown

What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models

...

156

03 Dec 2025

SpecPV: Improving Self-Speculative Decoding for Long-Context Generation via Partial Verification

180

02 Dec 2025

ScaleFormer: Span Representation Cumulation for Long-Context Transformer

Jiangshu Du

Wenpeng Yin

Philip S. Yu

13 Nov 2025

SynClaimEval: A Framework for Evaluating the Utility of Synthetic Data in Long-Context Claim Verification

Mohamed Elaraby

Jyoti Prakash Maheswari

SyDa

143

12 Nov 2025

Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies

305

02 Nov 2025

Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models

Sriram Balasubramaniam

420

29 Oct 2025

Citation Failure: Definition, Analysis and Efficient Mitigation

Jan Buchmann

Iryna Gurevych

150

23 Oct 2025

Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference

252

21 Oct 2025

AcademicEval: Live Long-Context LLM Benchmark

162

20 Oct 2025

Taming the Fragility of KV Cache Eviction in LLM Inference

199

15 Oct 2025

LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA

Tommaso Bonomo

Luca Gioffrè

Roberto Navigli

169

15 Oct 2025

Quality Estimation Reranking for Document-Level Translation

Krzysztof Mrozinski

Minji Kang

Ahmed Khota

Vincent Michael Sutanto

Giovanni Gatti De Giacomo

171

10 Oct 2025

Revisiting Long-context Modeling from Context Denoising Perspective

208

07 Oct 2025

Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

223

06 Oct 2025

Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling

140

26 Sep 2025

Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization

258

25 Sep 2025

Mamba Modulation: On the Length Generalization of Mamba

372

23 Sep 2025

Long document summarization using page specific target text alignment and distilling page importance

Pushpa Devi

Ayush Agrawal

Ashutosh Dubey

C. Ravindranath Chowdary

RALM

163

20 Sep 2025

Value-Guided KV Compression for LLMs via Approximated CUR Decomposition

Ayan Sengupta

Siddhant Chaudhary

Tanmoy Chakraborty

187

18 Sep 2025

ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

166

16 Sep 2025

HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking

294

15 Sep 2025

A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator

Elahe Delavari

Feeza Khan Khanzada

Jaerock Kwon

250

10 Sep 2025

HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models

202

05 Sep 2025

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Krishna Teja Chitty-Venkata

173

04 Sep 2025

From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users

396

24 Aug 2025

Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models

244

21 Aug 2025

SCOPE: A Generative Approach for LLM Prompt Compression

Tinghui Zhang

Yifan Wang

Daisy Zhe Wang

160

16 Aug 2025

Flora: Effortless Context Construction to Arbitrary Length and Scale

374

26 Jul 2025

Smooth Reading: Bridging the Gap of Recurrent LLM to Self-Attention LLM on Long-Context Tasks

297

25 Jul 2025

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

...

Blaise Agüera y Arcas

João Sacramento

402

05 Jun 2025

Towards Multi-dimensional Evaluation of LLM Summarization across Domains and LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

296

31 May 2025

SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences

436

27 May 2025

Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration

430

27 May 2025

MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

386

26 May 2025

Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query

477

24 May 2025

MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing

Juncheng Yang

Yue Cheng

332

24 May 2025

Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization

672

21 May 2025

Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification

364

19 May 2025

A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting

Lhuqita Fazry

VLM

535

11 May 2025

Rethinking Memory in LLM based Agents: Representations, Operations, and Emerging Topics

799

01 May 2025

HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection

465

01 May 2025

The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance

387

04 Apr 2025

Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction

Ryan Dempsey

Jonathan Ethier

Halim Yanikomeroglu

308

04 Apr 2025

ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback

263

27 Mar 2025

WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference

471

23 Mar 2025

GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments

363

18 Mar 2025

A Survey on Transformer Context Extension: Approaches and Evaluation

585

17 Mar 2025

AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation

504

13 Mar 2025

LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

449

04 Mar 2025

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

370

04 Mar 2025