ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.02112
  4. Cited By
Efficient Attentions for Long Document Summarization
v1v2 (latest)

Efficient Attentions for Long Document Summarization

North American Chapter of the Association for Computational Linguistics (NAACL), 2021
5 April 2021
L. Huang
Shuyang Cao
Nikolaus Nova Parulian
Heng Ji
Lu Wang
ArXiv (abs)PDFHTML

Papers citing "Efficient Attentions for Long Document Summarization"

50 / 220 papers shown
What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models
What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models
Tianchen Deng
Yue Pan
Shenghai Yuan
Dong Li
Chen Wang
...
Danwei W. Wang
Jingchuan Wang
Javier Civera
Hesheng Wang
Weidong Chen
156
17
0
03 Dec 2025
SpecPV: Improving Self-Speculative Decoding for Long-Context Generation via Partial Verification
SpecPV: Improving Self-Speculative Decoding for Long-Context Generation via Partial Verification
Zhendong Tan
Xingjun Zhang
Chaoyi Hu
Junjie Peng
Kun Xia
LRM
180
0
0
02 Dec 2025
ScaleFormer: Span Representation Cumulation for Long-Context Transformer
ScaleFormer: Span Representation Cumulation for Long-Context Transformer
Jiangshu Du
Wenpeng Yin
Philip S. Yu
96
0
0
13 Nov 2025
SynClaimEval: A Framework for Evaluating the Utility of Synthetic Data in Long-Context Claim Verification
SynClaimEval: A Framework for Evaluating the Utility of Synthetic Data in Long-Context Claim Verification
Mohamed Elaraby
Jyoti Prakash Maheswari
SyDa
143
0
0
12 Nov 2025
Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies
Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies
Yuxuan Hu
Jianchao Tan
Jiaqi Zhang
Wen Zan
Pingwei Sun
Yifan Lu
Yerui Sun
Yuchen Xie
Xunliang Cai
Jing Zhang
305
0
0
02 Nov 2025
Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models
Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models
Sriram Balasubramaniam
S. Basu
Koustava Goswami
Ryan Rossi
Varun Manjunatha
Roshan Santhosh
Ruiyi Zhang
Soheil Feizi
Nedim Lipka
LRMReLM
420
1
0
29 Oct 2025
Citation Failure: Definition, Analysis and Efficient Mitigation
Citation Failure: Definition, Analysis and Efficient Mitigation
Jan Buchmann
Iryna Gurevych
150
0
0
23 Oct 2025
Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference
Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference
Siyuan Yan
Guo-qing Jiang
Y. Zhang
Xiaoxing Ma
Ran Zhu
Chun Cao
Jingwei Xu
OffRL
252
2
0
21 Oct 2025
AcademicEval: Live Long-Context LLM Benchmark
AcademicEval: Live Long-Context LLM Benchmark
Haozhen Zhang
Tao Feng
Pengrui Han
Jiaxuan You
162
4
0
20 Oct 2025
Taming the Fragility of KV Cache Eviction in LLM Inference
Taming the Fragility of KV Cache Eviction in LLM Inference
Yuan Feng
Haoyu Guo
Junlin Lv
S.Kevin Zhou
Xike Xie
199
2
0
15 Oct 2025
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA
Tommaso Bonomo
Luca Gioffrè
Roberto Navigli
169
3
0
15 Oct 2025
Quality Estimation Reranking for Document-Level Translation
Quality Estimation Reranking for Document-Level Translation
Krzysztof Mrozinski
Minji Kang
Ahmed Khota
Vincent Michael Sutanto
Giovanni Gatti De Giacomo
171
6
0
10 Oct 2025
Revisiting Long-context Modeling from Context Denoising Perspective
Revisiting Long-context Modeling from Context Denoising Perspective
Zecheng Tang
Baibei Ji
Juntao Li
Lijun Wu
Haijia Gui
Min Zhang
208
1
0
07 Oct 2025
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
Hybrid Architectures for Language Models: Systematic Analysis and Design Insights
Sangmin Bae
Bilge Acun
Haroun Habeeb
S. Kim
Chien-Yu Lin
Liang Luo
Junjie Wang
Carole-Jean Wu
Mamba
223
6
0
06 Oct 2025
Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling
Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling
Ye Qiao
Haocheng Xu
Xiaofan Zhang
Sitao Huang
MQ
140
0
0
26 Sep 2025
Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization
Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
LLMAGAAMLRALM
258
1
0
25 Sep 2025
Mamba Modulation: On the Length Generalization of Mamba
Mamba Modulation: On the Length Generalization of Mamba
Peng Lu
Jerry Huang
Qiuhao Zeng
X. Wang
Boxing Wang
Philippe Langlais
Yufei Cui
Mamba
372
0
0
23 Sep 2025
Long document summarization using page specific target text alignment and distilling page importance
Long document summarization using page specific target text alignment and distilling page importance
Pushpa Devi
Ayush Agrawal
Ashutosh Dubey
C. Ravindranath Chowdary
RALM
163
0
0
20 Sep 2025
Value-Guided KV Compression for LLMs via Approximated CUR Decomposition
Value-Guided KV Compression for LLMs via Approximated CUR Decomposition
Ayan Sengupta
Siddhant Chaudhary
Tanmoy Chakraborty
MQ
187
1
0
18 Sep 2025
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
Hengrui Zhang
Yulong Hui
Yihao Liu
Huanchen Zhang
OffRL
166
0
0
16 Sep 2025
HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking
HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking
Wensheng Lu
Keyu Chen
Ruizhi Qiao
Xing Sun
294
5
0
15 Sep 2025
A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator
A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator
Elahe Delavari
Feeza Khan Khanzada
Jaerock Kwon
250
4
0
10 Sep 2025
HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models
HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models
Chang Dai
Hongyu Shan
Mingyang Song
Di Liang
202
3
0
05 Sep 2025
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference
Krishna Teja Chitty-Venkata
Jie Ye
Xian-He Sun
Anthony Kougkas
M. Emani
V. Vishwanath
Bogdan Nicolae
173
3
0
04 Sep 2025
From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users
From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users
Sadia Sultana Chowa
Riasad Alvi
Subhey Sadi Rahman
M. R
M. R
M. Islam
Mukhtar Hussain
Sami Azam
LLMAGLM&RoELM
396
20
0
24 Aug 2025
Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models
Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models
Tobias Schreieder
Tim Schopf
Michael Färber
HILM
244
5
0
21 Aug 2025
SCOPE: A Generative Approach for LLM Prompt Compression
SCOPE: A Generative Approach for LLM Prompt Compression
Tinghui Zhang
Yifan Wang
Daisy Zhe Wang
160
3
0
16 Aug 2025
Flora: Effortless Context Construction to Arbitrary Length and Scale
Flora: Effortless Context Construction to Arbitrary Length and Scale
Tianxiang Chen
Zhentao Tan
Xiaofan Bo
Yue Wu
Tao Gong
Qi Chu
Jieping Ye
Nenghai Yu
CLLLRM
374
2
0
26 Jul 2025
Smooth Reading: Bridging the Gap of Recurrent LLM to Self-Attention LLM on Long-Context Tasks
Smooth Reading: Bridging the Gap of Recurrent LLM to Self-Attention LLM on Long-Context Tasks
Kai Liu
Zhan Su
Peijie Dong
Fengran Mo
Jianfei Gao
ShaoTing Zhang
Kai-xiang Chen
297
2
0
25 Jul 2025
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
J. Oswald
Nino Scherrer
Seijin Kobayashi
Luca Versari
Songlin Yang
...
Guillaume Lajoie
Charlotte Frenkel
Razvan Pascanu
Blaise Agüera y Arcas
João Sacramento
402
22
0
05 Jun 2025
Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages
Towards Multi-dimensional Evaluation of LLM Summarization across Domains and LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Hyangsuk Min
Yuho Lee
Minjeong Ban
Jiaqi Deng
Nicole Hee-Yeon Kim
Taewon Yun
Hang Su
Jason (Jinglun) Cai
Hwanjun Song
ELM
296
7
0
31 May 2025
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences
Jungyoub Cha
Hyunjong Kim
Sungzoon Cho
VLM
436
1
0
27 May 2025
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Sibo Xiao
Zixin Lin
Wenyang Gao
Hui Chen
Yue Zhang
LLMAG
430
4
0
27 May 2025
MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models
MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhongzhan Huang
Guoming Ling
Shanshan Zhong
Hefeng Wu
Liang Lin
386
1
0
26 May 2025
Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query
Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query
Yixuan Wang
Shiyu Ji
Yijun Liu
Yuzhuang Xu
Yang Xu
Qingfu Zhu
Wanxiang Che
477
5
0
24 May 2025
MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing
MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Quantized Layer Swapping and KV Cache Resizing
Zhaoyuan Su
Tingfeng Lan
Zirui Wang
Juncheng Yang
Yue Cheng
Juncheng Yang
Yue Cheng
332
1
0
24 May 2025
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization
Joonho Yang
Seunghyun Yoon
Hwan Chang
Byeongjeong Kim
Hwanhee Lee
HILM
672
4
0
21 May 2025
Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Jikai Wang
Zhenxu Tian
Jilong Li
Qingrong Xia
Xinyu Duan
Zhefeng Wang
Baoxing Huai
Min Zhang
364
5
0
19 May 2025
A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting
A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting
Lhuqita Fazry
VLM
535
0
0
11 May 2025
Rethinking Memory in LLM based Agents: Representations, Operations, and Emerging Topics
Rethinking Memory in LLM based Agents: Representations, Operations, and Emerging Topics
Yiming Du
Wenyu Huang
Danna Zheng
Zhaowei Wang
Sébastien Montella
Mirella Lapata
Kam-Fai Wong
Jeff Z. Pan
KELMMU
799
17
0
01 May 2025
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection
Deanna Emery
Michael Goitia
Freddie Vargus
Iulia Neagu
HILMVLM
465
2
0
01 May 2025
The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance
The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance
Jeremy D. Webb
Michael Bowman
Songpo Li
Xiaoli Zhang
387
5
0
04 Apr 2025
Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction
Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction
Ryan Dempsey
Jonathan Ethier
Halim Yanikomeroglu
308
5
0
04 Apr 2025
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
Taewon Yun
Jihwan Oh
Hyangsuk Min
Yuho Lee
Jihwan Bang
Jason (Jinglun) Cai
Hwanjun Song
OffRLLRM
263
3
0
27 Mar 2025
WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference
WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference
Youhui Zuo
Sibo Wei
C. Zhang
Zhuorui Liu
Sibo Wei
Dawei Song
VLM
471
1
0
23 Mar 2025
GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments
GPU-Accelerated Motion Planning of an Underactuated Forestry Crane in Cluttered Environments
M. Vu
Gerald Ebmer
Alexander Watcher
Marc-Philip Ecker
Giang Nguyen
Tobias Glueck
363
6
0
18 Mar 2025
A Survey on Transformer Context Extension: Approaches and Evaluation
A Survey on Transformer Context Extension: Approaches and Evaluation
Yijun Liu
Jinzheng Yu
Yang Xu
Zhongyang Li
Qingfu Zhu
LLMAG
585
15
0
17 Mar 2025
AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation
AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation
Yixiong Fang
Tianran Sun
Yuling Shi
Xiaodong Gu
504
8
0
13 Mar 2025
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jianghao Chen
Junhong Wu
Yangyifan Xu
J.N. Zhang
449
11
0
04 Mar 2025
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
Zhibin Lan
Liqiang Niu
Fandong Meng
Jie Zhou
Jinsong Su
VLM
370
1
0
04 Mar 2025
12345
Next
Page 1 of 5