arXiv:2312.09571
Extending Context Window of Large Language Models via Semantic Compression
15 December 2023
WeiZhi Fei
Xueyan Niu
Pingyi Zhou
Lu Hou
Bo Bai
Lei Deng
Wei Han
Papers citing "Extending Context Window of Large Language Models via Semantic Compression"
23 / 23 papers shown
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
46
0
0
23 Apr 2025
Understanding and Improving Information Preservation in Prompt Compression for LLMs
Weronika Łajewska
Momchil Hardalov
Laura Aina
Neha Anna John
Hang Su
Lluís Marquez
58
0
0
24 Mar 2025
OkraLong: A Flexible Retrieval-Augmented Framework for Long-Text Query Processing
Yulong Hui
Y. Liu
Yao Lu
Huanchen Zhang
RALM
121
0
0
04 Mar 2025
U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack
Yunfan Gao
Yun Xiong
Wenlong Wu
Zijing Huang
Bohan Li
H. Wang
52
3
0
01 Mar 2025
Lost in the Passage: Passage-level In-context Learning Does Not Necessarily Need a "Passage"
Hao Sun
Chenming Tang
Gengyang Li
Yunfang Wu
AIMat
42
0
0
15 Feb 2025
Can LLMs Maintain Fundamental Abilities under KV Cache Compression?
Xiang Liu
Zhenheng Tang
Hong Chen
Peijie Dong
Zeyu Li
Xiuze Zhou
Bo Li
Xuming Hu
Xiaowen Chu
83
3
0
04 Feb 2025
Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference
WeiZhi Fei
Xueyan Niu
Guoqing Xie
Yingqing Liu
Bo Bai
Wei Han
28
1
0
22 Jan 2025
Reducing Distraction in Long-Context Language Models by Focused Learning
Zijun Wu
Bingyuan Liu
Ran Yan
L. Chen
Thomas Delteil
RALM
26
2
0
08 Nov 2024
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Qitan Lv
Jie Wang
Hanzhu Chen
Bin Li
Yongdong Zhang
Feng Wu
HILM
17
3
0
19 Oct 2024
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey
Sourav Verma
RALM
3DV
19
2
0
20 Sep 2024
Schrodinger's Memory: Large Language Models
Wei Wang
Qing Li
29
1
0
16 Sep 2024
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu
Ziliang Pang
Min Xiao
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
29
17
0
08 Jul 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
67
13
0
20 Jun 2024
Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding
WeiZhi Fei
Xueyan Niu
Guoqing Xie
Yanhua Zhang
Bo Bai
Lei Deng
Wei Han
LRM
KELM
RALM
21
5
0
18 Jun 2024
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
Xueyan Niu
Bo Bai
Lei Deng
Wei Han
31
6
0
14 May 2024
An LLM-Tool Compiler for Fused Parallel Function Calling
Simranjit Singh
Andreas Karatzas
Michael Fore
Iraklis Anagnostopoulos
Dimitrios Stamoulis
LLMAG
24
6
0
07 May 2024
A Survey on Efficient Inference for Large Language Models
Zixuan Zhou
Xuefei Ning
Ke Hong
Tianyu Fu
Jiaming Xu
...
Shengen Yan
Guohao Dai
Xiao-Ping Zhang
Yuhan Dong
Yu-Xiang Wang
46
78
0
22 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
30
32
0
01 Apr 2024
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Xupeng Miao
Gabriele Oliaro
Zhihao Zhang
Xinhao Cheng
Hongyi Jin
Tianqi Chen
Zhihao Jia
53
75
0
23 Dec 2023
Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use
Yuhan Chen
Ang Lv
Ting-En Lin
C. Chen
Yuchuan Wu
Fei Huang
Yongbin Li
Rui Yan
18
24
0
07 Dec 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
28
53
0
21 Nov 2023
RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text
Wangchunshu Zhou
Yuchen Eleanor Jiang
Peng Cui
Tiannan Wang
Zhenxin Xiao
Yifan Hou
Ryan Cotterell
Mrinmaya Sachan
RALM
LLMAG
82
58
0
22 May 2023
Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs
Zihao W. Wang
Hang Yin
Yangqiu Song
24
29
0
18 Sep 2021