Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.04997
Cited By
v1
v2 (latest)
Adapting LLMs for Efficient Context Processing through Soft Prompt Compression
7 April 2024
Cangqing Wang
Yutian Yang
Ruisi Li
Dan Sun
Ruicong Cai
Yuzhu Zhang
Chengqian Fu
Lillian Floyd
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adapting LLMs for Efficient Context Processing through Soft Prompt Compression"
19 / 19 papers shown
Title
LLM-Driven Adaptive Source-Sink Identification and False Positive Mitigation for Static Analysis
Shiyin Lin
60
2
0
06 Nov 2025
Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises
Shiyin Lin
RALM
LRM
342
2
0
06 Nov 2025
Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback
Shiyin Lin
153
2
0
06 Nov 2025
JSPLIT: A Taxonomy-based Solution for Prompt Bloating in Model Context Protocol
Emanuele Antonioni
Stefan Markovic
Anirudha Shankar
Jaime Bernardo
Lovro Markovic
Silvia Pareti
Benedetto Proietti
LLMAG
72
0
0
16 Oct 2025
LongCodeZip: Compress Long Context for Code Language Models
Yuling Shi
Yichun Qian
Hongyu Zhang
Beijun Shen
Xiaodong Gu
132
1
0
01 Oct 2025
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Sikuan Yan
Xiufeng Yang
Zuchao Huang
Ercong Nie
Zifeng Ding
...
Hinrich Schutze
Volker Tresp
Yunpu Ma
Volker Tresp
Yunpu Ma
LLMAG
KELM
152
24
0
27 Aug 2025
Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Peiran Zhou
Junnan Zhu
Yichen Shen
Ruoxi Yu
RALM
104
0
0
26 Aug 2025
Mockingbird: How does LLM perform in general machine learning tasks?
Haoyu Jia
Yoshiki Obinata
Kento Kawaharazuka
Kei Okada
LLMAG
LRM
72
0
0
06 Aug 2025
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Kele Shao
Keda Tao
Kejia Zhang
Sicheng Feng
Mu Cai
Yuzhang Shang
Haoxuan You
Can Qin
Yang Sui
Huan Wang
489
10
0
27 Jul 2025
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
616
2
0
23 Apr 2025
Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor
Barys Liskavets
Shuvendu Roy
Maxim Ushakov
Mark Klibanov
Ali Etemad
Shane Luke
124
0
0
20 Feb 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
296
3
0
28 Jan 2025
CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction
Minghao Liu
Mingxiu Sui
Yi Nan
Cangqing Wang
Zhijie Zhou
201
12
0
05 Sep 2024
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference
AAAI Conference on Artificial Intelligence (AAAI), 2024
Barys Liskavets
Maxim Ushakov
Shuvendu Roy
Mark Klibanov
Ali Etemad
Shane Luke
315
29
0
02 Sep 2024
Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
Cangqing Wang
Mingxiu Sui
Dan Sun
Zecheng Zhang
Yan Zhou
176
39
0
22 May 2024
Enhancing user experience in large language models through human-centered design: Integrating theoretical insights with an experimental study to meet diverse software learning needs with a single document knowledge base
Yuchen Wang
Yin-Shan Lin
Ruixin Huang
Jinyin Wang
Sensen Liu
175
8
0
19 May 2024
Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding
Houze Liu
Chongqing Wang
Xiaoan Zhan
Haotian Zheng
Chang Che
153
7
0
13 May 2024
Exploring Diverse Methods in Visual Question Answering
Panfeng Li
Qikai Yang
Xieming Geng
Wenjing Zhou
Zhicheng Ding
Yi Nian
339
65
0
21 Apr 2024
Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs
Ngoc Quach
Qi Wang
Zijun Gao
Qifeng Sun
Bo Guan
Lillian Floyd
OffRL
GNN
118
14
0
19 Apr 2024
1