Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.04997
Cited By
v1
v2 (latest)
Adapting LLMs for Efficient Context Processing through Soft Prompt Compression
7 April 2024
Cangqing Wang
Yutian Yang
Ruisi Li
Dan Sun
Ruicong Cai
Yuzhu Zhang
Chengqian Fu
Lillian Floyd
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adapting LLMs for Efficient Context Processing through Soft Prompt Compression"
19 / 19 papers shown
LLM-Driven Adaptive Source-Sink Identification and False Positive Mitigation for Static Analysis
Shiyin Lin
80
3
0
06 Nov 2025
Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises
Shiyin Lin
RALM
LRM
366
3
0
06 Nov 2025
Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback
Shiyin Lin
178
2
0
06 Nov 2025
JSPLIT: A Taxonomy-based Solution for Prompt Bloating in Model Context Protocol
Emanuele Antonioni
Stefan Markovic
Anirudha Shankar
Jaime Bernardo
Lovro Markovic
Silvia Pareti
Benedetto Proietti
LLMAG
84
0
0
16 Oct 2025
LongCodeZip: Compress Long Context for Code Language Models
Yuling Shi
Yichun Qian
Hongyu Zhang
Beijun Shen
Xiaodong Gu
144
4
0
01 Oct 2025
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Sikuan Yan
Xiufeng Yang
Zuchao Huang
Ercong Nie
Zifeng Ding
...
Volker Tresp
Yunpu Ma
Volker Tresp
Yunpu Ma
Yunpu Ma
LLMAG
KELM
205
35
0
27 Aug 2025
Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
Peiran Zhou
Junnan Zhu
Yichen Shen
Ruoxi Yu
RALM
112
0
0
26 Aug 2025
Mockingbird: How does LLM perform in general machine learning tasks?
Haoyu Jia
Yoshiki Obinata
Kento Kawaharazuka
Kei Okada
LLMAG
LRM
92
0
0
06 Aug 2025
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Kele Shao
Keda Tao
Kejia Zhang
Sicheng Feng
Mu Cai
Yuzhang Shang
Haoxuan You
Can Qin
Yang Sui
Huan Wang
501
10
0
27 Jul 2025
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Lizhe Chen
Binjia Zhou
Yuyao Ge
Jiayi Chen
Shiguang NI
636
2
0
23 Apr 2025
Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor
Barys Liskavets
Shuvendu Roy
Maxim Ushakov
Mark Klibanov
Ali Etemad
Shane Luke
124
0
0
20 Feb 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
312
3
0
28 Jan 2025
CA-BERT: Leveraging Context Awareness for Enhanced Multi-Turn Chat Interaction
Minghao Liu
Mingxiu Sui
Yi Nan
Cangqing Wang
Zhijie Zhou
209
12
0
05 Sep 2024
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference
AAAI Conference on Artificial Intelligence (AAAI), 2024
Barys Liskavets
Maxim Ushakov
Shuvendu Roy
Mark Klibanov
Ali Etemad
Shane Luke
335
31
0
02 Sep 2024
Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
Cangqing Wang
Mingxiu Sui
Dan Sun
Zecheng Zhang
Yan Zhou
216
39
0
22 May 2024
Enhancing user experience in large language models through human-centered design: Integrating theoretical insights with an experimental study to meet diverse software learning needs with a single document knowledge base
Yuchen Wang
Yin-Shan Lin
Ruixin Huang
Jinyin Wang
Sensen Liu
183
8
0
19 May 2024
Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding
Houze Liu
Chongqing Wang
Xiaoan Zhan
Haotian Zheng
Chang Che
173
7
0
13 May 2024
Exploring Diverse Methods in Visual Question Answering
Panfeng Li
Qikai Yang
Xieming Geng
Wenjing Zhou
Zhicheng Ding
Yi Nian
363
65
0
21 Apr 2024
Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs
Ngoc Quach
Qi Wang
Zijun Gao
Qifeng Sun
Bo Guan
Lillian Floyd
OffRL
GNN
134
14
0
19 Apr 2024
1