2304.08467
Cited By
Learning to Compress Prompts with Gist Tokens
17 April 2023
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
Papers citing "Learning to Compress Prompts with Gist Tokens"
50 / 164 papers shown
Title
Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval
Joao Coelho
Bruno Martins
João Magalhães
Jamie Callan
Chenyan Xiong
RALM
30
4
0
05 Apr 2024
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization
Tobias Schnabel
Jennifer Neville
LRM
29
6
0
02 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
69
49
0
02 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
30
32
0
01 Apr 2024
CoLLEGe: Concept Embedding Generation for Large Language Models
Ryan Teehan
Brenden Lake
Mengye Ren
33
3
0
22 Mar 2024
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Zhuoshi Pan
Qianhui Wu
Huiqiang Jiang
Menglin Xia
Xufang Luo
...
Yuqing Yang
Chin-Yew Lin
H. V. Zhao
Lili Qiu
Dongmei Zhang
VLM
44
88
0
19 Mar 2024
Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference
Piotr Nawrot
Adrian Łańcucki
Marcin Chochowski
David Tarjan
E. Ponti
30
50
0
14 Mar 2024
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
E. Zelikman
Georges Harik
Yijia Shao
Varuna Jayasiri
Nick Haber
Noah D. Goodman
LLMAG
ReLM
LRM
47
108
0
14 Mar 2024
Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference
Muhammad Adnan
Akhil Arunkumar
Gaurav Jain
Prashant J. Nair
Ilya Soloveychik
Purushotham Kamath
22
52
0
14 Mar 2024
MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning
Yichuan Li
Xiyao Ma
Sixing Lu
Kyumin Lee
Xiaohu Liu
Chenlei Guo
19
6
0
11 Mar 2024
Learning to Compress Prompt in Natural Language Formats
Yu-Neng Chuang
Tianwei Xing
Chia-Yuan Chang
Zirui Liu
Xun Chen
Xia Hu
26
16
0
28 Feb 2024
Investigating the Effectiveness of HyperTuning via Gisting
Jason Phang
38
0
0
26 Feb 2024
Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression
Xinze Li
Zhenghao Liu
Chenyan Xiong
Shi Yu
Yukun Yan
Shuo Wang
Ge Yu
VLM
36
4
0
25 Feb 2024
User-LLM: Efficient LLM Contextualization with User Embeddings
Lin Ning
Luyang Liu
Jiaxing Wu
Neo Wu
D. Berlowitz
Sushant Prakash
Bradley Green
S. O’Banion
Jun Xie
42
32
0
21 Feb 2024
CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory
Zexue He
Leonid Karlinsky
Donghyun Kim
Julian McAuley
Dmitry Krotov
Rogerio Feris
KELM
RALM
35
10
0
21 Feb 2024
Extensible Embedding: A Flexible Multipler For LLM's Context Length
Ninglu Shao
Shitao Xiao
Zheng Liu
Peitian Zhang
19
1
0
18 Feb 2024
Multi-word Tokenization for Sequence Compression
Leonidas Gee
Leonardo Rigutini
Marco Ernandes
Andrea Zugarini
18
8
0
15 Feb 2024
Anchor-based Large Language Models
Jianhui Pang
Fanghua Ye
Derek F. Wong
Xin He
Wanshun Chen
Longyue Wang
KELM
56
8
0
12 Feb 2024
CigaR: Cost-efficient Program Repair with LLMs
Dávid Hidvégi
K. Etemadi
Sofia Bobadilla
Martin Monperrus
15
20
0
09 Feb 2024
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Junhong Shen
Neil Tenenholtz
James Hall
David Alvarez-Melis
Nicolò Fusi
42
21
0
06 Feb 2024
Affordable Generative Agents
Yangbin Yu
Qin Zhang
Junyou Li
Qiang Fu
Deheng Ye
LLMAG
AI4CE
41
5
0
03 Feb 2024
LoMA: Lossless Compressed Memory Attention
Yumeng Wang
Zhenyang Xiao
14
3
0
16 Jan 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
Zongxin Yang
Guikun Chen
Xiaodi Li
Wenguan Wang
Yi Yang
LM&Ro
LLMAG
48
35
0
16 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
29
25
0
15 Jan 2024
Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization
Ninglu Shao
Shitao Xiao
Zheng Liu
Peitian Zhang
26
4
0
15 Jan 2024
User Embedding Model for Personalized Language Prompting
Sumanth Doddapaneni
Krishna Sayana
Ambarish Jash
Sukhdeep S. Sodhi
Dima Kuzmin
RALM
35
9
0
10 Jan 2024
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning
Xijie Huang
Li Lyna Zhang
Kwang-Ting Cheng
Fan Yang
Mao Yang
LRM
ReLM
24
8
0
14 Dec 2023
DiSK: A Diffusion Model for Structured Knowledge
O. Kitouni
Niklas Nolte
James Hensman
Bhaskar Mitra
DiffM
17
3
0
08 Dec 2023
Compressed Context Memory For Online Language Model Interaction
Jang-Hyun Kim
Junyoung Yeom
Sangdoo Yun
Hyun Oh Song
KELM
34
14
1
06 Dec 2023
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin
Abhilasha Ravichander
Ximing Lu
Nouha Dziri
Melanie Sclar
Khyathi Raghavi Chandu
Chandra Bhagavatula
Yejin Choi
17
164
0
04 Dec 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
28
54
0
21 Nov 2023
GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks
Shivanshu Gupta
Clemens Rosenbaum
Ethan R. Elenberg
LRM
32
6
0
16 Nov 2023
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Fei Wu
Jiwei Li
Tianwei Zhang
Guoyin Wang
27
16
0
03 Nov 2023
Large Language Models are Visual Reasoning Coordinators
Liangyu Chen
Bo Li
Sheng Shen
Jingkang Yang
Chunyuan Li
Kurt Keutzer
Trevor Darrell
Ziwei Liu
VLM
LRM
34
47
0
23 Oct 2023
Function Vectors in Large Language Models
Eric Todd
Millicent Li
Arnab Sen Sharma
Aaron Mueller
Byron C. Wallace
David Bau
8
99
0
23 Oct 2023
Context Compression for Auto-regressive Transformers with Sentinel Tokens
Siyu Ren
Qi Jia
Kenny Q. Zhu
11
11
0
12 Oct 2023
CacheGen: KV Cache Compression and Streaming for Fast Language Model Serving
Yuhan Liu
Hanchen Li
Yihua Cheng
Siddhant Ray
Yuyang Huang
...
Ganesh Ananthanarayanan
Michael Maire
Henry Hoffmann
Ari Holtzman
Junchen Jiang
50
41
0
11 Oct 2023
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
Huiqiang Jiang
Qianhui Wu
Xufang Luo
Dongsheng Li
Chin-Yew Lin
Yuqing Yang
Lili Qiu
RALM
101
182
0
10 Oct 2023
Compressing Context to Enhance Inference Efficiency of Large Language Models
Yucheng Li
Bo Dong
Chenghua Lin
Frank Guerin
17
54
0
09 Oct 2023
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang
Qianhui Wu
Chin-Yew Lin
Yuqing Yang
Lili Qiu
24
100
0
09 Oct 2023
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Page-Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
31
2
0
09 Oct 2023
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation
Fangyuan Xu
Weijia Shi
Eunsol Choi
RALM
22
143
0
06 Oct 2023
EcoAssistant: Using LLM Assistant More Affordably and Accurately
Jieyu Zhang
Ranjay Krishna
Ahmed Hassan Awadallah
Chi Wang
30
33
0
03 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
19
7
0
03 Oct 2023
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Suyu Ge
Yunan Zhang
Liyuan Liu
Minjia Zhang
Jiawei Han
Jianfeng Gao
4
215
0
03 Oct 2023
Discrete Prompt Compression with Reinforcement Learning
Hoyoun Jung
Kyung-Joong Kim
19
24
0
17 Aug 2023
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
18
8
0
21 Jul 2023
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models
Zhiyuan Peng
Xuyang Wu
Qifan Wang
Yihan Fang
VLM
RALM
38
11
0
17 Jul 2023
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Wei Wei
Tingbo Hou
Yael Pritch
Neal Wadhwa
Michael Rubinstein
Kfir Aberman
DiffM
25
172
0
13 Jul 2023
In-context Autoencoder for Context Compression in a Large Language Model
Tao Ge
Jing Hu
Lei Wang
Xun Wang
Si-Qing Chen
Furu Wei
RALM
32
66
0
13 Jul 2023