Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.14380
Cited By
Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection
27 March 2022
Xin Huang
A. Khetan
Rene Bidart
Zohar S. Karnin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pyramid-BERT: Reducing Complexity via Successive Core-set based Token Selection"
12 / 12 papers shown
Title
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
Qizhe Zhang
Aosong Cheng
Ming Lu
Zhiyong Zhuo
Minqi Wang
Jiajun Cao
Shaobo Guo
Qi She
Shanghang Zhang
VLM
88
11
0
02 Dec 2024
Self-Satisfied: An end-to-end framework for SAT generation and prediction
Christopher R. Serrano
Jonathan Gallagher
Kenji Yamada
Alexei Kopylov
Michael A. Warren
21
0
0
18 Oct 2024
Rethinking Token Reduction for State Space Models
Zheng Zhan
Yushu Wu
Zhenglun Kong
Changdi Yang
Yifan Gong
Xuan Shen
Xue Lin
Pu Zhao
Yanzhi Wang
Mamba
25
4
0
16 Oct 2024
Finch: Prompt-guided Key-Value Cache Compression
Giulio Corallo
Paolo Papotti
33
3
0
31 Jul 2024
LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Jieneng Chen
Luoxin Ye
Ju He
Zhao-Yang Wang
Daniel Khashabi
Alan Yuille
VLM
19
5
0
28 Jun 2024
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Suyu Ge
Yunan Zhang
Liyuan Liu
Minjia Zhang
Jiawei Han
Jianfeng Gao
4
213
0
03 Oct 2023
DiffusionNER: Boundary Diffusion for Named Entity Recognition
Yongliang Shen
Kaitao Song
Xuejiao Tan
Dongsheng Li
Weiming Lu
Y. Zhuang
DiffM
45
33
0
22 May 2023
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Zhanpeng Zeng
Cole Hawkins
Min-Fong Hong
Aston Zhang
Nikolaos Pappas
Vikas Singh
Shuai Zheng
11
6
0
07 May 2023
Centroid Transformers: Learning to Abstract with Attention
Lemeng Wu
Xingchao Liu
Qiang Liu
3DPC
50
27
0
17 Feb 2021
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
1,982
0
28 Jul 2020
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
214
571
0
12 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
1