Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.11057
Cited By
KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models
17 September 2024
Bo Lv
Quan Zhou
Xuanang Ding
Yan Wang
Zeming Ma
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models"
3 / 3 papers shown
Title
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
Xingrun Xing
Zheng Liu
Shitao Xiao
Boyan Gao
Yiming Liang
Wanpeng Zhang
Haokun Lin
Guoqi Li
Jiajun Zhang
LRM
64
1
0
10 Feb 2025
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
Bo-Kyeong Kim
Geonmin Kim
Tae-Ho Kim
Thibault Castells
Shinkook Choi
Junho Shin
Hyoung-Kyu Song
62
30
0
05 Feb 2024
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
1