2408.09632
MoDeGPT: Modular Decomposition for Large Language Model Compression
19 August 2024
Chi-Heng Lin
Shangqian Gao
James Seale Smith
Abhishek Patel
Shikhar Tuli
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
Papers citing "MoDeGPT: Modular Decomposition for Large Language Model Compression"
4 / 4 papers shown
Title
Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation
Chuan-Wei Kuo
Siyu Chen
Chenqi Yan
Yu Liu
53
0
0
28 Mar 2025
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
Xin Wang
Samiul Alam
Zhongwei Wan
H. Shen
M. Zhang
MQ
50
0
0
16 Mar 2025
IteRABRe: Iterative Recovery-Aided Block Reduction
Haryo Akbarianto Wibowo
Haiyue Song
Hideki Tanaka
Masao Utiyama
Alham Fikri Aji
Raj Dabre
41
0
0
08 Mar 2025
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
Mohsen Gholami
Mohammad Akbari
Kevin Cannons
Yong Zhang
61
0
0
07 Mar 2025