All Papers
0 / 0 papers shown
Title |
|---|
Title |
|---|

Title |
|---|
![]() MesonGS: Post-training Compression of 3D Gaussians via Efficient
Attribute TransformationEuropean Conference on Computer Vision (ECCV), 2024 |
![]() Cherry on Top: Parameter Heterogeneity and Quantization in Large
Language ModelsNeural Information Processing Systems (NeurIPS), 2024 |
![]() Optimize Weight Rounding via Signed Gradient Descent for the
Quantization of LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() A Survey on Model Compression for Large Language ModelsTransactions of the Association for Computational Linguistics (TACL), 2023 |
![]() AWQ: Activation-aware Weight Quantization for LLM Compression and
AccelerationConference on Machine Learning and Systems (MLSys), 2023 |