2211.14928
Cited By
Class-based Quantization for Neural Networks
27 November 2022
Wenhao Sun
Grace Li Zhang
Huaxi Gu
Bing Li
Ulf Schlichtmann
MQ
Papers citing
"Class-based Quantization for Neural Networks"
5 / 5 papers shown
Title
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
Jingcun Wang
Yu-Guang Chen
Ing-Chao Lin
Bing Li
Grace Li Zhang
27
4
0
02 Oct 2024
LiveMind: Low-latency Large Language Models with Simultaneous Inference
Chuangtao Chen
Grace Li Zhang
Xunzhao Yin
Cheng Zhuo
Ulf Schlichtmann
Bing Li
LRM
21
3
0
20 Jun 2024
Early-Exit with Class Exclusion for Efficient Inference of Neural Networks
Jing Wang
Bing Li
Grace Li Zhang
9
4
0
23 Sep 2023
Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Kangwei Xu
Grace Li Zhang
Ulf Schlichtmann
Bing Li
17
2
0
19 Sep 2023
Computational and Storage Efficient Quadratic Neurons for Deep Neural Networks
Chuangtao Chen
Grace Li Zhang
Xunzhao Yin
Cheng Zhuo
Ulf Schlichtmann
Bing Li
6
0
0
10 Jun 2023