
![]() Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model
CompressionInternational Conference on Learning Representations (ICLR), 2024 |
![]() Early-Exit with Class Exclusion for Efficient Inference of Neural
NetworksInternational Conference on Artificial Intelligence Circuits and Systems (ICAICS), 2023 |
![]() Logic Design of Neural Networks for High-Throughput and Low-Power
ApplicationsAsia and South Pacific Design Automation Conference (ASP-DAC), 2023 |
![]() Computational and Storage Efficient Quadratic Neurons for Deep Neural
NetworksDesign, Automation and Test in Europe (DATE), 2023 |