Structural Pruning via Latency-Saliency Knapsack | Neural Information Processing Systems (NeurIPS), 2022 |
A Fast Post-Training Pruning Framework for Transformers | Neural Information Processing Systems (NeurIPS), 2022 |
Dynamic ConvNets on Tiny Devices via Nested Sparsity | IEEE Internet of Things Journal (IEEE IoT J.), 2022 |
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets | International Conference on Machine Learning (ICML), 2022 |
Neural Architecture Search as Program Transformation Exploration | International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021 |
Accelerating convolutional neural network by exploiting sparsity on GPUs | ACM Transactions on Architecture and Code Optimization (TACO), 2019 |