Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the EdgeAAAI Conference on Artificial Intelligence (AAAI), 2023 |
Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of
Partitioned Edge LearningIEEE Transactions on Wireless Communications (TWC), 2020 |