Memory- and Communication-Aware Model Compression for Distributed Deep
Learning Inference on IoTACM Transactions on Embedded Computing Systems (ACM TECS), 2019 |
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network
Inference On MicrocontrollersConference on Machine Learning and Systems (MLSys), 2019 |
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT
ApplicationsUSENIX workshop on Tackling computer systems problems with machine learning techniques (SysML), 2019 |
Efficient keyword spotting using dilated convolutions and gatingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018 |
Stochastic Adaptive Neural Architecture Search for Keyword SpottingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018 |