Accelerating Attention through Gradient-Based Learned Runtime Pruning. International Symposium on Computer Architecture (ISCA), 2022
MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications. International Conference on Field-Programmable Logic and Applications (FPL), 2021