ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity

Main:11 Pages
22 Figures
Bibliography:2 Pages
Abstract

Transformers, composed of QKV generation, attention computation, and FFNs,
