
Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention
Papers citing "Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention"
10 / 10 papers shown
Title |
---|
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation. Ningxin Zheng, Huiqiang Jiang, Quan Zhang, Zhenhua Han, Yuqing Yang, ..., Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou |