
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Haihao Shen
Hengyu Meng
Bo Dong
Zhe Wang
Ofir Zafrir
Yi Ding
Yunqian Luo
Hanwen Chang
Qun Gao
Zi. Wang
Guy Boudoukh
Moshe Wasserblat
Papers citing "An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs"
1 / 1 papers shown