2206.11062
Answer Fast: Accelerating BERT on the Tensor Streaming Processor
22 June 2022
I. Ahmed
Sahil Parmar
Matthew Boyd
Michael Beidler
Kris Kang
Bill Liu
Kyle Roach
John Kim
D. Abts
LLMAG
Papers citing "Answer Fast: Accelerating BERT on the Tensor Streaming Processor"
4 / 4 papers shown
Title
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
46
1
0
28 Jan 2025
PyGen: A Collaborative Human-AI Approach to Python Package Creation
Saikat Barua
Mostafizur Rahman
Md Jafor Sadek
Rafiul Islam
Shehnaz Khaled
Md. Shohrab Hossain
44
1
0
13 Nov 2024
DGEMM on Integer Matrix Multiplication Unit
Hiroyuki Ootomo
K. Ozaki
Rio Yokota
9
12
0
21 Jun 2023
Accelerating Transformer Inference for Translation via Parallel Decoding
Andrea Santilli
Silvio Severino
Emilian Postolache
Valentino Maiorca
Michele Mancusi
R. Marin
Emanuele Rodolà
31
78
0
17 May 2023