2206.07882
Cited By
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
16 June 2022
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Swagath Venkataramani
G. Saon
Xiaodong Cui
Brian Kingsbury
K. Gopalakrishnan
MQ
Papers citing
"Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization"
2 / 2 papers shown
Title
Alignment-Free Training for Transducer-based Multi-Talker ASR
Takafumi Moriya
Shota Horiguchi
Marc Delcroix
Ryo Masumura
Takanori Ashihara
Hiroshi Sato
Kohei Matsuura
Masato Mimura
31
1
0
30 Sep 2024
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization
Edward Fish
Umberto Michieli
Mete Ozay
MQ
22
4
0
24 Jul 2023