Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.06909
Cited By
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
13 June 2021
Guoguo Chen
Shuzhou Chai
Guan-Bo Wang
Jiayu Du
Weiqiang Zhang
Chao Weng
Dan Su
Daniel Povey
J. Trmal
Junbo Zhang
Mingjie Jin
Sanjeev Khudanpur
Shinji Watanabe
Shuaijiang Zhao
Wei Zou
Xiangang Li
Xuchen Yao
Yongqing Wang
Yujun Wang
Zhao You
Zhiyong Yan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio"
7 / 257 papers shown
Title
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
17
217
0
07 Oct 2021
Scalable Data Annotation Pipeline for High-Quality Large Speech Datasets Development
Mingkuan Liu
Chi Zhang
Hua Xing
C. Feng
Mon-Chu Chen
Judith Bishop
Grace Ngapo
19
3
0
01 Sep 2021
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
13
21
0
30 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
14
18
0
02 Jul 2021
Semantic-WER: A Unified Metric for the Evaluation of ASR Transcript for End Usability
Somnath Roy
11
8
0
03 Jun 2021
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Patrick K. O’Neill
Vitaly Lavrukhin
Somshubra Majumdar
Vahid Noroozi
Yuekai Zhang
...
Keenan Freyberg
Michael D. Shulman
Boris Ginsburg
Shinji Watanabe
Georg Kucsko
AI4TS
18
59
0
05 Apr 2021
Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition
Chao Weng
Chengzhu Yu
Jia Cui
Chunlei Zhang
Dong Yu
69
39
0
28 Nov 2019
Previous
1
2
3
4
5
6