Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1709.05522
Cited By
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
16 September 2017
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
CVBM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline"
50 / 451 papers shown
Multi-mode Transformer Transducer with Stochastic Future Context
Interspeech (Interspeech), 2021
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
91
10
0
17 Jun 2021
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-EndSpeech Recognition
Xiong Wang
Sining Sun
Lei Xie
Long Ma
113
21
0
17 Jun 2021
Layer Pruning on Demand with Intermediate CTC
Jaesong Lee
Jingu Kang
Shinji Watanabe
129
21
0
17 Jun 2021
U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Di Wu
Binbin Zhang
Chao Yang
Zhendong Peng
Wenjing Xia
Xiaoyu Chen
X. Lei
218
55
0
10 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
293
901
0
08 Jun 2021
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
399
12
0
05 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Interspeech (Interspeech), 2021
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
147
13
0
02 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
SSL
257
37
0
01 Jun 2021
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Yichong Leng
Xu Tan
Linchen Zhu
Jin Xu
Renqian Luo
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
KELM
251
76
0
09 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Liqiang He
Shulin Feng
Jane Polak Scowcroft
Dong Yu
233
0
0
08 May 2021
Efficient conformer-based speech recognition with linear attention
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
194
26
0
14 Apr 2021
Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search
Yukun Liu
Ta Li
Pengyuan Zhang
Yonghong Yan
AI4TS
117
7
0
12 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
139
13
0
11 Apr 2021
Non-autoregressive Transformer-based End-to-end ASR using BERT
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Fu-Hao Yu
Kuan-Yu Chen
141
32
0
10 Apr 2021
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
Fan Yu
Haoneng Luo
Pengcheng Guo
Yuhao Liang
Zhuoyuan Yao
Lei Xie
Yingying Gao
Leijing Hou
Shilei Zhang
105
14
0
10 Apr 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Interspeech (Interspeech), 2021
Yihui Fu
Luyao Cheng
Shubo Lv
Yukai Jv
Yuxiang Kong
...
Jian Wu
Hui Bu
Xin Xu
Jun Du
Jingdong Chen
320
134
0
08 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
142
18
0
08 Apr 2021
Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture Search For End-to-End ASR
Xian Shi
Pan Zhou
Wei Chen
Lei Xie
149
19
0
07 Apr 2021
Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
Interspeech (Interspeech), 2021
Jumon Nozaki
Tatsuya Komatsu
254
87
0
06 Apr 2021
Extremely Low Footprint End-to-End ASR System for Smart Device
Interspeech (Interspeech), 2021
Zhifu Gao
Yiwu Yao
Shiliang Zhang
Jun Yang
Ming Lei
Ian Mcloughlin
104
14
0
06 Apr 2021
Non-autoregressive Mandarin-English Code-switching Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Shun-Po Chuang
Heng-Jui Chang
Sung-Feng Huang
Hung-yi Lee
230
16
0
06 Apr 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
149
12
0
02 Apr 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
Interspeech (Interspeech), 2021
Helin Wang
Bo Wu
Lianwu Chen
Meng Yu
Jianwei Yu
Yong-mei Xu
Shi-Xiong Zhang
Chao Weng
Jane Polak Scowcroft
Dong Yu
147
9
0
31 Mar 2021
MediaSpeech: Multilanguage ASR Benchmark and Dataset
Rostislav Kolobov
Olga Okhapkina
Olga Omelchishina
A. Platunov
Roman Bedyakin
V. Moshkin
Dmitry Menshikov
N. Mikhaylovskiy
122
28
0
30 Mar 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention
Interspeech (Interspeech), 2021
Chen Liang
Menglong Xu
Xiao-Lei Zhang
191
9
0
29 Mar 2021
Mutually-Constrained Monotonic Multihead Attention for Online ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jae-gyun Song
Hajin Shim
Eunho Yang
80
0
0
26 Mar 2021
BART based semantic correction for Mandarin automatic speech recognition system
Interspeech (Interspeech), 2021
Yun Zhao
Xuerui Yang
Jinchao Wang
Yongyu Gao
Chao Yan
Yuanfu Zhou
VLM
145
35
0
26 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
143
32
0
19 Mar 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems
Applied Soft Computing (Appl Soft Comput), 2021
Yi Lin
Bo Yang
Linchao Li
Dongyue Guo
Jianwei Zhang
Hu Chen
Yi Zhang
159
32
0
17 Feb 2021
Improving speech recognition models with small samples for air traffic control systems
Neurocomputing (Neurocomputing), 2021
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
182
33
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Ye Bai
Jiangyan Yi
Jianhua Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
208
59
0
15 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jaesong Lee
Shinji Watanabe
251
157
0
05 Feb 2021
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shengkui Zhao
Hao Wang
Trung Hieu Nguyen
B. Ma
128
21
0
03 Feb 2021
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2021
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
355
307
0
02 Feb 2021
Speech Recognition by Simply Fine-tuning BERT
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wen-Chin Huang
Chia-Hua Wu
Shang-Bao Luo
Kuan-Yu Chen
Hsin-Min Wang
Tomoki Toda
255
32
0
30 Jan 2021
A phonetic model of non-native spoken word processing
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Yevgen Matusevych
Herman Kamper
Thomas Schatz
Naomi H Feldman
Sharon Goldwater
273
8
0
27 Jan 2021
Interspeech 2021 Deep Noise Suppression Challenge
Interspeech (Interspeech), 2021
Chandan K. A. Reddy
Harishchandra Dubey
K. Koishida
A. Nair
Vishak Gopal
Ross Cutler
Sebastian Braun
H. Gamper
R. Aichner
Sriram Srinivasan
AI4CE
415
189
0
06 Jan 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
138
8
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
Speech Communication (Speech Commun.), 2020
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
Tianshi Zheng
Dong Wang
209
142
0
23 Dec 2020
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition
Binbin Zhang
Di Wu
Zhuoyuan Yao
Xiong Wang
F. Yu
Chao Yang
Liyong Guo
Yaguang Hu
Lei Xie
X. Lei
246
86
0
10 Dec 2020
Transformer-Transducers for Code-Switched Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Siddharth Dalmia
Yuzong Liu
S. Ronanki
Katrin Kirchhoff
222
50
0
30 Nov 2020
Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter
Spoken Language Technology Workshop (SLT), 2020
Xiong Wang
Zhuoyuan Yao
Xian Shi
Lei Xie
131
34
0
17 Nov 2020
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition
Cunhang Fan
Jiangyan Yi
Jianhua Tao
Zhengkun Tian
Bin Liu
Zhengqi Wen
122
87
0
09 Nov 2020
Stochastic Attention Head Removal: A simple and effective method for improving Transformer Based ASR Models
Shucong Zhang
Erfan Loweimi
P. Bell
Steve Renals
215
0
0
08 Nov 2020
IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
Yihui Fu
Zhuoyuan Yao
Weipeng He
Jian Wu
Xiong Wang
...
Lei Xie
Dongyan Huang
Hui Bu
P. Motlícek
J. Odobez
143
3
0
04 Nov 2020
Improving RNN transducer with normalized jointer network
Mingkun Huang
Jun Zhang
Meng Cai
Yang Zhang
Jiali Yao
Yongbin You
Yi He
Zejun Ma
203
9
0
03 Nov 2020
Training Wake Word Detection with Synthesized Speech Data on Confusion Words
Yan Jia
Zexin Cai
Murong Ma
Zeqing Zhao
Xuyang Wang
Junjie Wang
Ming Li
95
3
0
03 Nov 2020
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Xingcheng Song
Zhiyong Wu
Yiheng Huang
Chao Weng
Jane Polak Scowcroft
Helen Meng
168
40
0
28 Oct 2020
INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Yiwu Yao
Yuchao Li
Chengyu Wang
Tianhang Yu
Houjiang Chen
...
Jun Yang
Yanjie Liang
Jialin Li
Hui Shu
Chengfei Lv
MQ
159
8
0
28 Oct 2020
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
146
42
0
28 Oct 2020
Previous
1
2
3
...
10
7
8
9
Next