AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline

16 September 2017
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
    CVBM

Papers citing "AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline"

50 / 451 papers shown
Multi-mode Transformer Transducer with Stochastic Future Context
Interspeech (Interspeech), 2021
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
91
10
0
17 Jun 2021
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition
Xiong Wang
Sining Sun
Lei Xie
Long Ma
113
21
0
17 Jun 2021
Layer Pruning on Demand with Intermediate CTC
Jaesong Lee
Jingu Kang
Shinji Watanabe
129
21
0
17 Jun 2021
U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Di Wu
Binbin Zhang
Chao Yang
Zhendong Peng
Wenjing Xia
Xiaoyu Chen
X. Lei
218
55
0
10 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
293
901
0
08 Jun 2021
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
399
12
0
05 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Interspeech (Interspeech), 2021
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
147
13
0
02 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML SSL
257
37
0
01 Jun 2021
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Yichong Leng
Xu Tan
Linchen Zhu
Jin Xu
Renqian Luo
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
KELM
251
76
0
09 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Liqiang He
Shulin Feng
Jane Polak Scowcroft
Dong Yu
233
0
0
08 May 2021
Efficient conformer-based speech recognition with linear attention
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
194
26
0
14 Apr 2021
Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search
Yukun Liu
Ta Li
Pengyuan Zhang
Yonghong Yan
AI4TS
117
7
0
12 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
139
13
0
11 Apr 2021
Non-autoregressive Transformer-based End-to-end ASR using BERT
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Fu-Hao Yu
Kuan-Yu Chen
141
32
0
10 Apr 2021
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
Fan Yu
Haoneng Luo
Pengcheng Guo
Yuhao Liang
Zhuoyuan Yao
Lei Xie
Yingying Gao
Leijing Hou
Shilei Zhang
105
14
0
10 Apr 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Interspeech (Interspeech), 2021
Yihui Fu
Luyao Cheng
Shubo Lv
Yukai Jv
Yuxiang Kong
...
Jian Wu
Hui Bu
Xin Xu
Jun Du
Jingdong Chen
320
134
0
08 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
142
18
0
08 Apr 2021
Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture Search For End-to-End ASR
Xian Shi
Pan Zhou
Wei Chen
Lei Xie
149
19
0
07 Apr 2021
Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
Interspeech (Interspeech), 2021
Jumon Nozaki
Tatsuya Komatsu
254
87
0
06 Apr 2021
Extremely Low Footprint End-to-End ASR System for Smart Device
Interspeech (Interspeech), 2021
Zhifu Gao
Yiwu Yao
Shiliang Zhang
Jun Yang
Ming Lei
Ian Mcloughlin
104
14
0
06 Apr 2021
Non-autoregressive Mandarin-English Code-switching Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Shun-Po Chuang
Heng-Jui Chang
Sung-Feng Huang
Hung-yi Lee
230
16
0
06 Apr 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
149
12
0
02 Apr 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
Interspeech (Interspeech), 2021
Helin Wang
Bo Wu
Lianwu Chen
Meng Yu
Jianwei Yu
Yong-mei Xu
Shi-Xiong Zhang
Chao Weng
Jane Polak Scowcroft
Dong Yu
147
9
0
31 Mar 2021
MediaSpeech: Multilanguage ASR Benchmark and Dataset
Rostislav Kolobov
Olga Okhapkina
Olga Omelchishina
A. Platunov
Roman Bedyakin
V. Moshkin
Dmitry Menshikov
N. Mikhaylovskiy
122
28
0
30 Mar 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention
Interspeech (Interspeech), 2021
Chen Liang
Menglong Xu
Xiao-Lei Zhang
191
9
0
29 Mar 2021
Mutually-Constrained Monotonic Multihead Attention for Online ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jae-gyun Song
Hajin Shim
Eunho Yang
80
0
0
26 Mar 2021
BART based semantic correction for Mandarin automatic speech recognition system
Interspeech (Interspeech), 2021
Yun Zhao
Xuerui Yang
Jinchao Wang
Yongyu Gao
Chao Yan
Yuanfu Zhou
VLM
145
35
0
26 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
143
32
0
19 Mar 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems
Applied Soft Computing (Appl Soft Comput), 2021
Yi Lin
Bo Yang
Linchao Li
Dongyue Guo
Jianwei Zhang
Hu Chen
Yi Zhang
159
32
0
17 Feb 2021
Improving speech recognition models with small samples for air traffic control systems
Neurocomputing, 2021
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
182
33
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Ye Bai
Jiangyan Yi
Jianhua Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
208
59
0
15 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jaesong Lee
Shinji Watanabe
251
157
0
05 Feb 2021
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shengkui Zhao
Hao Wang
Trung Hieu Nguyen
B. Ma
128
21
0
03 Feb 2021
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2021
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
355
307
0
02 Feb 2021
Speech Recognition by Simply Fine-tuning BERT
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wen-Chin Huang
Chia-Hua Wu
Shang-Bao Luo
Kuan-Yu Chen
Hsin-Min Wang
Tomoki Toda
255
32
0
30 Jan 2021
A phonetic model of non-native spoken word processing
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Yevgen Matusevych
Herman Kamper
Thomas Schatz
Naomi H Feldman
Sharon Goldwater
273
8
0
27 Jan 2021
Interspeech 2021 Deep Noise Suppression Challenge
Interspeech (Interspeech), 2021
Chandan K. A. Reddy
Harishchandra Dubey
K. Koishida
A. Nair
Vishak Gopal
Ross Cutler
Sebastian Braun
H. Gamper
R. Aichner
Sriram Srinivasan
AI4CE
415
189
0
06 Jan 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
138
8
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
Speech Communication (Speech Commun.), 2020
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
Tianshi Zheng
Dong Wang
209
142
0
23 Dec 2020
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition
Binbin Zhang
Di Wu
Zhuoyuan Yao
Xiong Wang
F. Yu
Chao Yang
Liyong Guo
Yaguang Hu
Lei Xie
X. Lei
246
86
0
10 Dec 2020
Transformer-Transducers for Code-Switched Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Siddharth Dalmia
Yuzong Liu
S. Ronanki
Katrin Kirchhoff
222
50
0
30 Nov 2020
Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter
Spoken Language Technology Workshop (SLT), 2020
Xiong Wang
Zhuoyuan Yao
Xian Shi
Lei Xie
131
34
0
17 Nov 2020
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition
Cunhang Fan
Jiangyan Yi
Jianhua Tao
Zhengkun Tian
Bin Liu
Zhengqi Wen
122
87
0
09 Nov 2020
Stochastic Attention Head Removal: A simple and effective method for improving Transformer Based ASR Models
Shucong Zhang
Erfan Loweimi
P. Bell
Steve Renals
215
0
0
08 Nov 2020
IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
Yihui Fu
Zhuoyuan Yao
Weipeng He
Jian Wu
Xiong Wang
...
Lei Xie
Dongyan Huang
Hui Bu
P. Motlícek
J. Odobez
143
3
0
04 Nov 2020
Improving RNN transducer with normalized jointer network
Mingkun Huang
Jun Zhang
Meng Cai
Yang Zhang
Jiali Yao
Yongbin You
Yi He
Zejun Ma
203
9
0
03 Nov 2020
Training Wake Word Detection with Synthesized Speech Data on Confusion Words
Yan Jia
Zexin Cai
Murong Ma
Zeqing Zhao
Xuyang Wang
Junjie Wang
Ming Li
95
3
0
03 Nov 2020
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Xingcheng Song
Zhiyong Wu
Yiheng Huang
Chao Weng
Jane Polak Scowcroft
Helen Meng
168
40
0
28 Oct 2020
INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices
Yiwu Yao
Yuchao Li
Chengyu Wang
Tianhang Yu
Houjiang Chen
...
Jun Yang
Yanjie Liang
Jialin Li
Hui Shu
Chengfei Lv
MQ
159
8
0
28 Oct 2020
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
146
42
0
28 Oct 2020