ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.05522
  4. Cited By
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech
  Recognition Baseline

AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline

16 September 2017
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
    CVBM
ArXiv (abs)PDFHTML

Papers citing "AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline"

50 / 451 papers shown
Integrating Lattice-Free MMI into End-to-End Speech Recognition
Integrating Lattice-Free MMI into End-to-End Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
283
10
0
29 Mar 2022
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
WeNet 2.0: More Productive End-to-End Speech Recognition ToolkitInterspeech (Interspeech), 2022
Binbin Zhang
Di Wu
Zhendong Peng
Xingcheng Song
Zhuoyuan Yao
Hang Lv
Linfu Xie
Chao Yang
Fuping Pan
Jianwei Niu
VLM
274
127
0
29 Mar 2022
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASRInternational Conference on Neural Information Processing (ICONIP), 2022
Fangyuan Wang
Bo Xu
158
5
0
29 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under
  Unseen Conditions
Analyzing Language-Independent Speaker Anonymization Framework under Unseen ConditionsInterspeech (Interspeech), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
131
15
0
28 Mar 2022
Disentangleing Content and Fine-grained Prosody Information via Hybrid
  ASR Bottleneck Features for Voice Conversion
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice ConversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Xintao Zhao
Feng Liu
Changhe Song
Zhiyong Wu
Shiyin Kang
Deyi Tuo
Helen Meng
165
29
0
24 Mar 2022
Variational Auto-Encoder based Mandarin Speech Cloning
Variational Auto-Encoder based Mandarin Speech Cloning
Qingyu Xing
Xiaohan Ma
177
0
0
06 Mar 2022
Language-Independent Speaker Anonymization Approach using
  Self-Supervised Pre-Trained Models
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained ModelsThe Speaker and Language Recognition Workshop (Odyssey), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
354
36
0
26 Feb 2022
Improving CTC-based speech recognition via knowledge transferring from
  pre-trained language models
Improving CTC-based speech recognition via knowledge transferring from pre-trained language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
Gaofeng Cheng
Ji Xu
Pengyuan Zhang
132
32
0
22 Feb 2022
AISHELL-NER: Named Entity Recognition from Chinese Speech
AISHELL-NER: Named Entity Recognition from Chinese SpeechIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Boli Chen
Guangwei Xu
Xiaobin Wang
Pengjun Xie
Meishan Zhang
Fei Huang
126
39
0
17 Feb 2022
ADD 2022: the First Audio Deep Synthesis Detection Challenge
ADD 2022: the First Audio Deep Synthesis Detection ChallengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiangyan Yi
Ruibo Fu
Jianhua Tao
Shuai Nie
Haoxin Ma
...
Le Xu
Zhengqi Wen
Haizhou Li
Zheng Lian
Bin Liu
249
235
0
17 Feb 2022
Run-and-back stitch search: novel block synchronous decoding for
  streaming encoder-decoder ASR
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
140
3
0
25 Jan 2022
Improving non-autoregressive end-to-end speech recognition with
  pre-trained acoustic and language models
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
194
29
0
25 Jan 2022
A Study of Transducer based End-to-End ASR with ESPnet: Architecture,
  Auxiliary Loss and Decoding Strategies
A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding StrategiesAutomatic Speech Recognition & Understanding (ASRU), 2021
Florian Boyer
Yusuke Shinohara
Takaaki Ishii
Hirofumi Inaguma
Shinji Watanabe
260
40
0
14 Jan 2022
Cross-Modal ASR Post-Processing System for Error Correction and
  Utterance Rejection
Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection
Jing Du
Shiliang Pu
Qinbo Dong
Chao Jin
Xin Qi
Dian Gu
Ru Wu
Hongwei Zhou
242
11
0
10 Jan 2022
Automatic Speech Recognition Datasets in Cantonese: A Survey and New
  Dataset
Automatic Speech Recognition Datasets in Cantonese: A Survey and New DatasetInternational Conference on Language Resources and Evaluation (LREC), 2022
Tiezheng Yu
Rita Frieske
Peng Xu
Samuel Cahyawijaya
Cheuk Tung Shadow Yiu
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
RALM
221
18
0
07 Jan 2022
Improving Mandarin End-to-End Speech Recognition with Word N-gram
  Language Model
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelIEEE Signal Processing Letters (SPL), 2022
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
179
14
0
06 Jan 2022
Generating Adversarial Samples For Training Wake-up Word Detection
  Systems Against Confusing Words
Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words
Haoxu Wang
Yan Jia
Zeqing Zhao
Xuyang Wang
Junjie Wang
Ming Li
AAML
191
2
0
01 Jan 2022
Integrating Knowledge in End-to-End Automatic Speech Recognition for
  Mandarin-English Code-Switching
Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-SwitchingInternational Conference on Asian Language Processing (IALP), 2019
Chia-Yu Li
Ngoc Thang Vu
150
12
0
19 Dec 2021
Improving Hybrid CTC/Attention End-to-end Speech Recognition with
  Pretrained Acoustic and Language Model
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
88
32
0
14 Dec 2021
Improving Code-switching Language Modeling with Artificially Generated
  Texts using Cycle-consistent Adversarial Networks
Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Chia-Yu Li
Ngoc Thang Vu
108
14
0
12 Dec 2021
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in
  Multi-turn Conversation
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
Holy Lovenia
Samuel Cahyawijaya
Genta Indra Winata
Peng Xu
Xu Yan
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
397
45
0
12 Dec 2021
Speaker Embedding-aware Neural Diarization for Flexible Number of
  Speakers with Textual Information
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information
Zhihao Du
Shiliang Zhang
Siqi Zheng
Weilong Huang
Ming Lei
BDL
187
2
0
28 Nov 2021
A Study on Decoupled Probabilistic Linear Discriminant Analysis
A Study on Decoupled Probabilistic Linear Discriminant Analysis
Ding Wang
Lantian Li
Hongzhi Yu
Dong Wang
91
0
0
24 Nov 2021
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Multi-Channel Multi-Speaker ASR Using 3D Spatial FeatureIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
146
17
0
22 Nov 2021
Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing
  Linguistic Information and Noisy Data
Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Zhu Li
Yuqing Zhang
Mengxi Nie
Ming Yan
Mengnan He
Ruixiong Zhang
Caixia Gong
137
3
0
15 Nov 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription
  Challenge
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
235
160
0
14 Oct 2021
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
SRU++: Pioneering Fast Recurrence with Attention for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jing Pan
Tao Lei
Kwangyoun Kim
Kyu Jeong Han
Shinji Watanabe
VLM
104
12
0
11 Oct 2021
An Exploration of Self-Supervised Pretrained Representations for
  End-to-End Speech Recognition
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021
Xuankai Chang
Takashi Maekaku
Pengcheng Guo
Jing Shi
Yen-Ju Lu
...
Tianzi Wang
Shu-Wen Yang
Yu Tsao
Hung-yi Lee
Shinji Watanabe
SSLAI4TS
171
86
0
09 Oct 2021
Data Augmentation with Locally-time Reversed Speech for Automatic Speech
  Recognition
Data Augmentation with Locally-time Reversed Speech for Automatic Speech Recognition
Si-Ioi Ng
Tan Lee
135
2
0
09 Oct 2021
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASRInterspeech (Interspeech), 2021
Hanjing Zhu
Li Wang
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
SSLVLM
250
10
0
09 Oct 2021
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
SCaLa: Supervised Contrastive Learning for End-to-End Speech RecognitionInterspeech (Interspeech), 2021
Li Fu
Xiaoxiao Li
Runyu Wang
Lu Fan
Zhengchen Zhang
Meng Chen
Youzheng Wu
Xiaodong He
SSL
153
3
0
08 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech
  Recognition
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
407
286
0
07 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation
  of Hidden-unit BERT
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
610
202
0
05 Oct 2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for
  Automatic Speech Recognition
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng
Xu Tan
Rui Wang
Linchen Zhu
Jin Xu
...
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
283
45
0
29 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning
  for Low-Resource Speech Recognition
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
202
27
0
19 Sep 2021
Non-autoregressive Transformer with Unified Bidirectional Decoder for
  Automatic Speech Recognition
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition
Chuan-Fei Zhang
Wenshu Fan
Tianren Zhang
Songlu Chen
Feng Chen
Xu-Cheng Yin
143
11
0
14 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection
  Fusion Module for Noise-robust ASR
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASRIEEE International Conference on Multimedia and Expo (ICME), 2021
Fu-An Chao
J. Hung
Berlin Chen
136
7
0
26 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer
  Models via Low-Rank Approximation
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
192
12
0
24 Aug 2021
Decoupling recognition and transcription in Mandarin ASR
Decoupling recognition and transcription in Mandarin ASR
Jiahong Yuan
Xingyu Cai
Dongji Gao
Renjie Zheng
Liang Huang
Kenneth Church
172
13
0
02 Aug 2021
Automatic recognition of suprasegmentals in speech
Automatic recognition of suprasegmentals in speech
Jiahong Yuan
Neville Ryant
Xingyu Cai
Kenneth Church
M. Liberman
128
13
0
02 Aug 2021
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition
  Experiments
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition ExperimentsInternational Conference on Speech and Computer (SPECOM), 2021
M. Musaev
Saida Mussakhojayeva
Ilyos Khujayorov
Yerbolat Khassanov
M. Ochilov
H. A. Varol
75
23
0
30 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency
  Domain Features and a Pre-trained Acoustic Model
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic ModelAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
231
0
0
23 Jul 2021
Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Streaming End-to-End ASR based on Blockwise Non-Autoregressive ModelsInterspeech (Interspeech), 2021
Tianzi Wang
Yuya Fujita
Xuankai Chang
Shinji Watanabe
194
17
0
20 Jul 2021
Multi-Task Audio Source Separation
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
134
13
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position
  Embedding
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
200
9
0
13 Jul 2021
Multilingual and crosslingual speech recognition using
  phonological-vector based phone embeddings
Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Chengrui Zhu
Keyu An
Huahuan Zheng
Zhijian Ou
205
10
0
11 Jul 2021
The HCCL Speaker Verification System for Far-Field Speaker Verification
  Challenge
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge
Zhuo Li
Ce Fang
Runqiu Xiao
Zhigao Chen
Wenchao Wang
Yonghong Yan
125
2
0
03 Jul 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
344
435
0
29 Jun 2021
SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement
  Challenge for Video Conferencing
SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing
R. Raj
Rohit Kumar
M. Jayesh
Anurenjan Purushothaman
Sriram Ganapathy
Basha Shaik
85
2
0
24 Jun 2021
An Improved Single Step Non-autoregressive Transformer for Automatic
  Speech Recognition
An Improved Single Step Non-autoregressive Transformer for Automatic Speech RecognitionInterspeech (Interspeech), 2021
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
Abeer Alwan
238
20
0
18 Jun 2021
Previous
123...106789
Next