Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1709.05522
Cited By
AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
16 September 2017
Hui Bu
Jiayu Du
Xingyu Na
Bengu Wu
Hao Zheng
CVBM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline"
50 / 451 papers shown
Integrating Lattice-Free MMI into End-to-End Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
283
10
0
29 Mar 2022
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2022
Binbin Zhang
Di Wu
Zhendong Peng
Xingcheng Song
Zhuoyuan Yao
Hang Lv
Linfu Xie
Chao Yang
Fuping Pan
Jianwei Niu
VLM
274
127
0
29 Mar 2022
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
International Conference on Neural Information Processing (ICONIP), 2022
Fangyuan Wang
Bo Xu
158
5
0
29 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
Interspeech (Interspeech), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
131
15
0
28 Mar 2022
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Xintao Zhao
Feng Liu
Changhe Song
Zhiyong Wu
Shiyin Kang
Deyi Tuo
Helen Meng
165
29
0
24 Mar 2022
Variational Auto-Encoder based Mandarin Speech Cloning
Qingyu Xing
Xiaohan Ma
177
0
0
06 Mar 2022
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
The Speaker and Language Recognition Workshop (Odyssey), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
354
36
0
26 Feb 2022
Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
Gaofeng Cheng
Ji Xu
Pengyuan Zhang
132
32
0
22 Feb 2022
AISHELL-NER: Named Entity Recognition from Chinese Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Boli Chen
Guangwei Xu
Xiaobin Wang
Pengjun Xie
Meishan Zhang
Fei Huang
126
39
0
17 Feb 2022
ADD 2022: the First Audio Deep Synthesis Detection Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiangyan Yi
Ruibo Fu
Jianhua Tao
Shuai Nie
Haoxin Ma
...
Le Xu
Zhengqi Wen
Haizhou Li
Zheng Lian
Bin Liu
249
235
0
17 Feb 2022
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
140
3
0
25 Jan 2022
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
194
29
0
25 Jan 2022
A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies
Automatic Speech Recognition & Understanding (ASRU), 2021
Florian Boyer
Yusuke Shinohara
Takaaki Ishii
Hirofumi Inaguma
Shinji Watanabe
260
40
0
14 Jan 2022
Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection
Jing Du
Shiliang Pu
Qinbo Dong
Chao Jin
Xin Qi
Dian Gu
Ru Wu
Hongwei Zhou
242
11
0
10 Jan 2022
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
International Conference on Language Resources and Evaluation (LREC), 2022
Tiezheng Yu
Rita Frieske
Peng Xu
Samuel Cahyawijaya
Cheuk Tung Shadow Yiu
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
RALM
221
18
0
07 Jan 2022
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
IEEE Signal Processing Letters (SPL), 2022
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
179
14
0
06 Jan 2022
Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words
Haoxu Wang
Yan Jia
Zeqing Zhao
Xuyang Wang
Junjie Wang
Ming Li
AAML
191
2
0
01 Jan 2022
Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching
International Conference on Asian Language Processing (IALP), 2019
Chia-Yu Li
Ngoc Thang Vu
150
12
0
19 Dec 2021
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
88
32
0
14 Dec 2021
Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Chia-Yu Li
Ngoc Thang Vu
108
14
0
12 Dec 2021
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
Holy Lovenia
Samuel Cahyawijaya
Genta Indra Winata
Peng Xu
Xu Yan
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
397
45
0
12 Dec 2021
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information
Zhihao Du
Shiliang Zhang
Siqi Zheng
Weilong Huang
Ming Lei
BDL
187
2
0
28 Nov 2021
A Study on Decoupled Probabilistic Linear Discriminant Analysis
Ding Wang
Lantian Li
Hongzhi Yu
Dong Wang
91
0
0
24 Nov 2021
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
146
17
0
22 Nov 2021
Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Zhu Li
Yuqing Zhang
Mengxi Nie
Ming Yan
Mengnan He
Ruixiong Zhang
Caixia Gong
137
3
0
15 Nov 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
235
160
0
14 Oct 2021
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jing Pan
Tao Lei
Kwangyoun Kim
Kyu Jeong Han
Shinji Watanabe
VLM
104
12
0
11 Oct 2021
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Xuankai Chang
Takashi Maekaku
Pengcheng Guo
Jing Shi
Yen-Ju Lu
...
Tianzi Wang
Shu-Wen Yang
Yu Tsao
Hung-yi Lee
Shinji Watanabe
SSL
AI4TS
171
86
0
09 Oct 2021
Data Augmentation with Locally-time Reversed Speech for Automatic Speech Recognition
Si-Ioi Ng
Tan Lee
135
2
0
09 Oct 2021
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR
Interspeech (Interspeech), 2021
Hanjing Zhu
Li Wang
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
SSL
VLM
250
10
0
09 Oct 2021
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
Interspeech (Interspeech), 2021
Li Fu
Xiaoxiao Li
Runyu Wang
Lu Fan
Zhengchen Zhang
Meng Chen
Youzheng Wu
Xiaodong He
SSL
153
3
0
08 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
407
286
0
07 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
610
202
0
05 Oct 2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng
Xu Tan
Rui Wang
Linchen Zhu
Jin Xu
...
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
283
45
0
29 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
202
27
0
19 Sep 2021
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition
Chuan-Fei Zhang
Wenshu Fan
Tianren Zhang
Songlu Chen
Feng Chen
Xu-Cheng Yin
143
11
0
14 Sep 2021
Cross-domain Single-channel Speech Enhancement Model with Bi-projection Fusion Module for Noise-robust ASR
IEEE International Conference on Multimedia and Expo (ICME), 2021
Fu-An Chao
J. Hung
Berlin Chen
136
7
0
26 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
192
12
0
24 Aug 2021
Decoupling recognition and transcription in Mandarin ASR
Jiahong Yuan
Xingyu Cai
Dongji Gao
Renjie Zheng
Liang Huang
Kenneth Church
172
13
0
02 Aug 2021
Automatic recognition of suprasegmentals in speech
Jiahong Yuan
Neville Ryant
Xingyu Cai
Kenneth Church
M. Liberman
128
13
0
02 Aug 2021
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
International Conference on Speech and Computer (SPECOM), 2021
M. Musaev
Saida Mussakhojayeva
Ilyos Khujayorov
Yerbolat Khassanov
M. Ochilov
H. A. Varol
75
23
0
30 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
231
0
0
23 Jul 2021
Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Interspeech (Interspeech), 2021
Tianzi Wang
Yuya Fujita
Xuankai Chang
Shinji Watanabe
194
17
0
20 Jul 2021
Multi-Task Audio Source Separation
Lu Zhang
Chenxing Li
Feng Deng
Xiaorui Wang
134
13
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
200
9
0
13 Jul 2021
Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Chengrui Zhu
Keyu An
Huahuan Zheng
Zhijian Ou
205
10
0
11 Jul 2021
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge
Zhuo Li
Ce Fang
Runqiu Xiao
Zhigao Chen
Wenchao Wang
Yonghong Yan
125
2
0
03 Jul 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
344
435
0
29 Jun 2021
SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing
R. Raj
Rohit Kumar
M. Jayesh
Anurenjan Purushothaman
Sriram Ganapathy
Basha Shaik
85
2
0
24 Jun 2021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Interspeech (Interspeech), 2021
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
Abeer Alwan
238
20
0
18 Jun 2021
Previous
1
2
3
...
10
6
7
8
9
Next