arXiv:1512.02595
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 1,096 papers shown
Title
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
321
279
0
07 Oct 2021
Back from the future: bidirectional CTC decoding using future information in speech recognition
Namkyu Jung
Geon-min Kim
Han-Gyu Kim
207
3
0
07 Oct 2021
BERT Attends the Conversation: Improving Low-Resource Conversational ASR
Pablo Ortiz
Simen Burud
116
5
0
05 Oct 2021
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems
J. C. Duarte
S. Colcher
54
4
0
04 Oct 2021
Adversarial Regression with Doubly Non-negative Weighting Matrices
Tam Le
Truyen V. Nguyen
M. Yamada
Jose H. Blanchet
Viet Anh Nguyen
185
5
0
30 Sep 2021
VoxCeleb Enrichment for Age and Gender Recognition
Khaled Hechmi
Trung Ngo Trong
Ville Hautamaki
Tomi Kinnunen
168
37
0
28 Sep 2021
DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning
Tongan Cai
Haomiao Ni
Ming-Chieh Yu
Xiaolei Huang
K. Wong
John Volpi
Chao Guo
Stephen T. C. Wong
143
25
0
24 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
170
27
0
19 Sep 2021
Enforcing fairness in private federated learning via the modified method of differential multipliers
Borja Rodríguez Gálvez
Filip Granqvist
Rogier van Dalen
M. Seigel
FedML
199
58
0
17 Sep 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Desh Raj
Liang Lu
Zhuo Chen
Yashesh Gaur
Jinyu Li
107
19
0
17 Sep 2021
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Chen Zhang
Jiaxing Yu
Luchin Chang
Xu Tan
Jiawei Chen
Tao Qin
Kecheng Zhang
127
16
0
16 Sep 2021
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition
Felix Wu
Kwangyoun Kim
Jing Pan
Kyu Jeong Han
Kilian Q. Weinberger
Yoav Artzi
157
82
0
14 Sep 2021
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing Deep Neural Networks for Wearables
IEEE Internet of Things Journal (IEEE IoT Journal), 2021
B. Prabakaran
Asima Akhtar
Semeen Rehman
Osman Hasan
Mohamed Bennai
97
11
0
07 Sep 2021
SEC4SR: A Security Analysis Platform for Speaker Recognition
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Yang Liu
AAML
139
13
0
04 Sep 2021
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory
ACM Multimedia (ACM MM), 2021
Zhijie Lin
Zhou Zhao
Haoyuan Li
Jinglin Liu
Meng Zhang
Xingshan Zeng
Xiaofei He
121
18
0
31 Aug 2021
Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring
International Conference on Information and Knowledge Management (CIKM), 2021
Yaman Kumar Singla
Avykat Gupta
Shaurya Bagga
Changyou Chen
Balaji Krishnamurthy
R. Shah
164
15
0
30 Aug 2021
CrossedWires: A Dataset of Syntactically Equivalent but Semantically Disparate Deep Learning Models
Max Zvyagin
Thomas Brettin
Arvind Ramanathan
Sumit Kumar Jha
104
1
0
29 Aug 2021
Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers
Interspeech (Interspeech), 2021
Juntae Kim
Jee-Hye Lee
178
8
0
22 Aug 2021
Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Neural Processing Letters (NPL), 2021
Arash Dehghani
Seyyed Ali Seyyedsalehi
195
1
0
09 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
156
51
0
08 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Interspeech (Interspeech), 2021
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
157
29
0
05 Aug 2021
Imperceptible Adversarial Examples by Spatial Chroma-Shift
A. Aydin
Deniz Sen
Berat Tuna Karli
Oguz Hanoglu
A. Temizel
AAML
137
18
0
05 Aug 2021
Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-Temporal Sparsity
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Chang Gao
T. Delbruck
Shih-Chii Liu
273
52
0
04 Aug 2021
Transformer-based Map Matching Model with Limited Ground-Truth Data using Transfer-Learning Approach
Zhixiong Jin
Jiwon Kim
H. Yeo
Seongjin Choi
203
33
0
01 Aug 2021
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
205
24
0
30 Jul 2021
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
International Conference on Computational Processing of the Portuguese Language (PROPOR), 2021
L. Gris
Edresson Casanova
F. S. Oliveira
A. S. Soares
A. Júnior
148
22
0
23 Jul 2021
Semantic Communications for Speech Recognition
Global Communications Conference (GLOBECOM), 2021
Zhenzi Weng
Zhijin Qin
Geoffrey Ye Li
145
40
0
22 Jul 2021
CREW: Computation Reuse and Efficient Weight Storage for Hardware-accelerated MLPs and RNNs
Journal of systems architecture (JSA), 2021
Marc Riera
J. Arnau
Antonio González
70
5
0
20 Jul 2021
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Anirudh Sreeram
Nicholas Mehlman
Raghuveer Peri
D. Knox
Shrikanth Narayanan
81
6
0
12 Jul 2021
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data
K. Cheuk
Dorien Herremans
Li Su
329
39
0
11 Jul 2021
Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Huahuan Zheng
Wenjie Peng
Zhijian Ou
Jinsong Zhang
197
5
0
07 Jul 2021
ARM-Net: Adaptive Relation Modeling Network for Structured Data
Shaofeng Cai
Kaiping Zheng
Gang Chen
H. V. Jagadish
Beng Chin Ooi
Meihui Zhang
245
61
0
05 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
130
20
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
398
111
0
01 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
332
20
0
01 Jul 2021
Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis
Conference on Computer and Communications Security (CCS), 2021
Chuanpu Fu
Qi Li
Meng Shen
Ke Xu
AAML
148
198
0
28 Jun 2021
Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition
Interspeech (Interspeech), 2021
Jianrong Wang
Zi-yue Tang
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
BDL
115
17
0
25 Jun 2021
Where are we in semantic concept extraction for Spoken Language Understanding?
Sahar Ghannay
Antoine Caubrière
Salima Mdhaffar
G. Laperriere
Bassam Jabaian
Yannick Esteve
175
18
0
24 Jun 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training
Neural Information Processing Systems (NeurIPS), 2021
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
69
1
0
22 Jun 2021
Silent Speech and Emotion Recognition from Vocal Tract Shape Dynamics in Real-Time MRI
Laxmi Pandey
A. Arif
101
8
0
16 Jun 2021
Exploiting Large-scale Teacher-Student Training for On-device Acoustic Models
Workshop on Time-Delay Systems (TS), 2021
Jing Liu
Rupak Vignesh Swaminathan
S. Parthasarathi
Chunchuan Lyu
Athanasios Mouchtaris
Siegfried Kunzmann
119
9
0
11 Jun 2021
U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Di Wu
Binbin Zhang
Chao Yang
Zhendong Peng
Wenjing Xia
Xiaoyu Chen
X. Lei
178
55
0
10 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Speech Communication (Speech Commun.), 2021
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
131
65
0
09 Jun 2021
Handcrafted Backdoors in Deep Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Sanghyun Hong
Nicholas Carlini
Alexey Kurakin
216
87
0
08 Jun 2021
Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation
International Conference on Machine Learning (ICML), 2021
Dong Min
Dong Bok Lee
Eunho Yang
Sung Ju Hwang
302
206
0
06 Jun 2021
Escaping Saddle Points Faster with Stochastic Momentum
International Conference on Learning Representations (ICLR), 2020
Jun-Kun Wang
Chi-Heng Lin
Jacob D. Abernethy
ODL
171
24
0
05 Jun 2021
Bottom-up and top-down approaches for the design of neuromorphic processing systems: Tradeoffs and synergies between natural and artificial intelligence
Proceedings of the IEEE (Proc. IEEE), 2021
Charlotte Frenkel
D. Bol
Giacomo Indiveri
233
56
0
02 Jun 2021
A Generalizable Approach to Learning Optimizers
Diogo Almeida
Clemens Winter
Jie Tang
Wojciech Zaremba
AI4CE
250
33
0
02 Jun 2021
A Sum-of-Ratios Multi-Dimensional-Knapsack Decomposition for DNN Resource Scheduling
IEEE Conference on Computer Communications (INFOCOM), 2021
Menglu Yu
Chuan Wu
Bo Ji
Jia Liu
113
10
0
28 May 2021
End-to-End Deep Fault Tolerant Control
IEEE/ASME transactions on mechatronics (IEEE/ASME Trans. Mechatronics), 2021
Daulet Baimukashev
Bexultan Rakhim
Matteo Rubagotti
H. A. Varol
106
13
0
28 May 2021