Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1512.02595
Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 1,096 papers shown
Title
Dynamic Network selection for the Object Detection task: why it matters and what we (didn't) achieve
International Conference / Workshop on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS), 2021
Emanuele Vitali
Anton Lokhmotov
G. Palermo
67
1
0
27 May 2021
BackEISNN: A Deep Spiking Neural Network with Adaptive Self-Feedback and Balanced Excitatory-Inhibitory Neurons
Neural Networks (NN), 2021
Dongcheng Zhao
Yi Zeng
Yang Li
155
52
0
27 May 2021
DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications
Yaoyu Zhang
Wai Teng Tang
Matthew Kay Fei Lee
Chuping Qu
Weng-Fai Wong
Rick Siow Mong Goh
126
0
0
25 May 2021
Unsupervised Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
349
292
0
24 May 2021
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey
Xiaoyu Zhang
Chao Chen
Yi Xie
Xiaofeng Chen
Jun Zhang
Yang Xiang
FedML
94
7
0
13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
96
4
0
13 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Liqiang He
Shulin Feng
Jane Polak Scowcroft
Dong Yu
143
0
0
08 May 2021
Relative stability toward diffeomorphisms indicates performance in deep nets
Neural Information Processing Systems (NeurIPS), 2021
Leonardo Petrini
Alessandro Favero
Mario Geiger
Matthieu Wyart
OOD
243
15
0
06 May 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
171
53
0
03 May 2021
End-to-End Speech Recognition from Federated Acoustic Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yan Gao
Titouan Parcollet
Salah Zaiem
Javier Fernandez-Marques
Pedro Porto Buarque de Gusmão
Daniel J. Beutel
Nicholas D. Lane
175
46
0
29 Apr 2021
On Addressing Practical Challenges for RNN-Transducer
Automatic Speech Recognition & Understanding (ASRU), 2021
Rui Zhao
Jian Xue
Jinyu Li
Wenning Wei
Lei He
Jiawei Liu
217
33
0
27 Apr 2021
Protecting gender and identity with disentangled speech representations
Interspeech (Interspeech), 2021
Dimitrios Stoidis
Andrea Cavallaro
175
12
0
22 Apr 2021
Dual Head Adversarial Training
IEEE International Joint Conference on Neural Network (IJCNN), 2021
Yujing Jiang
Jiabo He
S. Erfani
James Bailey
AAML
158
7
0
21 Apr 2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Yusuke Kida
Tatsuya Komatsu
M. Togami
89
1
0
21 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
Šimon Mandlík
Tomás Pevný
178
6
0
19 Apr 2021
BM-NAS: Bilevel Multimodal Neural Architecture Search
AAAI Conference on Artificial Intelligence (AAAI), 2021
Yihang Yin
Siyu Huang
Xiang Zhang
199
34
0
19 Apr 2021
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques
Kaiqi Fu
Jones Lin
Dengfeng Ke
Yanlu Xie
Jinsong Zhang
Binghuai Lin
150
46
0
17 Apr 2021
Efficient conformer-based speech recognition with linear attention
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
186
26
0
14 Apr 2021
Phoneme-based Distribution Regularization for Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yajing Liu
Xiulian Peng
Zhiwei Xiong
Yan Lu
67
5
0
08 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
114
18
0
08 Apr 2021
Pushing the Limits of Non-Autoregressive Speech Recognition
Interspeech (Interspeech), 2021
Edwin G. Ng
Chung-Cheng Chiu
Yu Zhang
William Chan
VLM
214
30
0
07 Apr 2021
GPU Domain Specialization via Composable On-Package Architecture
ACM Transactions on Architecture and Code Optimization (TACO) (TACO), 2021
Yaosheng Fu
Evgeny Bolotin
Niladrish Chatterjee
D. Nellans
S. Keckler
105
15
0
05 Apr 2021
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Interspeech (Interspeech), 2021
Yangyang Shi
Varun K. Nagaraja
Chunyang Wu
Jay Mahadeokar
Duc Le
...
Ching-Feng Yeh
Julian Chan
Christian Fuegen
Ozlem Kalinli
M. Seltzer
135
16
0
05 Apr 2021
Adaptive Clustering of Robust Semantic Representations for Adversarial Image Purification
S. Silva
Arun Das
I. Scarff
Peyman Najafirad
AAML
140
1
0
05 Apr 2021
A Comparative Analysis of Machine Learning and Grey Models
Gang He
Khwaja Mutahir Ahmad
Wenxin Yu
Xiaochuan Xu
J. Kumar
SyDa
AI4TS
131
0
0
02 Apr 2021
Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Interspeech (Interspeech), 2021
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
171
6
0
31 Mar 2021
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr Żelasko
Sonal Joshi
Yiwen Shao
Jesus Villalba
J. Trmal
Najim Dehak
Sanjeev Khudanpur
AAML
115
36
0
31 Mar 2021
Shrinking Bigfoot: Reducing wav2vec 2.0 footprint
Zilun Peng
Akshay Budhkar
Ilana Tuil
J. Levy
Parinaz Sobhani
Raphael Cohen
J. Nassour
158
35
0
29 Mar 2021
Construction of a Large-scale Japanese ASR Corpus on TV Recordings
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shintaro Ando
Hiromasa Fujihara
91
28
0
26 Mar 2021
HufuNet: Embedding the Left Piece as Watermark and Keeping the Right Piece for Ownership Verification in Deep Neural Networks
Peizhuo Lv
Pan Li
Shengzhi Zhang
Kai Chen
Ruigang Liang
Yue Zhao
Yingjiu Li
AAML
124
8
0
25 Mar 2021
Federated Quantum Machine Learning
Entropy (Entropy), 2021
Samuel Yen-Chi Chen
Shinjae Yoo
FedML
AI4CE
177
154
0
22 Mar 2021
SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
ACM Transactions on Privacy and Security (ACM TOPS), 2021
Yuxuan Chen
Jiangshan Zhang
Xuejing Yuan
Shengzhi Zhang
Kai Chen
Luyi Xing
Shanqing Guo
AAML
238
19
0
19 Mar 2021
Modeling the Second Player in Distributionally Robust Optimization
International Conference on Learning Representations (ICLR), 2021
Paul Michel
Tatsunori Hashimoto
Graham Neubig
203
36
0
18 Mar 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation
Interspeech (Interspeech), 2021
Md. Akmal Haidar
Chao Xing
Mehdi Rezagholizadeh
166
6
0
17 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
122
18
0
13 Mar 2021
Learning spectro-temporal representations of complex sounds with parameterized neural networks
Journal of the Acoustical Society of America (JASA), 2021
Rachid Riad
Julien Karadayi
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
108
8
0
12 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
IEEE Transactions on Affective Computing (TAC), 2021
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
146
70
0
10 Mar 2021
Deep Learning for Android Malware Defenses: a Systematic Literature Review
ACM Computing Surveys (CSUR), 2021
Yue Liu
Chakkrit Tantithamthavorn
Li Li
Yepang Liu
AAML
230
99
0
09 Mar 2021
Consistency Regularization for Adversarial Robustness
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jihoon Tack
Sihyun Yu
Jongheon Jeong
Minseon Kim
Sung Ju Hwang
Jinwoo Shin
AAML
256
69
0
08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial Examples
USENIX Security Symposium (USENIX Security), 2021
Shehzeen Samarah Hussain
Paarth Neekhara
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
138
82
0
04 Mar 2021
Explaining Adversarial Vulnerability with a Data Sparsity Hypothesis
Neurocomputing (Neurocomputing), 2021
Mahsa Paknezhad
Cuong Phuc Ngo
Amadeus Aristo Winarto
Alistair Cheong
Beh Chuen Yang
Wu Jiayang
Lee Hwee Kuan
OOD
AAML
216
10
0
01 Mar 2021
Experiments with Rich Regime Training for Deep Learning
Xinyan Li
A. Banerjee
133
2
0
26 Feb 2021
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech
Juliette Millet
J. King
255
35
0
25 Feb 2021
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
121
70
0
25 Feb 2021
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jian Luo
Jianzong Wang
Ning Cheng
Jing Xiao
RALM
102
6
0
23 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Computer Speech and Language (CSL), 2021
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
141
63
0
19 Feb 2021
One Shot Audio to Animated Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
H. Mujtaba
Pranshu Agarwal
D. Sarkar
VGen
84
1
0
19 Feb 2021
Do End-to-End Speech Recognition Models Care About Context?
Interspeech (Interspeech), 2020
Lasse Borgholt
Jakob Drachmann Havtorn
Zeljko Agic
Anders Søgaard
Lars Maaløe
Christian Igel
102
8
0
17 Feb 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems
Applied Soft Computing (Appl Soft Comput), 2021
Yi Lin
Bo Yang
Linchao Li
Dongyue Guo
Jianwei Zhang
Hu Chen
Yi Zhang
139
32
0
17 Feb 2021
Improving speech recognition models with small samples for air traffic control systems
Neurocomputing (Neurocomputing), 2021
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
166
33
0
16 Feb 2021
Previous
1
2
3
...
8
9
10
...
20
21
22
Next