Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1512.02595
Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 1,096 papers shown
Title
Jira: a Kurdish Speech Recognition System Designing and Building Speech Corpus and Pronunciation Lexicon
H. Veisi
Hawre Hosseini
Mohammad MohammadAmini
Wirya Fathy
Aso Mahmudi
78
4
0
15 Feb 2021
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
107
20
0
15 Feb 2021
Straggler-Resilient Distributed Machine Learning with Dynamic Backup Workers
Efstathia Soufleri
Gang Yan
Rahul Singh
Jian Li
111
13
0
11 Feb 2021
An Investigation of End-to-End Models for Robust Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Archiki Prasad
Preethi Jyothi
R. Velmurugan
128
23
0
11 Feb 2021
BembaSpeech: A Speech Recognition Corpus for the Bemba Language
International Conference on Language Resources and Evaluation (LREC), 2021
Claytone Sikasote
Antonios Anastasopoulos
79
25
0
09 Feb 2021
Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
International Conference on Machine Learning and Applications (ICMLA), 2020
James Mou
Jun Li
61
5
0
03 Feb 2021
Unbox the Black-box for the Medical Explainable AI via Multi-modal and Multi-centre Data Fusion: A Mini-Review, Two Showcases and Beyond
Information Fusion (Inf. Fusion), 2021
Guang Yang
Qinghao Ye
Jun Xia
261
568
0
03 Feb 2021
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2021
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
314
304
0
02 Feb 2021
High Fidelity Speech Regeneration with Application to Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Adam Polyak
Lior Wolf
Yossi Adi
Ori Kabeli
Yaniv Taigman
149
19
0
31 Jan 2021
Curriculum Learning: A Survey
International Journal of Computer Vision (IJCV), 2021
Petru Soviany
Radu Tudor Ionescu
Paolo Rota
Andrii Zadaianchuk
ODL
529
469
0
25 Jan 2021
Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Social Science Research Network (SSRN), 2021
D. Pinto
J. Arnau
Antonio González
55
0
0
22 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and Human
Computer Speech and Language (CSL), 2021
A. Hussein
Shinji Watanabe
Ahmed M. Ali
VLM
153
55
0
21 Jan 2021
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
ETRI Journal (ETRI J.), 2021
Y. Oh
Kiyoung Park
Jeongue Park
OffRL
253
5
0
14 Jan 2021
Self-Adaptive Reconfigurable Arrays (SARA): Using ML to Assist Scaling GEMM Acceleration
A. Samajdar
Michael Pellauer
T. Krishna
169
4
0
12 Jan 2021
Model-Based Machine Learning for Communications
Stefano Rini
Nariman Farsad
Yonina C. Eldar
Andrea J. Goldsmith
160
23
0
12 Jan 2021
Noise Sensitivity-Based Energy Efficient and Robust Adversary Detection in Neural Networks
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2021
Rachel Sterneck
Abhishek Moitra
Priyadarshini Panda
AAML
107
9
0
05 Jan 2021
Robustness Testing of Language Understanding in Task-Oriented Dialog
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Jiexi Liu
Ryuichi Takanobu
Jiaxin Wen
Dazhen Wan
Hongguang Li
Weiran Nie
Cheng Li
Wei Peng
Shiyu Huang
ELM
452
50
0
30 Dec 2020
Perspective: A Phase Diagram for Deep Learning unifying Jamming, Feature Learning and Lazy Training
Mario Geiger
Leonardo Petrini
Matthieu Wyart
DRL
157
11
0
30 Dec 2020
IIRC: Incremental Implicitly-Refined Classification
Computer Vision and Pattern Recognition (CVPR), 2020
Mohamed Abdelsalam
Mojtaba Faramarzi
Shagun Sodhani
A. Chandar
CLL
164
32
0
23 Dec 2020
DISCO: Dynamic and Invariant Sensitive Channel Obfuscation for deep neural networks
Computer Vision and Pattern Recognition (CVPR), 2020
Abhishek Singh
Ayush Chopra
Vivek Sharma
Ethan Garza
Emily Zhang
Praneeth Vepakomma
Ramesh Raskar
184
56
0
20 Dec 2020
Robust One Shot Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
H. Mujtaba
VGen
100
14
0
14 Dec 2020
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
113
5
0
14 Dec 2020
Data Appraisal Without Data Sharing
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Mimee Xu
Laurens van der Maaten
Awni Y. Hannun
TDI
261
6
0
11 Dec 2020
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition
Binbin Zhang
Di Wu
Zhuoyuan Yao
Xiong Wang
F. Yu
Chao Yang
Liyong Guo
Yaguang Hu
Lei Xie
X. Lei
234
86
0
10 Dec 2020
Contrastive Predictive Coding for Human Activity Recognition
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2020
H. Haresamudram
Irfan Essa
Thomas Ploetz
274
139
0
09 Dec 2020
Creativity of Deep Learning: Conceptualization and Assessment
International Conference on Agents and Artificial Intelligence (ICAART), 2020
Marcus Basalla
Johannes Schneider
Jan vom Brocke
260
14
0
03 Dec 2020
End to End ASR System with Automatic Punctuation Insertion
Yushi Guan
3DV
86
6
0
03 Dec 2020
TimeSHAP: Explaining Recurrent Models through Sequence Perturbations
Knowledge Discovery and Data Mining (KDD), 2020
João Bento
Pedro Saleiro
André F. Cruz
Mário A. T. Figueiredo
P. Bizarro
FAtt
AI4TS
313
123
0
30 Nov 2020
Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp
IEEE International Conference on Robotics and Automation (ICRA), 2020
Junfan Lin
Zhongzhan Huang
Keze Wang
Xiaodan Liang
Weiwei Chen
Liang Lin
136
12
0
30 Nov 2020
Unigram-Normalized Perplexity as a Language Model Performance Measure with Different Vocabulary Sizes
Jihyeon Roh
Sang-Hoon Oh
Soo-Young Lee
83
8
0
26 Nov 2020
STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2020
Prakamya Mishra
101
0
0
23 Nov 2020
Deep Learning in EEG: Advance of the Last Ten-Year Critical Period
IEEE Transactions on Cognitive and Developmental Systems (TCDS), 2020
Shu Gong
Kaibo Xing
A. Cichocki
Junhua Li
VLM
331
85
0
22 Nov 2020
Low-Dimensional Manifolds Support Multiplexed Integrations in Recurrent Neural Networks
Neural Computation (Neural Comput.), 2020
Arnaud Fanthomme
R. Monasson
157
6
0
20 Nov 2020
Master Thesis: Neural Sign Language Translation by Learning Tokenization
Alptekin Orbay
SLR
88
0
0
18 Nov 2020
Refining Automatic Speech Recognition System for older adults
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Liu Chen
Meysam Asgari
89
10
0
17 Nov 2020
Skin disease diagnosis with deep learning: a review
Hongfeng Li
Yini Pan
Jie Zhao
Li Zhang
252
127
0
11 Nov 2020
Recognizing More Emotions with Less Data Using Self-supervised Transfer Learning
Jonathan Boigne
Biman Liyanage
Ted Östrem
92
25
0
11 Nov 2020
Highly Available Data Parallel ML training on Mesh Networks
Sameer Kumar
N. Jouppi
MoE
AI4CE
105
15
0
06 Nov 2020
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
208
15
0
04 Nov 2020
Joint Masked CPC and CTC Training for ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Chaitanya Talnikar
Tatiana Likhomanenko
R. Collobert
Gabriel Synnaeve
SSL
260
28
0
30 Oct 2020
Greedy Optimization Provably Wins the Lottery: Logarithmic Number of Winning Tickets is Enough
Neural Information Processing Systems (NeurIPS), 2020
Mao Ye
Lemeng Wu
Qiang Liu
127
17
0
29 Oct 2020
Two-stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Seongbin Kim
Gyuwan Kim
Seongjin Shin
Sangmin Lee
VLM
285
21
0
25 Oct 2020
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
International Symposium on High-Performance Computer Architecture (HPCA), 2020
Yujeong Choi
Yunseong Kim
Minsoo Rhu
138
79
0
25 Oct 2020
Scale-, shift- and rotation-invariant diffractive optical networks
ACS Photonics (ACS Photonics), 2020
Deniz Mengu
Y. Rivenson
Aydogan Ozcan
204
71
0
24 Oct 2020
Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Menglong Xu
Shengqiang Li
Xiao-Lei Zhang
233
35
0
23 Oct 2020
Few-shot Image Recognition with Manifolds
Debasmit Das
J. Moon
C. S. George Lee
112
10
0
22 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
160
37
0
22 Oct 2020
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
R. Collobert
Gabriel Synnaeve
363
105
0
22 Oct 2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
James Qin
Daniel S. Park
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Quoc V. Le
Yonghui Wu
VLM
SSL
492
325
0
20 Oct 2020
Energy-based error bound of physics-informed neural network solutions in elasticity
Journal of engineering mechanics (J. Eng. Mech.), 2020
Mengwu Guo
E. Haghighat
PINN
222
34
0
18 Oct 2020
Previous
1
2
3
...
9
10
11
...
20
21
22
Next