1512.02595
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 938 papers shown
Title
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
30
19
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
56
95
0
01 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
44
12
0
01 Jul 2021
Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis
Chuanpu Fu
Qi Li
Meng Shen
Ke Xu
AAML
20
148
0
28 Jun 2021
Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition
Jianrong Wang
Zi-yue Tang
Xuewei Li
Mei Yu
Qiang Fang
Li Liu
BDL
11
14
0
25 Jun 2021
Where are we in semantic concept extraction for Spoken Language Understanding?
Sahar Ghannay
Antoine Caubrière
Salima Mdhaffar
G. Laperriere
Bassam Jabaian
Yannick Esteve
17
18
0
24 Jun 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
19
1
0
22 Jun 2021
Silent Speech and Emotion Recognition from Vocal Tract Shape Dynamics in Real-Time MRI
Laxmi Pandey
A. Arif
14
8
0
16 Jun 2021
Exploiting Large-scale Teacher-Student Training for On-device Acoustic Models
Jing Liu
R. Swaminathan
S. Parthasarathi
Chunchuan Lyu
Athanasios Mouchtaris
Siegfried Kunzmann
30
9
0
11 Jun 2021
U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition
Di Wu
Binbin Zhang
Chao Yang
Zhendong Peng
Wenjing Xia
Xiaoyu Chen
X. Lei
29
47
0
10 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Handcrafted Backdoors in Deep Neural Networks
Sanghyun Hong
Nicholas Carlini
Alexey Kurakin
19
71
0
08 Jun 2021
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Dong Min
Dong Bok Lee
Eunho Yang
Sung Ju Hwang
25
160
0
06 Jun 2021
Escaping Saddle Points Faster with Stochastic Momentum
Jun-Kun Wang
Chi-Heng Lin
Jacob D. Abernethy
ODL
11
22
0
05 Jun 2021
Bottom-up and top-down approaches for the design of neuromorphic processing systems: Tradeoffs and synergies between natural and artificial intelligence
Charlotte Frenkel
D. Bol
Giacomo Indiveri
36
34
0
02 Jun 2021
A Generalizable Approach to Learning Optimizers
Diogo Almeida
Clemens Winter
Jie Tang
Wojciech Zaremba
AI4CE
19
29
0
02 Jun 2021
A Sum-of-Ratios Multi-Dimensional-Knapsack Decomposition for DNN Resource Scheduling
Menglu Yu
Chuan Wu
Bo Ji
Jia Liu
13
9
0
28 May 2021
End-to-End Deep Fault Tolerant Control
Daulet Baimukashev
Bexultan Rakhim
Matteo Rubagotti
H. A. Varol
17
7
0
28 May 2021
Dynamic Network selection for the Object Detection task: why it matters and what we (didn't) achieve
Emanuele Vitali
Anton Lokhmotov
G. Palermo
6
1
0
27 May 2021
BackEISNN: A Deep Spiking Neural Network with Adaptive Self-Feedback and Balanced Excitatory-Inhibitory Neurons
Dongcheng Zhao
Yi Zeng
Yang Li
35
40
0
27 May 2021
DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications
Yaoyu Zhang
Wai Teng Tang
Matthew Kay Fei Lee
Chuping Qu
Weng-Fai Wong
Rick Siow Mong Goh
30
0
0
25 May 2021
Unsupervised Speech Recognition
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
28
271
0
24 May 2021
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey
Xiaoyu Zhang
Chao Chen
Yi Xie
Xiaofeng Chen
Jun Zhang
Yang Xiang
FedML
24
7
0
13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
29
3
0
13 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech Recognition
Liqiang He
Shulin Feng
Dan Su
Dong Yu
29
0
0
08 May 2021
Relative stability toward diffeomorphisms indicates performance in deep nets
Leonardo Petrini
Alessandro Favero
Mario Geiger
M. Wyart
OOD
38
15
0
06 May 2021
On the limit of English conversational speech recognition
Zoltán Tüske
G. Saon
Brian Kingsbury
22
50
0
03 May 2021
End-to-End Speech Recognition from Federated Acoustic Models
Yan Gao
Titouan Parcollet
Salah Zaiem
Javier Fernandez-Marques
Pedro Porto Buarque de Gusmão
Daniel J. Beutel
Nicholas D. Lane
28
43
0
29 Apr 2021
On Addressing Practical Challenges for RNN-Transducer
Rui Zhao
Jian Xue
Jinyu Li
Wenning Wei
Lei He
Jiawei Liu
25
31
0
27 Apr 2021
Protecting gender and identity with disentangled speech representations
Dimitrios Stoidis
Andrea Cavallaro
30
10
0
22 Apr 2021
Dual Head Adversarial Training
Yujing Jiang
Xingjun Ma
S. Erfani
James Bailey
AAML
19
4
0
21 Apr 2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Yusuke Kida
Tatsuya Komatsu
M. Togami
21
1
0
21 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
Šimon Mandlík
Tomás Pevný
27
5
0
19 Apr 2021
BM-NAS: Bilevel Multimodal Neural Architecture Search
Yihang Yin
Siyu Huang
Xiang Zhang
32
27
0
19 Apr 2021
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques
Kaiqi Fu
Jones Lin
Dengfeng Ke
Yanlu Xie
Jinsong Zhang
Binghuai Lin
23
40
0
17 Apr 2021
Efficient conformer-based speech recognition with linear attention
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
24
20
0
14 Apr 2021
Phoneme-based Distribution Regularization for Speech Enhancement
Yajing Liu
Xiulian Peng
Zhiwei Xiong
Yan Lu
10
4
0
08 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
34
17
0
08 Apr 2021
Pushing the Limits of Non-Autoregressive Speech Recognition
Edwin G. Ng
Chung-Cheng Chiu
Yu Zhang
William Chan
VLM
16
27
0
07 Apr 2021
GPU Domain Specialization via Composable On-Package Architecture
Yaosheng Fu
Evgeny Bolotin
Niladrish Chatterjee
D. Nellans
S. Keckler
22
12
0
05 Apr 2021
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Yangyang Shi
Varun K. Nagaraja
Chunyang Wu
Jay Mahadeokar
Duc Le
...
Ching-Feng Yeh
Julian Chan
Christian Fuegen
Ozlem Kalinli
M. Seltzer
27
15
0
05 Apr 2021
Adaptive Clustering of Robust Semantic Representations for Adversarial Image Purification
S. Silva
Arun Das
I. Scarff
Peyman Najafirad
AAML
20
1
0
05 Apr 2021
A Comparative Analysis of Machine Learning and Grey Models
Gang He
Khwaja Mutahir Ahmad
Wenxin Yu
Xiaochuan Xu
J. Kumar
SyDa
AI4TS
31
0
0
02 Apr 2021
Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
32
6
0
31 Mar 2021
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr Żelasko
Sonal Joshi
Yiwen Shao
Jesus Villalba
J. Trmal
Najim Dehak
Sanjeev Khudanpur
AAML
21
28
0
31 Mar 2021
Shrinking Bigfoot: Reducing wav2vec 2.0 footprint
Zilun Peng
Akshay Budhkar
Ilana Tuil
J. Levy
Parinaz Sobhani
Raphael Cohen
J. Nassour
9
32
0
29 Mar 2021
Construction of a Large-scale Japanese ASR Corpus on TV Recordings
Shintaro Ando
Hiromasa Fujihara
11
21
0
26 Mar 2021
HufuNet: Embedding the Left Piece as Watermark and Keeping the Right Piece for Ownership Verification in Deep Neural Networks
Peizhuo Lv
Pan Li
Shengzhi Zhang
Kai Chen
Ruigang Liang
Yue Zhao
Yingjiu Li
AAML
6
5
0
25 Mar 2021
Federated Quantum Machine Learning
Samuel Yen-Chi Chen
Shinjae Yoo
FedML
AI4CE
24
117
0
22 Mar 2021
SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
Yuxuan Chen
Jiangshan Zhang
Xuejing Yuan
Shengzhi Zhang
Kai Chen
Xiaofeng Wang
Shanqing Guo
AAML
39
15
0
19 Mar 2021