Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.02595
Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 937 papers shown
Title
A Survey on Adversarial Attacks for Malware Analysis
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
AAML
34
51
0
16 Nov 2021
Recurrent Neural Networks for Learning Long-term Temporal Dependencies with Reanalysis of Time Scale Representation
Kentaro Ohno
Atsutoshi Kumagai
CLL
AI4CE
6
8
0
05 Nov 2021
Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel
Kevin Eloff
Okko Rasanen
H. Engelbrecht
Arnu Pretorius
Herman Kamper
35
3
0
04 Nov 2021
Speech recognition for air traffic control via feature learning and end-to-end training
Peng Fan
Dongyue Guo
Yi Lin
Bo Yang
Jianwei Zhang
15
7
0
04 Nov 2021
RT-RCG: Neural Network and Accelerator Search Towards Effective and Real-time ECG Reconstruction from Intracardiac Electrograms
Yongan Zhang
Anton Banta
Yonggan Fu
M. John
A. Post
M. Razavi
Joseph R. Cavallaro
B. Aazhang
Yingyan Lin
26
4
0
04 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
35
363
0
02 Nov 2021
EfficientWord-Net: An Open Source Hotword Detection Engine based on One-shot Learning
R. Chidhambararajan
Aman Rangaur
S. C. Sethuraman
14
4
0
31 Oct 2021
Beyond
L
p
L_p
L
p
clipping: Equalization-based Psychoacoustic Attacks against ASRs
H. Abdullah
Muhammad Sajidur Rahman
Christian Peeters
Cassidy Gibson
Washington Garcia
Vincent Bindschaedler
T. Shrimpton
Patrick Traynor
AAML
19
9
0
25 Oct 2021
Asynchronous Decentralized Distributed Training of Acoustic Models
Xiaodong Cui
Wei Zhang
Abdullah Kayi
Mingrui Liu
Ulrich Finkler
Brian Kingsbury
G. Saon
David S. Kung
32
3
0
21 Oct 2021
Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning
Ningning Xie
Tamara Norman
Dominik Grewe
Dimitrios Vytiniotis
19
16
0
20 Oct 2021
Chunked Autoregressive GAN for Conditional Waveform Synthesis
Max Morrison
Rithesh Kumar
Kundan Kumar
Prem Seetharaman
Aaron Courville
Yoshua Bengio
GAN
41
69
0
19 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
Henry Gouk
Chen Change Loy
Timothy M. Hospedales
SSL
OOD
AI4TS
34
274
0
18 Oct 2021
Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor
Anchit Gupta
Faizan Farooq Khan
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
24
6
0
16 Oct 2021
Multistage linguistic conditioning of convolutional layers for speech emotion recognition
Andreas Triantafyllopoulos
U. Reichel
Shuo Liu
Simon Huber
F. Eyben
Björn W. Schuller
29
9
0
13 Oct 2021
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables
Jounghee Kim
Pilsung Kang
VLM
29
6
0
11 Oct 2021
Boosting Fast Adversarial Training with Learnable Adversarial Initialization
Xiaojun Jia
Yong Zhang
Baoyuan Wu
Jue Wang
Xiaochun Cao
AAML
50
54
0
11 Oct 2021
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
Li Fu
Xiaoxiao Li
Runyu Wang
Lu Fan
Zhengchen Zhang
Meng Chen
Youzheng Wu
Xiaodong He
SSL
8
3
0
08 Oct 2021
Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Yuanchao Wang
Wenjing Du
Chenghao Cai
Yanyan Xu
34
1
0
08 Oct 2021
Speeding up Deep Model Training by Sharing Weights and Then Unsharing
Shuo Yang
Le Hou
Xiaodan Song
Qiang Liu
Denny Zhou
110
9
0
08 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
25
219
0
07 Oct 2021
Back from the future: bidirectional CTC decoding using future information in speech recognition
Namkyu Jung
Geon-min Kim
Han-Gyu Kim
33
3
0
07 Oct 2021
BERT Attends the Conversation: Improving Low-Resource Conversational ASR
Pablo Ortiz
Simen Burud
34
4
0
05 Oct 2021
Adversarial Regression with Doubly Non-negative Weighting Matrices
Tam Le
Truyen V. Nguyen
M. Yamada
Jose H. Blanchet
Viet Anh Nguyen
27
5
0
30 Sep 2021
VoxCeleb Enrichment for Age and Gender Recognition
Khaled Hechmi
Trung Ngo Trong
Ville Hautamaki
Tomi Kinnunen
24
30
0
28 Sep 2021
DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning
Tongan Cai
Haomiao Ni
Ming-Chieh Yu
Xiaolei Huang
K. Wong
John Volpi
Jianmin Wang
Stephen T. C. Wong
26
14
0
24 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
32
26
0
19 Sep 2021
Enforcing fairness in private federated learning via the modified method of differential multipliers
Borja Rodríguez Gálvez
Filip Granqvist
Rogier van Dalen
M. Seigel
FedML
48
52
0
17 Sep 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Desh Raj
Liang Lu
Zhuo Chen
Yashesh Gaur
Jinyu Li
19
17
0
17 Sep 2021
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Chen Zhang
Jiaxing Yu
Luchin Chang
Xu Tan
Jiawei Chen
Tao Qin
Kecheng Zhang
30
15
0
16 Sep 2021
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition
Felix Wu
Kwangyoun Kim
Jing Pan
Kyu Jeong Han
Kilian Q. Weinberger
Yoav Artzi
27
71
0
14 Sep 2021
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing Deep Neural Networks for Wearables
B. Prabakaran
Asima Akhtar
Semeen Rehman
Osman Hasan
Muhammad Shafique
11
9
0
07 Sep 2021
SEC4SR: A Security Analysis Platform for Speaker Recognition
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Yang Liu
AAML
25
12
0
04 Sep 2021
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory
Zhijie Lin
Zhou Zhao
Haoyuan Li
Jinglin Liu
Meng Zhang
Xingshan Zeng
Xiaofei He
30
18
0
31 Aug 2021
Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring
Yaman Kumar Singla
Avykat Gupta
Shaurya Bagga
Changyou Chen
Balaji Krishnamurthy
R. Shah
32
12
0
30 Aug 2021
CrossedWires: A Dataset of Syntactically Equivalent but Semantically Disparate Deep Learning Models
Max Zvyagin
Thomas Brettin
Arvind Ramanathan
Sumit Kumar Jha
19
1
0
29 Aug 2021
Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers
Juntae Kim
Jee-Hye Lee
32
6
0
22 Aug 2021
Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition
Arash Dehghani
Seyyed Ali Seyyedsalehi
32
1
0
09 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
54
47
0
08 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
36
25
0
05 Aug 2021
Imperceptible Adversarial Examples by Spatial Chroma-Shift
A. Aydin
Deniz Sen
Berat Tuna Karli
Oguz Hanoglu
A. Temi̇zel
AAML
26
16
0
05 Aug 2021
Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-Temporal Sparsity
Chang Gao
T. Delbruck
Shih-Chii Liu
21
44
0
04 Aug 2021
Transformer-based Map Matching Model with Limited Ground-Truth Data using Transfer-Learning Approach
Zhixiong Jin
Jiwon Kim
H. Yeo
Seongjin Choi
38
27
0
01 Aug 2021
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
26
21
0
30 Jul 2021
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
L. Gris
Edresson Casanova
F. S. Oliveira
A. S. Soares
A. Júnior
4
16
0
23 Jul 2021
Semantic Communications for Speech Recognition
Zhenzi Weng
Zhijin Qin
Geoffrey Ye Li
33
35
0
22 Jul 2021
CREW: Computation Reuse and Efficient Weight Storage for Hardware-accelerated MLPs and RNNs
Marc Riera
J. Arnau
Antonio González
15
5
0
20 Jul 2021
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems
Anirudh Sreeram
Nicholas Mehlman
Raghuveer Peri
D. Knox
Shrikanth Narayanan
29
5
0
12 Jul 2021
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data
K. Cheuk
Dorien Herremans
Li Su
58
32
0
11 Jul 2021
Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers
Huahuan Zheng
Wenjie Peng
Zhijian Ou
Jinsong Zhang
28
5
0
07 Jul 2021
ARM-Net: Adaptive Relation Modeling Network for Structured Data
Shaofeng Cai
Kaiping Zheng
Gang Chen
H. V. Jagadish
Beng Chin Ooi
Meihui Zhang
35
50
0
05 Jul 2021
Previous
1
2
3
...
6
7
8
...
17
18
19
Next