Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.02595
Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"
50 / 931 papers shown
Title
COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
D. Kadiyala
Saeed Rashidi
Taekyung Heo
Abhimanyu Bambhaniya
T. Krishna
Alexandros Daglis
VLM
24
9
0
30 Nov 2022
High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors
Yun-Hao Bai
Yanbo Fan
Xuanxia Wang
Yong Zhang
Jingxiang Sun
Chun Yuan
Ying Shan
3DH
32
25
0
28 Nov 2022
Deep Learning Training Procedure Augmentations
Cristian Simionescu
11
1
0
25 Nov 2022
Dynamic Neural Portraits
M. Doukas
Stylianos Ploumpis
S. Zafeiriou
3DH
30
1
0
25 Nov 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Jiaxiang Tang
Kaisiyuan Wang
Hang Zhou
Xiaokang Chen
Dongliang He
Tianshu Hu
Jingtuo Liu
Gang Zeng
Jingdong Wang
3DH
40
76
0
22 Nov 2022
Phonemic Adversarial Attack against Audio Recognition in Real World
Jiakai Wang
Zhendong Chen
Zixin Yin
Qinghong Yang
Xianglong Liu
AAML
37
3
0
19 Nov 2022
Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications
Zhongkai Hao
Songming Liu
Yichi Zhang
Chengyang Ying
Yao Feng
Hang Su
Jun Zhu
PINN
AI4CE
37
91
0
15 Nov 2022
FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Hossein Katebi
Navidreza Asadi
M. Goudarzi
MQ
27
0
0
13 Nov 2022
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization
Zhengkun Tian
Hongyu Xiang
Min Li
Fei Lin
Ke Ding
Guanglu Wan
15
6
0
07 Nov 2022
H_eval: A new hybrid evaluation metric for automatic speech recognition tasks
Zitha Sasindran
Harsha Yelchuri
T. V. Prabhakar
Supreeth K. Rao
VLM
12
2
0
03 Nov 2022
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
Yonggan Fu
Yang Zhang
Kaizhi Qian
Zhifan Ye
Zhongzhi Yu
Cheng-I Jeff Lai
Yingyan Lin
30
8
0
02 Nov 2022
Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames
Che-Yuan Liang
Xiao-Lei Zhang
BinBin Zhang
Di Wu
Shengqiang Li
Xingcheng Song
Zhendong Peng
Fuping Pan
18
8
0
02 Nov 2022
Cover Reproducible Steganography via Deep Generative Models
Kejiang Chen
Hang Zhou
Yaofei Wang
Meng Li
Weiming Zhang
Neng H. Yu
DiffM
31
9
0
26 Oct 2022
Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla
Ahnaf Mozib Samin
M. Kobir
Md. Mushtaq Shahriyar Rafee
M. F. Ahmed
Mehedi Hasan
Partha Ghosh
Shafkat Kibria
M. S. Rahman
SSL
26
0
0
24 Oct 2022
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?
Pradip Pramanick
Chayan Sarkar
24
7
0
21 Oct 2022
Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks
Minsu Kim
Youngjoon Yu
Sungjune Park
Y. Ro
OOD
18
0
0
21 Oct 2022
Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses
C. Li
Ngoc Thang Vu
21
2
0
20 Oct 2022
On effects of Knowledge Distillation on Transfer Learning
Sushil Thapa
24
1
0
18 Oct 2022
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning
Ali Safaya
E. Erzin
21
1
0
13 Oct 2022
Deep learning model compression using network sensitivity and gradients
M. Sakthi
N. Yadla
Raj Pawate
21
2
0
11 Oct 2022
TAN Without a Burn: Scaling Laws of DP-SGD
Tom Sander
Pierre Stock
Alexandre Sablayrolles
FedML
47
42
0
07 Oct 2022
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations
Vasista Sai Lodagala
Sreyan Ghosh
S. Umesh
SSL
46
18
0
05 Oct 2022
How deep convolutional neural networks lose spatial information with training
Umberto M. Tomasini
Leonardo Petrini
Francesco Cagnetta
M. Wyart
41
9
0
04 Oct 2022
A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition
Kyuhong Shim
Wonyong Sung
25
2
0
01 Oct 2022
CRISP: Curriculum based Sequential Neural Decoders for Polar Code Family
Ashwin Hebbar
Viraj Nadkarni
Ashok Vardhan Makkuva
S. Bhat
Sewoong Oh
Pramod Viswanath
30
6
0
01 Oct 2022
A Survey on Physical Adversarial Attack in Computer Vision
Donghua Wang
Wen Yao
Tingsong Jiang
Guijian Tang
Xiaoqian Chen
AAML
56
38
0
28 Sep 2022
InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Mu Yuan
Lan Zhang
Fengxiang He
Xueting Tong
Miao-Hui Song
Zhengyuan Xu
Xiang-Yang Li
32
2
0
28 Sep 2022
NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Jixun Yao
Qing Wang
Li Zhang
Pengcheng Guo
Yuhao Liang
Linfu Xie
PICV
26
16
0
24 Sep 2022
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Lingjiao Chen
Zhihua Jin
Sabri Eyuboglu
Christopher Ré
Matei A. Zaharia
James Zou
51
9
0
18 Sep 2022
Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models
R. Olivier
H. Abdullah
Bhiksha Raj
AAML
26
1
0
17 Sep 2022
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Sindhu B. Hegde
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
45
10
0
01 Sep 2022
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
Jun Ling
Xuejiao Tan
Liyang Chen
Runnan Li
Yuchao Zhang
Sheng Zhao
Liang Song
CVBM
47
13
0
29 Aug 2022
Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems
Prasoon Sinha
Akhil Guliani
Rutwik Jain
Brandon Tran
Matthew D. Sinclair
Shivaram Venkataraman
19
17
0
23 Aug 2022
Low-Level Physiological Implications of End-to-End Learning of Speech Recognition
Louise Coppieters de Gibson
Philip N. Garner
21
1
0
22 Aug 2022
Resisting Adversarial Attacks in Deep Neural Networks using Diverse Decision Boundaries
Manaar Alam
Shubhajit Datta
Debdeep Mukhopadhyay
Arijit Mondal
P. Chakrabarti
AAML
12
5
0
18 Aug 2022
Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Georgios Karakasidis
Tamás Grósz
M. Kurimo
22
2
0
10 Aug 2022
OLLIE: Derivation-based Tensor Program Optimizer
Liyan Zheng
Haojie Wang
Jidong Zhai
Muyan Hu
Zixuan Ma
Tuowei Wang
Shizhi Tang
Lei Xie
Kezhao Huang
Zhihao Jia
46
3
0
02 Aug 2022
A 23
μ
μ
μ
W Keyword Spotting IC with Ring-Oscillator-Based Time-Domain Feature Extraction
Kwantae Kim
Chang Gao
Rui Gracca
Ilya Kiselev
H. Yoo
T. Delbruck
Shih-Chii Liu
16
21
0
01 Aug 2022
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
A. I. S. Ferreira
Gustavo dos Reis Oliveira
27
3
0
29 Jul 2022
Federated Selective Aggregation for Knowledge Amalgamation
Don Xie
Ruonan Yu
Gongfan Fang
Mingli Song
Zunlei Feng
Xinchao Wang
Li Sun
Mingli Song
FedML
38
3
0
27 Jul 2022
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant Systems for Machine Learning
Baolin Li
Tirthak Patel
S. Samsi
V. Gadepally
Devesh Tiwari
14
51
0
23 Jul 2022
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks
Andrew M. Saxe
Shagun Sodhani
Sam Lewallen
AI4CE
30
34
0
21 Jul 2022
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting
Thierry Desot
François Portet
Michel Vacher
27
12
0
17 Jul 2022
Data Augmentation for Low-Resource Quechua ASR Improvement
Rodolfo Zevallos
Núria Bel
Guillermo Cámbara
Mireia Farrús
Jordi Luque
VLM
SyDa
19
6
0
14 Jul 2022
The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression
Alice Baird
Panagiotis Tzirakis
Jeffrey A. Brooks
Christopher B. Gregory
Björn Schuller
A. Batliner
D. Keltner
Alan S. Cowen
25
17
0
07 Jul 2022
Rapid training of quantum recurrent neural networks
M. Siemaszko
A. Buraczewski
Bertrand Le Saux
Magdalena Stobiñska
27
9
0
01 Jul 2022
Data-Efficient Learning via Minimizing Hyperspherical Energy
Xiaofeng Cao
Weiyang Liu
Ivor W. Tsang
19
8
0
30 Jun 2022
An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations
F. Bonnet
Jocelyn Ahmed Mazari
T. Munzer
P. Yser
Patrick Gallinari
AI4CE
66
10
0
29 Jun 2022
Wav2Vec-Aug: Improved self-supervised training with limited data
Anuroop Sriram
Michael Auli
Alexei Baevski
SSL
VLM
22
15
0
27 Jun 2022
Self-Healing Robust Neural Networks via Closed-Loop Control
Zhuotong Chen
Qianxiao Li
Zheng-Wei Zhang
AAML
OOD
6
11
0
26 Jun 2022
Previous
1
2
3
4
5
...
17
18
19
Next