ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXivPDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 936 papers shown
Title
A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Yufeng Yang
Peidong Wang
DeLiang Wang
20
12
0
01 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition
A Survey of Multilingual Models for Automatic Speech Recognition
Hemant Yadav
Sunayana Sitaram
24
35
0
25 Feb 2022
Differentially Private Speaker Anonymization
Differentially Private Speaker Anonymization
Ali Shahin Shamsabadi
B. M. L. Srivastava
A. Bellet
Nathalie Vauquier
Emmanuel Vincent
Mohamed Maouche
Marc Tommasi
Nicolas Papernot
MIACV
54
33
0
23 Feb 2022
Memory Planning for Deep Neural Networks
Memory Planning for Deep Neural Networks
Maksim Levental
31
4
0
23 Feb 2022
Korean Tokenization for Beam Search Rescoring in Speech Recognition
Korean Tokenization for Beam Search Rescoring in Speech Recognition
Kyuhong Shim
Hyewon Bae
Wonyong Sung
24
0
0
22 Feb 2022
HRel: Filter Pruning based on High Relevance between Activation Maps and
  Class Labels
HRel: Filter Pruning based on High Relevance between Activation Maps and Class Labels
C. Sarvani
Mrinmoy Ghorai
S. Dubey
S. H. Shabbeer Basha
VLM
39
37
0
22 Feb 2022
Adversarial Attacks on Speech Recognition Systems for Mission-Critical
  Applications: A Survey
Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Imran Razzak
Kevin Lee
Chetan Arora
Ali Hassani
A. Zaslavsky
AAML
34
6
0
22 Feb 2022
Spanish and English Phoneme Recognition by Training on Simulated
  Classroom Audio Recordings of Collaborative Learning Environments
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
24
0
0
21 Feb 2022
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning
  Preprocessing Pipelines
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines
Alexander Isenko
R. Mayer
Jeffrey Jedele
Hans-Arno Jacobsen
19
23
0
17 Feb 2022
Multi-style Training for South African Call Centre Audio
Multi-style Training for South African Call Centre Audio
Walter Heymans
Marelie Hattingh Davel
C. van Heerden
11
3
0
15 Feb 2022
DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting
  the Power Grid's Post-Fault Trajectories
DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting the Power Grid's Post-Fault Trajectories
Christian Moya
Shiqi Zhang
Meng Yue
Guang Lin
22
42
0
15 Feb 2022
Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme
Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme
Franyell Silfa
J. Arnau
Antonio González
27
1
0
14 Feb 2022
Compute Trends Across Three Eras of Machine Learning
Compute Trends Across Three Eras of Machine Learning
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
39
269
0
11 Feb 2022
FAAG: Fast Adversarial Audio Generation through Interactive Attack
  Optimisation
FAAG: Fast Adversarial Audio Generation through Interactive Attack Optimisation
Yuantian Miao
Chao Chen
Lei Pan
Jun Zhang
Yang Xiang
AAML
20
2
0
11 Feb 2022
ASRPU: A Programmable Accelerator for Low-Power Automatic Speech
  Recognition
ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
D. Pinto
J. Arnau
Antonio González
33
0
0
10 Feb 2022
Conversational Agents: Theory and Applications
Conversational Agents: Theory and Applications
M. Wahde
M. Virgolin
LLMAG
26
24
0
07 Feb 2022
Towards Training Reproducible Deep Learning Models
Towards Training Reproducible Deep Learning Models
Boyuan Chen
Mingzhi Wen
Yong Shi
Dayi Lin
Gopi Krishnan Rajbahadur
Zhen Ming
Z. Jiang
SyDa
17
37
0
04 Feb 2022
Polyphonic pitch detection with convolutional recurrent neural networks
Polyphonic pitch detection with convolutional recurrent neural networks
Carl Thomé
Sven Ahlback
25
8
0
04 Feb 2022
Learning strides in convolutional neural networks
Learning strides in convolutional neural networks
Rachid Riad
O. Teboul
David Grangier
Neil Zeghidour
36
41
0
03 Feb 2022
Joint Speech Recognition and Audio Captioning
Joint Speech Recognition and Audio Captioning
Chaitanya Narisetty
E. Tsunoo
Xuankai Chang
Yosuke Kashiwagi
Michael Hentschel
Shinji Watanabe
21
10
0
03 Feb 2022
Imperceptible and Multi-channel Backdoor Attack against Deep Neural
  Networks
Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks
Mingfu Xue
S. Ni
Ying-Chang Wu
Yushu Zhang
Jian Wang
Weiqiang Liu
AAML
32
13
0
31 Jan 2022
The Norwegian Parliamentary Speech Corpus
The Norwegian Parliamentary Speech Corpus
Per Erik Solberg
Pablo Ortiz
6
13
0
26 Jan 2022
Internal Language Model Estimation Through Explicit Context Vector
  Learning for Attention-based Encoder-decoder ASR
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
Yufei Liu
Rao Ma
Haihua Xu
Yi He
Zejun Ma
Weibin Zhang
28
12
0
26 Jan 2022
Improved Mispronunciation detection system using a hybrid CTC-ATT based
  approach for L2 English speakers
Improved Mispronunciation detection system using a hybrid CTC-ATT based approach for L2 English speakers
Neha Baranwal
Sharatkumar Chilaka
22
2
0
25 Jan 2022
A Noise-Robust Self-supervised Pre-training Model Based Speech
  Representation Learning for Automatic Speech Recognition
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition
Qiu-shi Zhu
Jie Zhang
Zi-qiang Zhang
Ming Wu
Xin Fang
Lirong Dai
123
40
0
22 Jan 2022
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
Xian Liu
Yinghao Xu
Qianyi Wu
Hang Zhou
Wayne Wu
Bolei Zhou
VGen
DiffM
3DH
45
140
0
19 Jan 2022
Transferability in Deep Learning: A Survey
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
34
101
0
15 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
39
90
0
05 Jan 2022
Discrete and continuous representations and processing in deep learning:
  Looking forward
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
33
20
0
04 Jan 2022
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face
  Attributes Neural Rendering
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering
Shunyu Yao
Ruizhe Zhong
Yichao Yan
Guangtao Zhai
Xiaokang Yang
CVBM
29
90
0
03 Jan 2022
Making AI 'Smart': Bridging AI and Cognitive Science
Making AI 'Smart': Bridging AI and Cognitive Science
Madhav Agarwal
Siddhant Bansal
31
0
0
31 Dec 2021
Towards Relatable Explainable AI with the Perceptual Process
Towards Relatable Explainable AI with the Perceptual Process
Wencan Zhang
Brian Y. Lim
AAML
XAI
25
62
0
28 Dec 2021
Multi-Dialect Arabic Speech Recognition
Multi-Dialect Arabic Speech Recognition
Abbas Raza Ali
22
15
0
25 Dec 2021
Multi-Variant Consistency based Self-supervised Learning for Robust
  Automatic Speech Recognition
Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
25
4
0
23 Dec 2021
A Comprehensive Analytical Survey on Unsupervised and Semi-Supervised
  Graph Representation Learning Methods
A Comprehensive Analytical Survey on Unsupervised and Semi-Supervised Graph Representation Learning Methods
Md. Khaledur Rahman
A. Azad
AI4TS
27
3
0
20 Dec 2021
Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated
  Label Mixing
Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing
Joonhyung Park
J. Yang
Jinwoo Shin
Sung Ju Hwang
Eunho Yang
33
23
0
16 Dec 2021
On the Use of External Data for Spoken Named Entity Recognition
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
40
16
0
14 Dec 2021
Real-Time Neural Voice Camouflage
Real-Time Neural Voice Camouflage
Mia Chiquier
Chengzhi Mao
Carl Vondrick
27
6
0
14 Dec 2021
Perceptual Loss with Recognition Model for Single-Channel Enhancement
  and Robust ASR
Perceptual Loss with Recognition Model for Single-Channel Enhancement and Robust ASR
Peter William VanHarn Plantinga
Deblin Bagchi
Eric Fosler-Lussier
46
10
0
11 Dec 2021
Are E2E ASR models ready for an industrial usage?
Are E2E ASR models ready for an industrial usage?
Valentin Vielzeuf
G. Antipov
26
8
0
09 Dec 2021
FastSGD: A Fast Compressed SGD Framework for Distributed Machine
  Learning
FastSGD: A Fast Compressed SGD Framework for Distributed Machine Learning
Keyu Yang
Lu Chen
Zhihao Zeng
Yunjun Gao
23
9
0
08 Dec 2021
A Transferable Approach for Partitioning Machine Learning Models on
  Multi-Chip-Modules
A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Xinfeng Xie
Prakash Prabhu
Ulysse Beaugnon
P. Phothilimthana
Sudip Roy
Azalia Mirhoseini
E. Brevdo
James Laudon
Yanqi Zhou
30
5
0
07 Dec 2021
Training end-to-end speech-to-text models on mobile phones
Training end-to-end speech-to-text models on mobile phones
S. Zitha
Raghavendra Rao Suresh
Pooja S B. Rao
T. V. Prabhakar
19
1
0
07 Dec 2021
On Large Batch Training and Sharp Minima: A Fokker-Planck Perspective
On Large Batch Training and Sharp Minima: A Fokker-Planck Perspective
Xiaowu Dai
Yuhua Zhu
27
4
0
02 Dec 2021
Automated Speech Scoring System Under The Lens: Evaluating and
  interpreting the linguistic cues for language proficiency
Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency
P. Bamdev
Manraj Singh Grover
Yaman Kumar Singla
Payman Vafaee
Mika Hama
R. Shah
26
12
0
30 Nov 2021
Factorized Fourier Neural Operators
Factorized Fourier Neural Operators
Alasdair Tran
A. Mathews
Lexing Xie
Cheng Soon Ong
AI4CE
34
142
0
27 Nov 2021
Romanian Speech Recognition Experiments from the ROBIN Project
Romanian Speech Recognition Experiments from the ROBIN Project
Andrei-Marius Avram
Vasile Puaics
Dan Tufics
16
4
0
23 Nov 2021
Human-Machine Interaction Speech Corpus from the ROBIN project
Human-Machine Interaction Speech Corpus from the ROBIN project
V. Pais
Radu Ion
Andrei-Marius Avram
Elena Irimia
V. Mititelu
Maria Mitrofan
17
6
0
22 Nov 2021
Denoised Internal Models: a Brain-Inspired Autoencoder against
  Adversarial Attacks
Denoised Internal Models: a Brain-Inspired Autoencoder against Adversarial Attacks
Kaiyuan Liu
Xingyu Li
Yu-Rui Lai
Hong Xie
Hang Su
Jiacheng Wang
Chunxu Guo
J. Guan
Yi Zhou
AAML
31
3
0
21 Nov 2021
The People's Speech: A Large-Scale Diverse English Speech Recognition
  Dataset for Commercial Usage
The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage
Daniel Galvez
G. Diamos
Juan Ciro
Juan Felipe Cerón
Keith Achorn
Anjali Gopi
David Kanter
Maximilian Lam
Mark Mazumder
Vijay Janapa Reddi
22
95
0
17 Nov 2021
Previous
123...567...171819
Next