Papers citing 'Deep Speech 2: End-to-End Speech Recognition in English and Mandarin'

Title
Dynamic Network selection for the Object Detection task: why it matters and what we (didn't) achieveInternational Conference / Workshop on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS), 2021 Emanuele Vitali Anton Lokhmotov G. Palermo 67 1 0 27 May 2021
BackEISNN: A Deep Spiking Neural Network with Adaptive Self-Feedback and Balanced Excitatory-Inhibitory NeuronsNeural Networks (NN), 2021 Dongcheng Zhao Yi Zeng Yang Li 155 52 0 27 May 2021
DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications Yaoyu Zhang Wai Teng Tang Matthew Kay Fei Lee Chuping Qu Weng-Fai Wong Rick Siow Mong Goh 126 0 0 25 May 2021
Unsupervised Speech RecognitionNeural Information Processing Systems (NeurIPS), 2021 Alexei Baevski Wei-Ning Hsu Alexis Conneau Michael Auli SSL 349 292 0 24 May 2021
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey Xiaoyu Zhang Chao Chen Yi Xie Xiaofeng Chen Jun Zhang Yang Xiang FedML 94 7 0 13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition Khin Me Me Chit Laet Laet Lin 96 4 0 13 May 2021
Latency-Controlled Neural Architecture Search for Streaming Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021 Liqiang He Shulin Feng Jane Polak Scowcroft Dong Yu 143 0 0 08 May 2021
Relative stability toward diffeomorphisms indicates performance in deep netsNeural Information Processing Systems (NeurIPS), 2021 Leonardo Petrini Alessandro Favero Mario Geiger Matthieu Wyart OOD 243 15 0 06 May 2021
On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021 Zoltán Tüske G. Saon Brian Kingsbury 171 53 0 03 May 2021
End-to-End Speech Recognition from Federated Acoustic ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Yan Gao Titouan Parcollet Salah Zaiem Javier Fernandez-Marques Pedro Porto Buarque de Gusmão Daniel J. Beutel Nicholas D. Lane 175 46 0 29 Apr 2021
On Addressing Practical Challenges for RNN-TransducerAutomatic Speech Recognition & Understanding (ASRU), 2021 Rui Zhao Jian Xue Jinyu Li Wenning Wei Lei He Jiawei Liu 217 33 0 27 Apr 2021
Protecting gender and identity with disentangled speech representationsInterspeech (Interspeech), 2021 Dimitrios Stoidis Andrea Cavallaro 175 12 0 22 Apr 2021
Dual Head Adversarial TrainingIEEE International Joint Conference on Neural Network (IJCNN), 2021 Yujing Jiang Jiabo He S. Erfani James Bailey AAML 158 7 0 21 Apr 2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers Yusuke Kida Tatsuya Komatsu M. Togami 89 1 0 21 Apr 2021
Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks Šimon Mandlík Tomás Pevný 178 6 0 19 Apr 2021
BM-NAS: Bilevel Multimodal Neural Architecture SearchAAAI Conference on Artificial Intelligence (AAAI), 2021 Yihang Yin Siyu Huang Xiang Zhang 199 34 0 19 Apr 2021
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques Kaiqi Fu Jones Lin Dengfeng Ke Yanlu Xie Jinsong Zhang Binghuai Lin 150 46 0 17 Apr 2021
Efficient conformer-based speech recognition with linear attentionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021 Shengqiang Li Menglong Xu Xiao-Lei Zhang 186 26 0 14 Apr 2021
Phoneme-based Distribution Regularization for Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Yajing Liu Xiulian Peng Zhiwei Xiong Yan Lu 67 5 0 08 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition Zhichao Wang Wenwen Yang Pan Zhou Wei Chen RALM 114 18 0 08 Apr 2021
Pushing the Limits of Non-Autoregressive Speech RecognitionInterspeech (Interspeech), 2021 Edwin G. Ng Chung-Cheng Chiu Yu Zhang William Chan VLM 214 30 0 07 Apr 2021
GPU Domain Specialization via Composable On-Package ArchitectureACM Transactions on Architecture and Code Optimization (TACO) (TACO), 2021 Yaosheng Fu Evgeny Bolotin Niladrish Chatterjee D. Nellans S. Keckler 105 15 0 05 Apr 2021
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For LatencyInterspeech (Interspeech), 2021 Yangyang Shi Varun K. Nagaraja Chunyang Wu Jay Mahadeokar Duc Le ... Ching-Feng Yeh Julian Chan Christian Fuegen Ozlem Kalinli M. Seltzer 135 16 0 05 Apr 2021
Adaptive Clustering of Robust Semantic Representations for Adversarial Image Purification S. Silva Arun Das I. Scarff Peyman Najafirad AAML 140 1 0 05 Apr 2021
A Comparative Analysis of Machine Learning and Grey Models Gang He Khwaja Mutahir Ahmad Wenxin Yu Xiaochuan Xu J. Kumar SyDa AI4TS 131 0 0 02 Apr 2021
Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary MatricesInterspeech (Interspeech), 2021 Gonçalo Mordido Matthijs Van Keirsbilck A. Keller 171 6 0 31 Mar 2021
Adversarial Attacks and Defenses for Speech Recognition Systems Piotr Żelasko Sonal Joshi Yiwen Shao Jesus Villalba J. Trmal Najim Dehak Sanjeev Khudanpur AAML 115 36 0 31 Mar 2021
Shrinking Bigfoot: Reducing wav2vec 2.0 footprint Zilun Peng Akshay Budhkar Ilana Tuil J. Levy Parinaz Sobhani Raphael Cohen J. Nassour 158 35 0 29 Mar 2021
Construction of a Large-scale Japanese ASR Corpus on TV RecordingsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Shintaro Ando Hiromasa Fujihara 91 28 0 26 Mar 2021
HufuNet: Embedding the Left Piece as Watermark and Keeping the Right Piece for Ownership Verification in Deep Neural Networks Peizhuo Lv Pan Li Shengzhi Zhang Kai Chen Ruigang Liang Yue Zhao Yingjiu Li AAML 124 8 0 25 Mar 2021
Federated Quantum Machine LearningEntropy (Entropy), 2021 Samuel Yen-Chi Chen Shinjae Yoo FedML AI4CE 177 154 0 22 Mar 2021
SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition SystemsACM Transactions on Privacy and Security (ACM TOPS), 2021 Yuxuan Chen Jiangshan Zhang Xuejing Yuan Shengzhi Zhang Kai Chen Luyi Xing Shanqing Guo AAML 238 19 0 19 Mar 2021
Modeling the Second Player in Distributionally Robust OptimizationInternational Conference on Learning Representations (ICLR), 2021 Paul Michel Tatsunori Hashimoto Graham Neubig 203 36 0 18 Mar 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge DistillationInterspeech (Interspeech), 2021 Md. Akmal Haidar Chao Xing Mehdi Rezagholizadeh 166 6 0 17 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo Bonaventure F. P. Dossou Chris C. Emezue 122 18 0 13 Mar 2021
Learning spectro-temporal representations of complex sounds with parameterized neural networksJournal of the Acoustical Society of America (JASA), 2021 Rachid Riad Julien Karadayi Anne-Catherine Bachoud-Lévi Emmanuel Dupoux 108 8 0 12 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion RecognitionIEEE Transactions on Affective Computing (TAC), 2021 Maurice Gerczuk Shahin Amiriparian Sandra Ottl Björn Schuller 146 70 0 10 Mar 2021
Deep Learning for Android Malware Defenses: a Systematic Literature ReviewACM Computing Surveys (CSUR), 2021 Yue Liu Chakkrit Tantithamthavorn Li Li Yepang Liu AAML 230 99 0 09 Mar 2021
Consistency Regularization for Adversarial RobustnessAAAI Conference on Artificial Intelligence (AAAI), 2021 Jihoon Tack Sihyun Yu Jongheon Jeong Minseon Kim Sung Ju Hwang Jinwoo Shin AAML 256 69 0 08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial ExamplesUSENIX Security Symposium (USENIX Security), 2021 Shehzeen Samarah Hussain Paarth Neekhara Shlomo Dubnov Julian McAuley F. Koushanfar AAML 138 82 0 04 Mar 2021
Explaining Adversarial Vulnerability with a Data Sparsity HypothesisNeurocomputing (Neurocomputing), 2021 Mahsa Paknezhad Cuong Phuc Ngo Amadeus Aristo Winarto Alistair Cheong Beh Chuen Yang Wu Jiayang Lee Hwee Kuan OOD AAML 216 10 0 01 Mar 2021
Experiments with Rich Regime Training for Deep Learning Xinyan Li A. Banerjee 133 2 0 26 Feb 2021
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech Juliette Millet J. King 255 35 0 25 Feb 2021
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in FramesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Takuhiro Kaneko Hirokazu Kameoka Kou Tanaka Nobukatsu Hojo 121 70 0 25 Feb 2021
Unidirectional Memory-Self-Attention Transducer for Online Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Jian Luo Jianzong Wang Ning Cheng Jing Xiao RALM 102 6 0 23 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical StudyComputer Speech and Language (CSL), 2021 Prashanth Gurunath Shivakumar Shrikanth Narayanan 141 63 0 19 Feb 2021
One Shot Audio to Animated Video Generation Neeraj Kumar Srishti Goel Ankur Narang Brejesh Lall H. Mujtaba Pranshu Agarwal D. Sarkar VGen 84 1 0 19 Feb 2021
Do End-to-End Speech Recognition Models Care About Context?Interspeech (Interspeech), 2020 Lasse Borgholt Jakob Drachmann Havtorn Zeljko Agic Anders Søgaard Lars Maaløe Christian Igel 102 8 0 17 Feb 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systemsApplied Soft Computing (Appl Soft Comput), 2021 Yi Lin Bo Yang Linchao Li Dongyue Guo Jianwei Zhang Hu Chen Yi Zhang 139 32 0 17 Feb 2021
Improving speech recognition models with small samples for air traffic control systemsNeurocomputing (Neurocomputing), 2021 Yi Lin Qin Li Bo Yang Zhen Yan Huachun Tan Zhengmao Chen 166 33 0 16 Feb 2021

Dynamic Network selection for the Object Detection task: why it matters and what we (didn't) achieveInternational Conference / Workshop on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS), 2021

Emanuele Vitali

Anton Lokhmotov

G. Palermo

67

1

0

27 May 2021

BackEISNN: A Deep Spiking Neural Network with Adaptive Self-Feedback and Balanced Excitatory-Inhibitory NeuronsNeural Networks (NN), 2021

Dongcheng Zhao

Yi Zeng

Yang Li

155

52

0

27 May 2021

DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications

126

0

25 May 2021

Unsupervised Speech RecognitionNeural Information Processing Systems (NeurIPS), 2021

349

292

0

24 May 2021

Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey

94

7

0

13 May 2021

Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition

Khin Me Me Chit

Laet Laet Lin

96

4

0

13 May 2021

Latency-Controlled Neural Architecture Search for Streaming Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021

143

0

08 May 2021

Relative stability toward diffeomorphisms indicates performance in deep netsNeural Information Processing Systems (NeurIPS), 2021

243

15

0

06 May 2021

On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021

Zoltán Tüske

G. Saon

Brian Kingsbury

171

53

0

03 May 2021

End-to-End Speech Recognition from Federated Acoustic ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Yan Gao

Titouan Parcollet

Salah Zaiem

Javier Fernandez-Marques

Pedro Porto Buarque de Gusmão

Daniel J. Beutel

Nicholas D. Lane

175

46

0

29 Apr 2021

On Addressing Practical Challenges for RNN-TransducerAutomatic Speech Recognition & Understanding (ASRU), 2021

217

33

0

27 Apr 2021

Protecting gender and identity with disentangled speech representationsInterspeech (Interspeech), 2021

Dimitrios Stoidis

Andrea Cavallaro

175

12

0

22 Apr 2021

Dual Head Adversarial TrainingIEEE International Joint Conference on Neural Network (IJCNN), 2021

158

7

0

21 Apr 2021

Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers

Yusuke Kida

Tatsuya Komatsu

M. Togami

89

1

0

21 Apr 2021

Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks

Šimon Mandlík

Tomás Pevný

178

6

0

19 Apr 2021

BM-NAS: Bilevel Multimodal Neural Architecture SearchAAAI Conference on Artificial Intelligence (AAAI), 2021

Yihang Yin

Siyu Huang

Xiang Zhang

199

34

0

19 Apr 2021

A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques

150

46

0

17 Apr 2021

Efficient conformer-based speech recognition with linear attentionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021

Shengqiang Li

Menglong Xu

Xiao-Lei Zhang

186

26

0

14 Apr 2021