Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.00895
Cited By
v1
v2 (latest)
RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks
2 August 2016
P. Doetsch
Albert Zeyer
P. Voigtlaender
Ilya Kulikov
Ralf Schluter
Hermann Ney
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks"
33 / 33 papers shown
A Comparative Analysis on ASR System Combination for Attention, CTC, Factored Hybrid, and Transducer Models
Noureldin Bayoumi
Robin Schmitt
Tina Raissi
Albert Zeyer
Ralf Schluter
Hermann Ney
184
0
0
13 Aug 2025
Analysis of Domain Shift across ASR Architectures via TTS-Enabled Separation of Target Domain and Acoustic Conditions
Tina Raissi
Nick Rossenbach
Ralf Schluter
159
1
0
13 Aug 2025
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation
Benedikt Hilmes
Nick Rossenbach
Ralf Schluter
307
0
0
02 Jun 2025
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Khai-Nguyen Nguyen
Phuc Phan
Tan-Hanh Pham
Bach Phan Tat
Minh-Huong Ngo
Chris Ngo
Thanh Nguyen-Tang
Truong-Son Hy
LM&MA
413
9
0
21 Sep 2024
Investigating the Effect of Label Topology and Training Criterion on ASR Performance and Alignment Quality
Tina Raissi
Christoph Luscher
Simon Berger
Ralf Schluter
Hermann Ney
259
3
0
16 Jul 2024
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Nick Rossenbach
Benedikt Hilmes
Ralf Schluter
264
5
0
12 Oct 2023
End-to-End Training of a Neural HMM with Label and Transition Probabilities
Automatic Speech Recognition & Understanding (ASRU), 2023
Daniel Mann
Tina Raissi
Wilfried Michel
Ralf Schluter
Hermann Ney
BDL
290
2
0
04 Oct 2023
Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project
Khai-Nguyen Nguyen
265
2
0
26 Sep 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
362
276
0
03 Mar 2023
Efficient Utilization of Large Pre-Trained Models for Low Resource ASR
Peter Vieting
Christoph Luscher
Julian Dierkes
Ralf Schluter
Hermann Ney
340
7
0
26 Oct 2022
Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech
Christoph Luscher
Mohammad Zeineldeen
Zijian Yang
Tina Raissi
Peter Vieting
Khai-Nguyen Nguyen
Weiyue Wang
Ralf Schluter
Hermann Ney
365
9
0
24 Oct 2022
AppTek's Submission to the IWSLT 2022 Isometric Spoken Language Translation Task
International Workshop on Spoken Language Translation (IWSLT), 2022
P. Wilken
E. Matusov
168
6
0
12 May 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
570
448
0
02 Nov 2021
Automatic Learning of Subword Dependent Model Scales
Felix Meyer
Wilfried Michel
Mohammad Zeineldeen
Ralf Schluter
Hermann Ney
74
0
0
18 Oct 2021
Differentiable Allophone Graphs for Language-Universal Speech Recognition
Interspeech (Interspeech), 2021
Brian Yan
Siddharth Dalmia
David R. Mortensen
Florian Metze
Shinji Watanabe
275
13
0
24 Jul 2021
Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept
Interspeech (Interspeech), 2021
Wei Zhou
Albert Zeyer
André Merboldt
Ralf Schluter
Hermann Ney
259
6
0
13 Apr 2021
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures
Automatic Speech Recognition & Understanding (ASRU), 2021
Nick Rossenbach
Mohammad Zeineldeen
Benedikt Hilmes
Ralf Schluter
Hermann Ney
245
12
0
12 Apr 2021
Early Stage LM Integration Using Local and Global Log-Linear Combination
Wilfried Michel
Ralf Schluter
Hermann Ney
231
12
0
20 May 2020
The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Zhou
Wilfried Michel
Kazuki Irie
M. Kitza
Ralf Schluter
Hermann Ney
218
45
0
02 Apr 2020
Attention based on-device streaming speech recognition with large speech corpus
Automatic Speech Recognition & Understanding (ASRU), 2019
Kwangyoun Kim
Kyungmin Lee
Dhananjaya N. Gowda
Junmo Park
Sungsoo Kim
...
Daehyun Kim
Seokyeong Jung
Jungin Lee
Myoungji Han
Chanwoo Kim
216
61
0
02 Jan 2020
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models
Automatic Speech Recognition & Understanding (ASRU), 2019
Abhinav Garg
Dhananjaya N. Gowda
Ankur Kumar
Kwangyoun Kim
Mehul Kumar
Chanwoo Kim
3DV
149
15
0
28 Dec 2019
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition
Automatic Speech Recognition & Understanding (ASRU), 2019
Chanwoo Kim
Mehul Kumar
Kwangyoun Kim
Dhananjaya N. Gowda
188
9
0
22 Dec 2019
end-to-end training of a large vocabulary end-to-end speech recognition system
Automatic Speech Recognition & Understanding (ASRU), 2019
Chanwoo Kim
Sungsoo Kim
Kwangyoun Kim
Mehul Kumar
Jiyeon Kim
...
Eunhyang Kim
Minkyoo Shin
Shatrughan Singh
Larry Heck
Dhananjaya N. Gowda
202
27
0
22 Dec 2019
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Nick Rossenbach
Albert Zeyer
Ralf Schluter
Hermann Ney
306
91
0
19 Dec 2019
On Using SpecAugment for End-to-End Speech Translation
International Workshop on Spoken Language Translation (IWSLT), 2019
Parnia Bahar
Albert Zeyer
Ralf Schluter
Hermann Ney
244
56
0
20 Nov 2019
uniblock: Scoring and Filtering Corpus with Unicode Block Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Yingbo Gao
Weiyue Wang
Hermann Ney
193
1
0
26 Aug 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Interspeech (Interspeech), 2019
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
474
38
0
09 May 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation
Interspeech (Interspeech), 2019
Christoph Luscher
Eugen Beck
Kazuki Irie
M. Kitza
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
537
240
0
08 May 2019
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition
Albert Zeyer
Tamer Alkhouli
Hermann Ney
365
95
0
14 May 2018
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schluter
Hermann Ney
VLM
253
280
0
08 May 2018
A comprehensive study of batch construction strategies for recurrent neural networks in MXNet
P. Doetsch
Pavel Golik
Hermann Ney
162
17
0
05 May 2017
Learning to detect and localize many objects from few examples
Bastien Moysset
Christopher Kermorvant
Christian Wolf
ObjD
174
6
0
17 Nov 2016
A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2016
Albert Zeyer
P. Doetsch
P. Voigtlaender
Ralf Schluter
Hermann Ney
177
174
0
22 Jun 2016
1
Page 1 of 1