1801.00059
The CAPIO 2017 Conversational Speech Recognition System
29 December 2017
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
Papers citing
"The CAPIO 2017 Conversational Speech Recognition System"
35 / 35 papers shown
Title
GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images
Florian Zager
Hamza A. A. Gardi
180
0
0
15 Sep 2025
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
171
53
0
03 May 2021
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Interspeech (Interspeech), 2021
Anton Mitrofanov
Mariya Korenevskaya
Ivan Podluzhny
Yuri Y. Khokhlov
A. Laptev
A. Andrusenko
A. Ilin
M. Korenevsky
Ivan Medennikov
A. Romanenko
KELM
LRM
106
2
0
06 Apr 2021
The Use of Voice Source Features for Sung Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Gerardo Roa Dabike
Jon Barker
79
2
0
20 Feb 2021
Context-aware RNNLM Rescoring for Conversational Speech Recognition
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2020
Kun Wei
Pengcheng Guo
Hang Lv
Zhen Tu
Lei Xie
130
5
0
18 Nov 2020
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Zhou
Simon Berger
Ralf Schluter
Hermann Ney
312
34
0
30 Oct 2020
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
R. Collobert
Gabriel Synnaeve
355
105
0
22 Oct 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan
Joshua Shapiro
Jeremy Wohlwend
Kyu Jeong Han
Tao Lei
T. Ma
138
23
0
21 May 2020
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan Pham
Thanh-Le Ha
Tuan-Nam Nguyen
T. Nguyen
Elizabeth Salesky
S. Stueker
Jan Niehues
A. Waibel
137
41
0
20 May 2020
LaNet: Real-time Lane Identification by Learning Road Surface Characteristics from Accelerometer Data
M. Harishankar
Jun Han
S. Srinivas
Faisal Alqarni
Shih-Yang Su
Shijia Pan
Hae Young Noh
Pei Zhang
Marco Gruteser
P. Tague
76
2
0
06 Apr 2020
The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Zhou
Wilfried Michel
Kazuki Irie
M. Kitza
Ralf Schluter
Hermann Ney
118
43
0
02 Apr 2020
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard
Interspeech (Interspeech), 2020
Zoltán Tüske
G. Saon
Kartik Audhkhasi
Brian Kingsbury
BDL
172
70
0
20 Jan 2020
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models
Automatic Speech Recognition & Understanding (ASRU), 2019
Abhinav Garg
Dhananjaya N. Gowda
Ankur Kumar
Kwangyoun Kim
Mehul Kumar
Chanwoo Kim
3DV
92
15
0
28 Dec 2019
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures
Gabriel Synnaeve
Qiantong Xu
Jacob Kahn
Tatiana Likhomanenko
Edouard Grave
Vineel Pratap
Anuroop Sriram
Vitaliy Liptchinsky
R. Collobert
SSL
AI4TS
382
260
0
19 Nov 2019
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
T. Nguyen
S. Stueker
Jan Niehues
A. Waibel
207
103
0
29 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2019
Duc Le
Xiaohui Zhang
Weiyi Zheng
C. Fügen
Geoffrey Zweig
M. Seltzer
163
64
0
02 Oct 2019
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Automatic Speech Recognition & Understanding (ASRU), 2019
Kyu Jeong Han
R. Prieto
Kaixing (Kai) Wu
T. Ma
249
76
0
01 Oct 2019
Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition
Piotr Żelasko
Jan Mizgajski
Mikolaj Morzy
Adrian Szymczak
Piotr Szymański
Lukasz Augustyniak
Yishay Carmiel
198
0
0
21 Aug 2019
IMS-Speech: A Speech to Text Tool
Pavel Denisov
Ngoc Thang Vu
120
11
0
13 Aug 2019
LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring
Eugen Beck
Wei Zhou
Ralf Schluter
Hermann Ney
151
34
0
01 Jul 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Interspeech (Interspeech), 2019
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
303
38
0
09 May 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation
Interspeech (Interspeech), 2019
Christoph Luscher
Eugen Beck
Kazuki Irie
M. Kitza
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
374
238
0
08 May 2019
English Broadcast News Speech Recognition by Humans and Machines
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Samuel Thomas
Masayuki Suzuki
Yinghui Huang
Gakuto Kurata
Zoltán Tüske
...
Brian Kingsbury
M. Picheny
Tom Dibert
Alice Kaiser-Schatzlein
Bern Samko
133
15
0
30 Apr 2019
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
176
172
0
26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
438
3,798
0
18 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
216
27
0
09 Apr 2019
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Chun Lok Li
Vitaly Lavrukhin
Boris Ginsburg
Ryan Leary
Oleksii Kuchaiev
Jonathan M. Cohen
Huyen Nguyen
R. Gadde
DRL
VLM
AuLLM
199
276
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
147
104
0
04 Apr 2019
Improved Knowledge Distillation via Teacher Assistant
AAAI Conference on Artificial Intelligence (AAAI), 2019
Seyed Iman Mirzadeh
Mehrdad Farajtabar
Ang Li
Nir Levine
Akihiro Matsukawa
H. Ghasemzadeh
331
1,246
0
09 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
204
37
0
05 Feb 2019
Fully Convolutional Speech Recognition
Neil Zeghidour
Qiantong Xu
Vitaliy Liptchinsky
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
176
95
0
17 Dec 2018
The Marchex 2018 English Conversational Telephone Speech Recognition System
Xiaofeng Liu
Zhenhua Guo
J. You
B. Kumar
159
1
0
05 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Jason Chun Lok Li
R. Gadde
Boris Ginsburg
Vitaly Lavrukhin
137
58
0
02 Nov 2018
Open Source Automatic Speech Recognition for German
Benjamin Milde
Arne Köhn
VLM
159
40
0
26 Jul 2018
End-to-End Speech Recognition From the Raw Waveform
Neil Zeghidour
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
Emmanuel Dupoux
218
84
0
19 Jun 2018