ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.00059
  4. Cited By
The CAPIO 2017 Conversational Speech Recognition System
v1v2 (latest)

The CAPIO 2017 Conversational Speech Recognition System

29 December 2017
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
ArXiv (abs)PDFHTML

Papers citing "The CAPIO 2017 Conversational Speech Recognition System"

35 / 35 papers shown
Title
GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images
GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images
Florian Zager
Hamza A. A. Gardi
180
0
0
15 Sep 2025
On the limit of English conversational speech recognition
On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
171
53
0
03 May 2021
LT-LM: a novel non-autoregressive language model for single-shot lattice
  rescoring
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoringInterspeech (Interspeech), 2021
Anton Mitrofanov
Mariya Korenevskaya
Ivan Podluzhny
Yuri Y. Khokhlov
A. Laptev
A. Andrusenko
A. Ilin
M. Korenevsky
Ivan Medennikov
A. Romanenko
KELMLRM
106
2
0
06 Apr 2021
The Use of Voice Source Features for Sung Speech Recognition
The Use of Voice Source Features for Sung Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Gerardo Roa Dabike
Jon Barker
79
2
0
20 Feb 2021
Context-aware RNNLM Rescoring for Conversational Speech Recognition
Context-aware RNNLM Rescoring for Conversational Speech RecognitionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2020
Kun Wei
Pengcheng Guo
Hang Lv
Zhen Tu
Lei Xie
130
5
0
18 Nov 2020
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition
Phoneme Based Neural Transducer for Large Vocabulary Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Zhou
Simon Berger
Ralf Schluter
Hermann Ney
312
34
0
30 Oct 2020
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
R. Collobert
Gabriel Synnaeve
355
105
0
22 Oct 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech
  Recognition
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan
Joshua Shapiro
Jeremy Wohlwend
Kyu Jeong Han
Tao Lei
T. Ma
138
23
0
21 May 2020
Relative Positional Encoding for Speech Recognition and Direct
  Translation
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan Pham
Thanh-Le Ha
Tuan-Nam Nguyen
T. Nguyen
Elizabeth Salesky
S. Stueker
Jan Niehues
A. Waibel
137
41
0
20 May 2020
LaNet: Real-time Lane Identification by Learning Road
  SurfaceCharacteristics from Accelerometer Data
LaNet: Real-time Lane Identification by Learning Road SurfaceCharacteristics from Accelerometer Data
M. Harishankar
Jun Han
S. Srinivas
Faisal Alqarni
Shih-Yang Su
Shijia Pan
Hae Young Noh
Pei Zhang
Marco Gruteser
P. Tague
76
2
0
06 Apr 2020
The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with
  SpecAugment
The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugmentIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Zhou
Wilfried Michel
Kazuki Irie
M. Kitza
Ralf Schluter
Hermann Ney
118
43
0
02 Apr 2020
Single headed attention based sequence-to-sequence model for
  state-of-the-art results on Switchboard
Single headed attention based sequence-to-sequence model for state-of-the-art results on SwitchboardInterspeech (Interspeech), 2020
Zoltán Tüske
G. Saon
Kartik Audhkhasi
Brian Kingsbury
BDL
172
70
0
20 Jan 2020
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder
  Models
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder ModelsAutomatic Speech Recognition & Understanding (ASRU), 2019
Abhinav Garg
Dhananjaya N. Gowda
Ankur Kumar
Kwangyoun Kim
Mehul Kumar
Chanwoo Kim
3DV
92
15
0
28 Dec 2019
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern
  Architectures
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures
Gabriel Synnaeve
Qiantong Xu
Jacob Kahn
Tatiana Likhomanenko
Edouard Grave
Vineel Pratap
Anuroop Sriram
Vitaliy Liptchinsky
R. Collobert
SSLAI4TS
382
260
0
19 Nov 2019
Improving sequence-to-sequence speech recognition training with
  on-the-fly data augmentation
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
T. Nguyen
S. Stueker
Jan Niehues
A. Waibel
207
103
0
29 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid
  Speech Recognition
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019
Duc Le
Xiaohui Zhang
Weiyi Zheng
C. Fügen
Geoffrey Zweig
M. Seltzer
163
64
0
02 Oct 2019
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention
  With Dilated 1D Convolutions
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D ConvolutionsAutomatic Speech Recognition & Understanding (ASRU), 2019
Kyu Jeong Han
R. Prieto
Kaixing(Kai) Wu
T. Ma
249
76
0
01 Oct 2019
Towards Better Understanding of Spontaneous Conversations: Overcoming
  Automatic Speech Recognition Errors With Intent Recognition
Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition
Piotr Żelasko
Jan Mizgajski
Mikolaj Morzy
Adrian Szymczak
Piotr Szymañski
Lukasz Augustyniak
Yishay Carmiel
198
0
0
21 Aug 2019
IMS-Speech: A Speech to Text Tool
IMS-Speech: A Speech to Text Tool
Pavel Denisov
Ngoc Thang Vu
120
11
0
13 Aug 2019
LSTM Language Models for LVCSR in First-Pass Decoding and
  Lattice-Rescoring
LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring
Eugen Beck
Wei Zhou
Ralf Schluter
Hermann Ney
151
34
0
01 Jul 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech
  Recognition of Sparsely Overlapping Speech
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping SpeechInterspeech (Interspeech), 2019
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
303
38
0
09 May 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data
  Augmentation
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data AugmentationInterspeech (Interspeech), 2019
Christoph Luscher
Eugen Beck
Kazuki Irie
M. Kitza
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
374
238
0
08 May 2019
English Broadcast News Speech Recognition by Humans and Machines
English Broadcast News Speech Recognition by Humans and MachinesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Samuel Thomas
Masayuki Suzuki
Yinghui Huang
Gakuto Kurata
Zoltán Tüske
...
Brian Kingsbury
M. Picheny
Tom Dibert
Alice Kaiser-Schatzlein
Bern Samko
133
15
0
30 Apr 2019
Transformers with convolutional context for ASR
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
176
172
0
26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
438
3,798
0
18 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
216
27
0
09 Apr 2019
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Chun Lok Li
Vitaly Lavrukhin
Boris Ginsburg
Ryan Leary
Oleksii Kuchaiev
Jonathan M. Cohen
Huyen Nguyen
R. Gadde
DRLVLMAuLLM
199
276
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
147
104
0
04 Apr 2019
Improved Knowledge Distillation via Teacher Assistant
Improved Knowledge Distillation via Teacher AssistantAAAI Conference on Artificial Intelligence (AAAI), 2019
Seyed Iman Mirzadeh
Mehrdad Farajtabar
Ang Li
Nir Levine
Akihiro Matsukawa
H. Ghasemzadeh
331
1,246
0
09 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech
  Recognition
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
204
37
0
05 Feb 2019
Fully Convolutional Speech Recognition
Fully Convolutional Speech Recognition
Neil Zeghidour
Qiantong Xu
Vitaliy Liptchinsky
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
176
95
0
17 Dec 2018
The Marchex 2018 English Conversational Telephone Speech Recognition
  System
The Marchex 2018 English Conversational Telephone Speech Recognition System
Xiaofeng Liu
Zhenhua Guo
J. You
B. Kumar
159
1
0
05 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech
  Augmentation
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Jason Chun Lok Li
R. Gadde
Boris Ginsburg
Vitaly Lavrukhin
137
58
0
02 Nov 2018
Open Source Automatic Speech Recognition for German
Open Source Automatic Speech Recognition for German
Benjamin Milde
Arne Köhn
VLM
159
40
0
26 Jul 2018
End-to-End Speech Recognition From the Raw Waveform
End-to-End Speech Recognition From the Raw Waveform
Neil Zeghidour
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
Emmanuel Dupoux
218
84
0
19 Jun 2018
1