v1v2 (latest)

The CAPIO 2017 Conversational Speech Recognition System

29 December 2017

Kyu Jeong Han

Akshay Chandrashekaran

Jungsuk Kim

Ian Lane

ArXiv (abs)PDF HTML

Papers citing "The CAPIO 2017 Conversational Speech Recognition System"

35 / 35 papers shown

Title
GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images Florian Zager Hamza A. A. Gardi 180 0 0 15 Sep 2025
On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021 Zoltán Tüske G. Saon Brian Kingsbury 171 53 0 03 May 2021
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoringInterspeech (Interspeech), 2021 Anton Mitrofanov Mariya Korenevskaya Ivan Podluzhny Yuri Y. Khokhlov A. Laptev A. Andrusenko A. Ilin M. Korenevsky Ivan Medennikov A. Romanenko KELM LRM 106 2 0 06 Apr 2021
The Use of Voice Source Features for Sung Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Gerardo Roa Dabike Jon Barker 79 2 0 20 Feb 2021
Context-aware RNNLM Rescoring for Conversational Speech RecognitionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2020 Kun Wei Pengcheng Guo Hang Lv Zhen Tu Lei Xie 130 5 0 18 Nov 2020
Phoneme Based Neural Transducer for Large Vocabulary Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 Wei Zhou Simon Berger Ralf Schluter Hermann Ney 312 34 0 30 Oct 2020
Rethinking Evaluation in ASR: Are Our Models Robust Enough? Tatiana Likhomanenko Qiantong Xu Vineel Pratap Paden Tomasello Jacob Kahn Gilad Avidov R. Collobert Gabriel Synnaeve 355 105 0 22 Oct 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition Jing Pan Joshua Shapiro Jeremy Wohlwend Kyu Jeong Han Tao Lei T. Ma 138 23 0 21 May 2020
Relative Positional Encoding for Speech Recognition and Direct Translation Ngoc-Quan Pham Thanh-Le Ha Tuan-Nam Nguyen T. Nguyen Elizabeth Salesky S. Stueker Jan Niehues A. Waibel 137 41 0 20 May 2020
LaNet: Real-time Lane Identification by Learning Road SurfaceCharacteristics from Accelerometer Data M. Harishankar Jun Han S. Srinivas Faisal Alqarni Shih-Yang Su Shijia Pan Hae Young Noh Pei Zhang Marco Gruteser P. Tague 76 2 0 06 Apr 2020
The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugmentIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 Wei Zhou Wilfried Michel Kazuki Irie M. Kitza Ralf Schluter Hermann Ney 118 43 0 02 Apr 2020
Single headed attention based sequence-to-sequence model for state-of-the-art results on SwitchboardInterspeech (Interspeech), 2020 Zoltán Tüske G. Saon Kartik Audhkhasi Brian Kingsbury BDL 172 70 0 20 Jan 2020
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder ModelsAutomatic Speech Recognition & Understanding (ASRU), 2019 Abhinav Garg Dhananjaya N. Gowda Ankur Kumar Kwangyoun Kim Mehul Kumar Chanwoo Kim 3DV 92 15 0 28 Dec 2019
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures Gabriel Synnaeve Qiantong Xu Jacob Kahn Tatiana Likhomanenko Edouard Grave Vineel Pratap Anuroop Sriram Vitaliy Liptchinsky R. Collobert SSL AI4TS 382 260 0 19 Nov 2019
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 T. Nguyen S. Stueker Jan Niehues A. Waibel 207 103 0 29 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019 Duc Le Xiaohui Zhang Weiyi Zheng C. Fügen Geoffrey Zweig M. Seltzer 163 64 0 02 Oct 2019
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D ConvolutionsAutomatic Speech Recognition & Understanding (ASRU), 2019 Kyu Jeong Han R. Prieto Kaixing(Kai) Wu T. Ma 249 76 0 01 Oct 2019
Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition Piotr Żelasko Jan Mizgajski Mikolaj Morzy Adrian Szymczak Piotr Szymañski Lukasz Augustyniak Yishay Carmiel 198 0 0 21 Aug 2019
IMS-Speech: A Speech to Text Tool Pavel Denisov Ngoc Thang Vu 120 11 0 13 Aug 2019
LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring Eugen Beck Wei Zhou Ralf Schluter Hermann Ney 151 34 0 01 Jul 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping SpeechInterspeech (Interspeech), 2019 T. Menne Ilya Sklyar Ralf Schluter Hermann Ney 303 38 0 09 May 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data AugmentationInterspeech (Interspeech), 2019 Christoph Luscher Eugen Beck Kazuki Irie M. Kitza Wilfried Michel Albert Zeyer Ralf Schluter Hermann Ney VLM 374 238 0 08 May 2019
English Broadcast News Speech Recognition by Humans and MachinesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019 Samuel Thomas Masayuki Suzuki Yinghui Huang Gakuto Kurata Zoltán Tüske ... Brian Kingsbury M. Picheny Tom Dibert Alice Kaiser-Schatzlein Bern Samko 133 15 0 30 Apr 2019
Transformers with convolutional context for ASR Abdel-rahman Mohamed Dmytro Okhonko Luke Zettlemoyer 176 172 0 26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition Daniel S. Park William Chan Yu Zhang Chung-Cheng Chiu Barret Zoph E. D. Cubuk Quoc V. Le VLM 438 3,798 0 18 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition Tatiana Likhomanenko Gabriel Synnaeve R. Collobert 216 27 0 09 Apr 2019
Jasper: An End-to-End Convolutional Neural Acoustic Model Jason Chun Lok Li Vitaly Lavrukhin Boris Ginsburg Ryan Leary Oleksii Kuchaiev Jonathan M. Cohen Huyen Nguyen R. Gadde DRL VLM AuLLM 199 276 0 05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions Awni Y. Hannun Ann Lee Qiantong Xu R. Collobert 147 104 0 04 Apr 2019
Improved Knowledge Distillation via Teacher AssistantAAAI Conference on Artificial Intelligence (AAAI), 2019 Seyed Iman Mirzadeh Mehrdad Farajtabar Ang Li Nir Levine Akihiro Matsukawa H. Ghasemzadeh 331 1,246 0 09 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition Kazuki Irie Rohit Prabhavalkar Anjuli Kannan A. Bruguier David Rybach Patrick Nguyen 204 37 0 05 Feb 2019
Fully Convolutional Speech Recognition Neil Zeghidour Qiantong Xu Vitaliy Liptchinsky Nicolas Usunier Gabriel Synnaeve R. Collobert 176 95 0 17 Dec 2018
The Marchex 2018 English Conversational Telephone Speech Recognition System Xiaofeng Liu Zhenhua Guo J. You B. Kumar 159 1 0 05 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation Jason Chun Lok Li R. Gadde Boris Ginsburg Vitaly Lavrukhin 137 58 0 02 Nov 2018
Open Source Automatic Speech Recognition for German Benjamin Milde Arne Köhn VLM 159 40 0 26 Jul 2018
End-to-End Speech Recognition From the Raw Waveform Neil Zeghidour Nicolas Usunier Gabriel Synnaeve R. Collobert Emmanuel Dupoux 218 84 0 19 Jun 2018