Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard

Interspeech (Interspeech), 2020

20 January 2020

Kartik Audhkhasi

Papers citing "Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard"

50 / 52 papers shown

Self-Improvement for Audio Large Language Model using Unlabeled Speech

266

27 Jul 2025

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

175

17 Oct 2023

Investigating the Effect of Language Models in Sequence Discriminative Training for Neural Transducers

Automatic Speech Recognition & Understanding (ASRU), 2023

212

11 Oct 2023

On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

349

25 Sep 2023

Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

400

15 Sep 2023

Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Interspeech (Interspeech), 2023

189

15 Jun 2023

End-to-End Speech Recognition: A Survey

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

361

276

03 Mar 2023

Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Jiajun Deng

Tianzi Wang

Zengrui Jin

Shujie Hu

197

15 Feb 2023

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

297

07 Dec 2022

Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

330

17 Nov 2022

Monotonic segmental attention for automatic speech recognition

Spoken Language Technology Workshop (SLT), 2022

159

26 Oct 2022

Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding

International Conference on Machine Learning (ICML), 2022

311

199

06 Jul 2022

Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Interspeech (Interspeech), 2022

218

26 Jun 2022

Confidence Score Based Conformer Speaker Adaptation for Speech Recognition

Interspeech (Interspeech), 2022

Jiajun Deng

Tianzi Wang

Zengrui Jin

232

24 Jun 2022

Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Switchboard Corpus

Interspeech (Interspeech), 2022

270

23 Jun 2022

Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems

Interspeech (Interspeech), 2022

Mingyu Cui

Jiajun Deng

Shoukang Hu

Xurong Xie

Tianzi Wang

Shujie Hu

197

23 Jun 2022

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

Interspeech (Interspeech), 2022

A. Fasoli

Chia-Yu Chen

Mauricio Serrano

Swagath Venkataramani

208

16 Jun 2022

LegoNN: Building Modular Encoder-Decoder Models

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Sergey Edunov

Luke Zettlemoyer

232

07 Jun 2022

Efficient Training of Neural Transducer for Speech Recognition

Interspeech (Interspeech), 2022

245

22 Apr 2022

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

Interspeech (Interspeech), 2022

438

11 Apr 2022

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Interspeech (Interspeech), 2022

366

01 Apr 2022

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

154

28 Jan 2022

Improving Factored Hybrid HMM Acoustic Modeling without State Tying

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

306

24 Jan 2022

Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Jiajun Deng

310

08 Jan 2022

Robust Self-Supervised Audio-Visual Speech Recognition

Interspeech (Interspeech), 2022

Bowen Shi

Wei-Ning Hsu

Abdel-rahman Mohamed

411

123

05 Jan 2022

Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

317

29 Nov 2021

Conformer-based Hybrid ASR System for Switchboard Dataset

Alexander Gerstenberger

Ralf Schlüter

Hermann Ney

344

05 Nov 2021

On Language Model Integration for RNN Transducer based Speech Recognition

336

13 Oct 2021

ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization

Automatic Speech Recognition & Understanding (ASRU), 2021

208

23 Sep 2021

Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition

176

17 Sep 2021

4-bit Quantization of LSTM-based Speech Recognition Models

Interspeech (Interspeech), 2021

...

Wei Zhang

183

27 Aug 2021

Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation

Samuel Cahyawijaya

228

24 Aug 2021

Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Interspeech (Interspeech), 2021

155

24 Aug 2021

Overcoming Domain Mismatch in Low Resource Sequence-to-Sequence ASR Models using Hybrid Generated Pseudotranscripts

229

14 Jun 2021

On the limit of English conversational speech recognition

Interspeech (Interspeech), 2021

Zoltán Tüske

G. Saon

Brian Kingsbury

299

03 May 2021

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers

Interspeech (Interspeech), 2021

199

19 Apr 2021

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition

Interspeech (Interspeech), 2021

300

19 Apr 2021

Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept

Interspeech (Interspeech), 2021

257

13 Apr 2021

Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models

Interspeech (Interspeech), 2021

244

12 Apr 2021

Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures

Automatic Speech Recognition & Understanding (ASRU), 2021

243

12 Apr 2021

Towards Consistent Hybrid HMM Acoustic Modeling

404

06 Apr 2021

A study of latent monotonic attention variants

Albert Zeyer

Ralf Schlüter

Hermann Ney

307

30 Mar 2021

Residual Energy-Based Models for End-to-End Speech Recognition

Interspeech (Interspeech), 2021

229

25 Mar 2021

Advancing RNN Transducer Technology for Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

294

104

17 Mar 2021

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Wangyou Zhang

172

23 Feb 2021

Bayesian Learning for Deep Neural Network Adaptation

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

530

14 Dec 2020

Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

374

30 Oct 2020

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

434

07 Oct 2020

End-to-End Spoken Language Understanding Without Full Transcripts

Kartik Audhkhasi

232

30 Sep 2020

Semi-Supervised Learning with Data Augmentation for End-to-End ASR

Interspeech (Interspeech), 2020

265

27 Jul 2020