v1v2 (latest)

Listen, Attend and Spell

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015

5 August 2015

Papers citing "Listen, Attend and Spell"

50 / 1,064 papers shown

Neural Architecture Search For LF-MMI Trained Time Delay Neural NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Jiajun Deng

247

08 Jan 2022

Two-Pass End-to-End ASR Model CompressionAutomatic Speech Recognition & Understanding (ASRU), 2021

08 Jan 2022

Sign Language Video Retrieval with Free-Form Textual QueriesComputer Vision and Pattern Recognition (CVPR), 2022

222

07 Jan 2022

Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelIEEE Signal Processing Letters (SPL), 2022

Yuexian Zou

180

06 Jan 2022

Discrete and continuous representations and processing in deep learning: Looking forwardAI Open (AO), 2022

300

04 Jan 2022

Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language QuestionThe VLDB journal (VLDBJ), 2022

Wailing Ng

Raymond Chi-Wing Wong

Xuefang Zhao

Chen Zhang

184

04 Jan 2022

Voice Quality and Pitch Features in Transformer-Based Speech RecognitionProceedings of the International Conference on Speech Prosody (ICSP), 2021

Guillermo Cámbara

Jordi Luque

Mireia Farrús

138

21 Dec 2021

Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing

176

16 Dec 2021

Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems

276

16 Dec 2021

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

14 Dec 2021

PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition

276

13 Dec 2021

Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMIIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Yuexian Zou

187

05 Dec 2021

Deliberation of Streaming RNN-Transducer by Non-autoregressive Decoding

Weiran Wang

Ke Hu

Tara N. Sainath

159

01 Dec 2021

Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

254

29 Nov 2021

Lattention: Lattice-attention in ASR rescoringIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

145

19 Nov 2021

A comparison of streaming models and data augmentation methods for robust speech recognitionAutomatic Speech Recognition & Understanding (ASRU), 2021

123

19 Nov 2021

Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

16 Nov 2021

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASRInterspeech (Interspeech), 2021

Ondˇrej Klejch

E. Wallington

P. Bell

170

12 Nov 2021

Enhancing Backdoor Attacks with Multi-Level MMD RegularizationIEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2021

226

09 Nov 2021

Conformer-based Hybrid ASR System for Switchboard Dataset

Alexander Gerstenberger

Ralf Schluter

Hermann Ney

235

05 Nov 2021

Context-Aware Transformer Transducer for Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021

Feng-Ju Chang

Jing Liu

Martin H. Radfar

Athanasios Mouchtaris

M. Omologo

Ariya Rastrow

Siegfried Kunzmann

188

05 Nov 2021

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

433

427

02 Nov 2021

With a Little Help from my Temporal Context: Multimodal Egocentric Action RecognitionBritish Machine Vision Conference (BMVC), 2021

Dima Damen

297

01 Nov 2021

Revealing and Protecting Labels in Distributed TrainingNeural Information Processing Systems (NeurIPS), 2021

Trung D. Q. Dang

Om Thakkar

Swaroop Indra Ramaswamy

Rajiv Mathews

Peter Chin

Franccoise Beaufays

104

31 Oct 2021

Pseudo-Labeling for Massively Multilingual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

299

30 Oct 2021

Cross-attention conformer for context modeling in speech enhancement for ASRAutomatic Speech Recognition & Understanding (ASRU), 2021

186

30 Oct 2021

An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASRAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021

20 Oct 2021

Automatic Learning of Subword Dependent Model Scales

18 Oct 2021

Sub-word Level Lip Reading With Visual Attention

Prajwal K R

Triantafyllos Afouras

Andrew Zisserman

225

111

14 Oct 2021

On Language Model Integration for RNN Transducer based Speech Recognition

268

13 Oct 2021

Reason induced visual attention for explainable autonomous driving

150

11 Oct 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text GenerationAutomatic Speech Recognition & Understanding (ASRU), 2021

Tianzi Wang

135

11 Oct 2021

K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and SyllablesInterspeech (Interspeech), 2021

Jounghee Kim

Pilsung Kang

VLM

120

11 Oct 2021

Advancing Momentum Pseudo-Labeling with Conformer and Initialization StrategyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

184

11 Oct 2021

Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

171

10 Oct 2021

SCaLa: Supervised Contrastive Learning for End-to-End Speech RecognitionInterspeech (Interspeech), 2021

156

08 Oct 2021

Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword UnitsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

170

08 Oct 2021

Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient MaskIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

133

08 Oct 2021

Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees

145

08 Oct 2021

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Binbin Zhang

Hang Lv

Pengcheng Guo

Qijie Shao

Chao Yang

...

Hui Bu

407

289

07 Oct 2021

BERT Attends the Conversation: Improving Low-Resource Conversational ASR

Pablo Ortiz

Simen Burud

131

05 Oct 2021

ASR Rescoring and Confidence Estimation with ELECTRA

211

05 Oct 2021

Multi-axis Attentive Prediction for Sparse EventData: An Application to Crime Prediction

Yi Sui

Ga Wu

Scott Sanner

123

05 Oct 2021

Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition

Tsendsuren Munkhdalai

224

05 Oct 2021

Towards efficient end-to-end speech recognition with biologically-inspired neural networks

178

04 Oct 2021

Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning

Muhammad Abdul-Mageed

VLM

229

01 Oct 2021

Multimodal Emotion Recognition with High-level Speech and Text Features

M. R. Makiuchi

Kuniaki Uto

Koichi Shinoda

226

29 Sep 2021

Word-level confidence estimation for RNN transducers

Laurent El Shafey

163

28 Sep 2021

Private Language Model Adaptation for Speech Recognition

259

28 Sep 2021

Factorized Neural Transducer for Efficient Language Model AdaptationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Xie Chen

Zhong Meng

S. Parthasarathy

Jinyu Li

497

27 Sep 2021