v1v2 (latest)

Listen, Attend and Spell

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015

5 August 2015

Papers citing "Listen, Attend and Spell"

50 / 1,064 papers shown

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech RecognitionSpoken Language Technology Workshop (SLT), 2022

371

19 Apr 2022

Self-critical Sequence Training for Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Chen Chen

Yuchen Hu

165

13 Apr 2022

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent SystemsInterspeech (Interspeech), 2022

238

11 Apr 2022

Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition

123

08 Apr 2022

A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition

146

05 Apr 2022

Class-Incremental Learning by Knowledge Distillation with Adaptive Feature ConsolidationComputer Vision and Pattern Recognition (CVPR), 2022

249

230

02 Apr 2022

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech RecognitionInterspeech (Interspeech), 2021

Jian Kang

145

02 Apr 2022

Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language UnderstandingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Kanthashree Mysore Sathyendra

134

01 Apr 2022

Memory-Efficient Training of RNN-Transducer with Sampled SoftmaxInterspeech (Interspeech), 2022

Jaesong Lee

Lukas Lee

Shinji Watanabe

286

31 Mar 2022

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech DatasetInterspeech (Interspeech), 2022

...

Pengyuan Zhang

Lei Xie

Yonghong Yan

147

31 Mar 2022

NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention MechanismIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Zhiyong Wu

Yuxuan Wang

31 Mar 2022

CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASRInterspeech (Interspeech), 2022

Zhijian Ou

172

31 Mar 2022

Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAEInternational Conference on Artificial Intelligence for Industries (ICAII), 2022

154

30 Mar 2022

Recent improvements of ASR models in the face of adversarial attacksInterspeech (Interspeech), 2022

R. Olivier

Bhiksha Raj

AAML

260

29 Mar 2022

Streaming parallel transducer beam search with fast-slow cascaded encodersInterspeech (Interspeech), 2022

Ozlem Kalinli

204

29 Mar 2022

Integrating Lattice-Free MMI into End-to-End Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

301

29 Mar 2022

WeNet 2.0: More Productive End-to-End Speech Recognition ToolkitInterspeech (Interspeech), 2022

Binbin Zhang

Chao Yang

274

129

29 Mar 2022

Investigating Self-supervised Pretraining Frameworks for Pathological Speech RecognitionInterspeech (Interspeech), 2022

Lester Phillip Violeta

Wen-Chin Huang

Tomoki Toda

268

29 Mar 2022

Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Chen Chen

Yuchen Hu

201

29 Mar 2022

Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASRInternational Conference on Neural Information Processing (ICONIP), 2022

Fangyuan Wang

Bo Xu

169

29 Mar 2022

Finnish Parliament ASR corpus - Analysis, benchmarks and statisticsLanguage Resources and Evaluation (LRE), 2022

197

28 Mar 2022

Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionInterspeech (Interspeech), 2022

Yuchen Hu

Nana Hou

Chen Chen

Chng Eng Siong

183

28 Mar 2022

Joint Transformer/RNN Architecture for Gesture Typing in Indic LanguagesInternational Conference on Computational Linguistics (COLING), 2020

Emil Biju

Anirudh Sriram

Mitesh M. Khapra

Pratyush Kumar

108

26 Mar 2022

Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks

193

24 Mar 2022

Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)International Conference on Machine Learning (ICML), 2022

Yu Huang

Junyang Lin

Chang Zhou

Hongxia Yang

Longbo Huang

181

145

23 Mar 2022

Transformer-based Streaming ASR with Cumulative AttentionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Mohan Li

Shucong Zhang

Catalin Zorila

R. Doddipatla

160

11 Mar 2022

aaeCAPTCHA: The Design and Implementation of Audio Adversarial CAPTCHAEuropean Symposium on Security and Privacy (Euro S&P), 2022

Md. Imran Hossen

X. Hei

141

05 Mar 2022

Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition SystemsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

232

02 Mar 2022

A Brief Overview of Unsupervised Neural Speech Representation Learning

Lasse Borgholt

Jakob Drachmann Havtorn

243

01 Mar 2022

Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey

Ngoc Dung Huynh

Mohamed Reda Bouadjenek

Imran Razzak

182

22 Feb 2022

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Tianzi Wang

Shujie Hu

227

21 Feb 2022

Learning Representations Robust to Group Shifts and Adversarial Examples

Ming-Chang Chiu

Xuezhe Ma

OOD

126

18 Feb 2022

End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system

Zheng Zhang

Pan Zhou

157

18 Feb 2022

Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech RecognizersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Yotaro Kubo

Shigeki Karita

M. Bacchiani

141

16 Feb 2022

Conversational Speech Recognition By Learning Conversation-level CharacteristicsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Lei Xie

171

16 Feb 2022

USTED: Improving ASR with a Unified Speech and Text Encoder-DecoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Bolaji Yusuf

Ankur Gandhe

Alex Sokolov

202

12 Feb 2022

Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding

Peter Sullivan

Toshiko Shibano

Muhammad Abdul-Mageed

150

10 Feb 2022

ASRPU: A Programmable Accelerator for Low-Power Automatic Speech RecognitionSocial Science Research Network (SSRN), 2022

D. Pinto

J. Arnau

Antonio González

10 Feb 2022

Semantic-aware Speech to Text Transmission with Redundancy Removal

178

07 Feb 2022

Joint Speech Recognition and Audio CaptioningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

145

03 Feb 2022

RescoreBERT: Discriminative Speech Recognition Rescoring with BERTIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

416

02 Feb 2022

BEA-Base: A Benchmark for ASR of Spontaneous HungarianInternational Conference on Language Resources and Evaluation (LREC), 2022

154

01 Feb 2022

Transformer-based Models of Text Normalization for Speech Applications

173

01 Feb 2022

Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge SelectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Minglun Han

Linhao Dong

165

30 Jan 2022

Reducing language context confusion for end-to-end code-switching automatic speech recognitionInterspeech (Interspeech), 2022

Jiangyan Yi

169

28 Jan 2022

On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR

223

26 Jan 2022

Improving the fusion of acoustic and text representations in RNN-TIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Chao Zhang

199

25 Jan 2022

Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

153

25 Jan 2022

Recent Progress in the CUHK Dysarthric Speech Recognition SystemIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

161

15 Jan 2022

Spectro-Temporal Deep Features for Disordered Speech Assessment and RecognitionInterspeech (Interspeech), 2021

Zengrui Jin

120

14 Jan 2022