Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1508.01211
Cited By
v1
v2 (latest)
Listen, Attend and Spell
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015
5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Listen, Attend and Spell"
50 / 1,064 papers shown
Title
Training Spiking Neural Networks Using Lessons From Deep Learning
Proceedings of the IEEE (Proc. IEEE), 2021
Nhan Duy Truong
Max Ward
Emre Neftci
Xinxin Wang
Gregor Lenz
Girish Dwivedi
Bennamoun
Doo Seok Jeong
Wei D. Lu
516
665
0
27 Sep 2021
ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization
Automatic Speech Recognition & Understanding (ASRU), 2021
M. Gaudesi
F. Weninger
D. Sharma
P. Zhan
AAML
101
1
0
23 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
186
27
0
19 Sep 2021
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
F. Weninger
M. Gaudesi
Ralf Leibold
R. Gemello
P. Zhan
83
4
0
17 Sep 2021
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
Chen Zhang
Jiaxing Yu
Luchin Chang
Xu Tan
Jiawei Chen
Tao Qin
Kecheng Zhang
127
16
0
16 Sep 2021
Utterance-level neural confidence measure for end-to-end children speech recognition
W. Liu
Tan Lee
110
6
0
16 Sep 2021
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition
Felix Wu
Kwangyoun Kim
Jing Pan
Kyu Jeong Han
Kilian Q. Weinberger
Yoav Artzi
161
83
0
14 Sep 2021
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition
Interspeech (Interspeech), 2021
Rong Gong
Carl Quillen
D. Sharma
Andrew Goderre
José Laínez
Ljubomir Milanović
187
15
0
10 Sep 2021
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Guangzhi Sun
Chao Zhang
P. Woodland
200
40
0
01 Sep 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
180
12
0
24 Aug 2021
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Interspeech (Interspeech), 2021
Xiaodong Cui
Brian Kingsbury
G. Saon
David Haws
Zoltán Tüske
130
6
0
24 Aug 2021
Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer
Krishna D N Freshworks
74
8
0
22 Aug 2021
A Dual-Decoder Conformer for Multilingual Speech Recognition
Krishna D N Freshworks
82
3
0
22 Aug 2021
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition
Krishna D N Freshworks
122
14
0
22 Aug 2021
Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers
Interspeech (Interspeech), 2021
Juntae Kim
Jee-Hye Lee
178
8
0
22 Aug 2021
A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Xiaoqiang Wang
Yanqing Liu
Sheng Zhao
Jinyu Li
KELM
144
18
0
17 Aug 2021
SpecMix : A Mixed Sample Data Augmentation method for Training withTime-Frequency Domain Features
Interspeech (Interspeech), 2021
Gwantae Kim
D. Han
Hanseok Ko
138
58
0
06 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Interspeech (Interspeech), 2021
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
157
30
0
05 Aug 2021
Adversarial Data Augmentation for Disordered Speech Recognition
Zengrui Jin
Mengzhe Geng
Xurong Xie
Jianwei Yu
Shansong Liu
Xunying Liu
Helen Meng
118
45
0
02 Aug 2021
Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations
European Signal Processing Conference (EUSIPCO), 2021
Seyun Um
Jihyun Kim
Jihyun Lee
Hong-Goo Kang
CVBM
290
4
0
26 Jul 2021
Ensemble of Convolution Neural Networks on Heterogeneous Signals for Sleep Stage Scoring
Social Science Research Network (SSRN), 2021
Enrique Fernández-Blanco
C. Fernandez-Lozano
A. Pazos
Daniel Rivero
116
4
0
23 Jul 2021
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording
Interspeech (Interspeech), 2021
Hirofumi Inaguma
Tatsuya Kawahara
178
2
0
15 Jul 2021
A Configurable Multilingual Model is All You Need to Recognize All Languages
Long Zhou
Jinyu Li
Eric Sun
Shujie Liu
221
46
0
13 Jul 2021
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data
K. Cheuk
Dorien Herremans
Li Su
353
39
0
11 Jul 2021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Automatic Speech Recognition & Understanding (ASRU), 2021
Xiaohui Zhang
Vimal Manohar
David C. Zhang
Frank Zhang
Yangyang Shi
Nayan Singhal
Julian Chan
Fuchun Peng
Yatharth Saraf
M. Seltzer
277
14
0
09 Jul 2021
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Tomohiro Tanaka
Ryo Masumura
Mana Ihori
Akihiko Takashima
Shota Orihashi
Naoki Makishima
120
4
0
07 Jul 2021
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition
Christian Huber
Juan Hussain
Sebastian Stüker
A. Waibel
171
33
0
05 Jul 2021
Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Timo Lohrenz
P. Schwarz
Zhengyang Li
Tim Fingscheidt
150
11
0
02 Jul 2021
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis
Shammur A. Chowdhury
Nadir Durrani
Ahmed M. Ali
344
20
0
01 Jul 2021
On joint training with interfaces for spoken language understanding
Interspeech (Interspeech), 2021
A. Raju
Milind Rao
Gautam Tiwari
Pranav Dheram
Bryan Anderson
Zhe Zhang
Chul Lee
Bach Bui
Ariya Rastrow
VLM
191
11
0
30 Jun 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
323
433
0
29 Jun 2021
Where are we in semantic concept extraction for Spoken Language Understanding?
Sahar Ghannay
Antoine Caubrière
Salima Mdhaffar
G. Laperriere
Bassam Jabaian
Yannick Esteve
183
18
0
24 Jun 2021
Towards Automatic Speech to Sign Language Generation
Parul Kapoor
Rudrabha Mukhopadhyay
Sindhu B. Hegde
Vinay P. Namboodiri
C. V. Jawahar
SLR
154
16
0
24 Jun 2021
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-EndSpeech Recognition
Xiong Wang
Sining Sun
Lei Xie
Long Ma
97
21
0
17 Jun 2021
Layer Pruning on Demand with Intermediate CTC
Jaesong Lee
Jingu Kang
Shinji Watanabe
125
21
0
17 Jun 2021
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
VLM
236
55
0
16 Jun 2021
Attention-Based Keyword Localisation in Speech using Visual Grounding
Kayode Olaleye
Herman Kamper
104
13
0
16 Jun 2021
SynthASR: Unlocking Synthetic Data for Speech Recognition
Interspeech (Interspeech), 2021
A. Fazel
Wei Yang
Yulan Liu
Roberto Barra-Chicote
Yi Meng
Roland Maas
J. Droppo
SyDa
163
59
0
14 Jun 2021
Improving RNN-T ASR Performance with Date-Time and Location Awareness
Workshop on Time-Delay Systems (TS), 2021
Swayambhu Nath Ray
Soumyajit Mitra
Raghavendra Bilgi
Sri Garimella
104
5
0
11 Jun 2021
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
Interspeech (Interspeech), 2021
Max W. Y. Lam
Jun Wang
Chao Weng
Jane Polak Scowcroft
Dong Yu
124
7
0
08 Jun 2021
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios
Interspeech (Interspeech), 2021
E. Tsunoo
Kentarou Shibata
Chaitanya Narisetty
Yosuke Kashiwagi
Shinji Watanabe
115
13
0
07 Jun 2021
Approximate Fixed-Points in Recurrent Neural Networks
Zhengxiong Wang
Anton Ragni
70
4
0
04 Jun 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Interspeech (Interspeech), 2021
Zhong Meng
Yu-Huan Wu
Naoyuki Kanda
Liang Lu
Xie Chen
Guoli Ye
Eric Sun
Jinyu Li
Jiawei Liu
MoMe
145
22
0
04 Jun 2021
Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
Interspeech (Interspeech), 2021
Shammur A. Chowdhury
A. Hussein
Ahmed Abdelali
Ahmed M. Ali
247
48
0
31 May 2021
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Interspeech (Interspeech), 2021
Swayambhu Nath Ray
Minhua Wu
A. Raju
Pegah Ghahremani
Raghavendra Bilgi
Milind Rao
Harish Arsikere
Ariya Rastrow
A. Stolcke
J. Droppo
151
14
0
14 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
104
4
0
13 May 2021
Quantifying and Maximizing the Benefits of Back-End Noise Adaption on Attention-Based Speech Recognition Models
Coleman Hooper
Thierry Tambe
Gu-Yeon Wei
104
1
0
03 May 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
On Addressing Practical Challenges for RNN-Transducer
Automatic Speech Recognition & Understanding (ASRU), 2021
Rui Zhao
Jian Xue
Jinyu Li
Wenning Wei
Lei He
Jiawei Liu
253
33
0
27 Apr 2021
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models
Interspeech (Interspeech), 2021
Thibault Doutre
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Olivier Siohan
Liangliang Cao
138
6
0
25 Apr 2021
Previous
1
2
3
...
9
10
11
...
20
21
22
Next