ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.01211
  4. Cited By
Listen, Attend and Spell
v1v2 (latest)

Listen, Attend and Spell

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015
5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
    RALM
ArXiv (abs)PDFHTML

Papers citing "Listen, Attend and Spell"

50 / 1,064 papers shown
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised
  Representation Learning from Speech
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechInterspeech (Interspeech), 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
236
71
0
23 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction NetworkInterspeech (Interspeech), 2021
Janne Pylkkönen
Antti Ukkonen
Juho Kilpikoski
Samu Tamminen
Hannes Heikinheimo
181
28
0
22 Apr 2021
Advanced Long-context End-to-end Speech Recognition Using
  Context-expanded Transformers
Advanced Long-context End-to-end Speech Recognition Using Context-expanded TransformersInterspeech (Interspeech), 2021
Takaaki Hori
Niko Moritz
Chiori Hori
Jonathan Le Roux
140
37
0
19 Apr 2021
Non-linear Functional Modeling using Neural Networks
Non-linear Functional Modeling using Neural NetworksJournal of Computational And Graphical Statistics (JCGS), 2021
Aniruddha Rajendra Rao
M. Reimherr
172
36
0
19 Apr 2021
Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition
Acoustic Data-Driven Subword Modeling for End-to-End Speech RecognitionInterspeech (Interspeech), 2021
Wei Zhou
Mohammad Zeineldeen
Zuoyun Zheng
Ralf Schluter
Hermann Ney
216
14
0
19 Apr 2021
A Method to Reveal Speaker Identity in Distributed ASR Training, and How
  to Counter It
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter ItIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Trung D. Q. Dang
Om Thakkar
Swaroop Indra Ramaswamy
Rajiv Mathews
Peter Chin
Franccoise Beaufays
FedML
103
10
0
15 Apr 2021
Integration of Pre-trained Networks with Continuous Token Interface for
  End-to-End Spoken Language Understanding
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language UnderstandingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
S. Seo
Donghyun Kwak
Bowon Lee
194
34
0
15 Apr 2021
Annealing Knowledge Distillation
Annealing Knowledge DistillationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
A. Jafari
Mehdi Rezagholizadeh
Pranav Sharma
A. Ghodsi
190
91
0
14 Apr 2021
Efficient conformer-based speech recognition with linear attention
Efficient conformer-based speech recognition with linear attentionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
199
26
0
14 Apr 2021
Investigating Methods to Improve Language Model Integration for
  Attention-based Encoder-Decoder ASR Models
Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR ModelsInterspeech (Interspeech), 2021
Mohammad Zeineldeen
Aleksandr Glushko
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
191
44
0
12 Apr 2021
Non-autoregressive Transformer-based End-to-end ASR using BERT
Non-autoregressive Transformer-based End-to-end ASR using BERTIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Fu-Hao Yu
Kuan-Yu Chen
144
32
0
10 Apr 2021
Lip reading using external viseme decoding
Lip reading using external viseme decodingIranian Conference on Machine Vision and Image Processing (IMVIP), 2021
J. Peymanfard
Mohammad Reza Mohammadi
Hossein Zeinali
N. Mozayani
150
16
0
10 Apr 2021
Boundary and Context Aware Training for CIF-based Non-Autoregressive
  End-to-end ASR
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASRAutomatic Speech Recognition & Understanding (ASRU), 2021
Fan Yu
Haoneng Luo
Pengcheng Guo
Yuhao Liang
Zhuoyuan Yao
Lei Xie
Yingying Gao
Leijing Hou
Shilei Zhang
117
14
0
10 Apr 2021
Language model fusion for streaming end to end speech recognition
Language model fusion for streaming end to end speech recognition
Rodrigo Cabrera
Xiaofeng Liu
M. Ghodsi
Zebulun Matteson
Eugene Weinstein
Anjuli Kannan
MoMeAI4TS
91
14
0
09 Apr 2021
On Architectures and Training for Raw Waveform Feature Extraction in ASR
On Architectures and Training for Raw Waveform Feature Extraction in ASRAutomatic Speech Recognition & Understanding (ASRU), 2021
Peter Vieting
Christoph Luscher
Wilfried Michel
Ralf Schluter
Hermann Ney
168
11
0
09 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by
  Applying Fast-Skip Regularization
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip RegularizationInterspeech (Interspeech), 2021
Zhengkun Tian
Jiangyan Yi
Ye Bai
Jianhua Tao
Shuai Zhang
Zhengqi Wen
83
19
0
07 Apr 2021
Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture
  Search For End-to-End ASR
Darts-Conformer: Towards Efficient Gradient-Based Neural Architecture Search For End-to-End ASR
Xian Shi
Pan Zhou
Wei Chen
Lei Xie
166
19
0
07 Apr 2021
Extremely Low Footprint End-to-End ASR System for Smart Device
Extremely Low Footprint End-to-End ASR System for Smart DeviceInterspeech (Interspeech), 2021
Zhifu Gao
Yiwu Yao
Shiliang Zhang
Jun Yang
Ming Lei
Ian Mcloughlin
106
15
0
06 Apr 2021
Non-autoregressive Mandarin-English Code-switching Speech Recognition
Non-autoregressive Mandarin-English Code-switching Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021
Shun-Po Chuang
Heng-Jui Chang
Sung-Feng Huang
Hung-yi Lee
241
16
0
06 Apr 2021
Understanding Medical Conversations: Rich Transcription, Confidence
  Scores & Information Extraction
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information ExtractionInterspeech (Interspeech), 2021
H. Soltau
Mingqiu Wang
Izhak Shafran
Laurent El Shafey
MedImLM&MA
148
13
0
06 Apr 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition
Dissecting User-Perceived Latency of On-Device E2E Speech RecognitionInterspeech (Interspeech), 2021
Yuan Shangguan
Rohit Prabhavalkar
Hang Su
Jay Mahadeokar
Yangyang Shi
...
Chunyang Wu
Duc Le
Ozlem Kalinli
Christian Fuegen
M. Seltzer
229
33
0
06 Apr 2021
SpeechStew: Simply Mix All Available Speech Recognition Data to Train
  One Large Neural Network
SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network
William Chan
Daniel S. Park
Chris A. Lee
Yu Zhang
Quoc V. Le
Mohammad Norouzi
AI4TS
367
147
0
05 Apr 2021
Streaming Multi-talker Speech Recognition with Joint Speaker
  Identification
Streaming Multi-talker Speech Recognition with Joint Speaker IdentificationInterspeech (Interspeech), 2021
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
213
21
0
05 Apr 2021
Towards Lifelong Learning of End-to-end ASR
Towards Lifelong Learning of End-to-end ASRInterspeech (Interspeech), 2021
Heng-Jui Chang
Hung-yi Lee
Lin-Shan Lee
KELMCLL
256
36
0
04 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding
  with Numbers
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers
Loren Lugosch
Piyush Papreja
Mirco Ravanelli
A. Heba
Titouan Parcollet
166
14
0
04 Apr 2021
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech
  Recognition
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021
Zhengkun Tian
Jiangyan Yi
Jianhua Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
Zhengqi Wen
113
23
0
04 Apr 2021
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
HMM-Free Encoder Pre-Training for Streaming RNN TransducerInterspeech (Interspeech), 2021
Lu Huang
J. Sun
Yu Tang
Junfeng Hou
Jinkun Chen
Jun Zhang
Zejun Ma
159
3
0
02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a
  Language-Independent Subword Discriminative Feature Representation
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature RepresentationInterspeech (Interspeech), 2021
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
175
4
0
02 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a
  SMART: a Monte Carlo-based approach and case study with longitudinal
  overdispersed count outcomes
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomesStatistical Methods in Medical Research (Stat Med), 2021
Jamie Yap
John J. Dziak
David Kabiito
Claire Babirye
J. McKay
Bibhas Chakraborty
J. Nakatumba‐Nabende
194
27
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep LearningArtificial Intelligence Review (AIR), 2021
Alana de Santana Correia
Esther Luna Colombini
HAI
332
258
0
31 Mar 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter TransformersACM Computing Surveys (CSUR), 2021
Quentin Fournier
G. Caron
Daniel Aloise
382
139
0
26 Mar 2021
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual
  Speech Separation
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech SeparationComputer Vision and Pattern Recognition (CVPR), 2021
Jiyoung Lee
Soo-Whan Chung
Sunok Kim
Hong-Goo Kang
Kwanghoon Sohn
173
59
0
25 Mar 2021
Advancing RNN Transducer Technology for Speech Recognition
Advancing RNN Transducer Technology for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
G. Saon
Zoltan Tueske
Daniel Bolaños
Brian Kingsbury
185
98
0
17 Mar 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning
  with Self-Knowledge Distillation
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge DistillationInterspeech (Interspeech), 2021
Md. Akmal Haidar
Chao Xing
Mehdi Rezagholizadeh
182
6
0
17 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
179
18
0
13 Mar 2021
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource
  End-to-End Speech Recognition
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech RecognitionItalian National Conference on Sensors (INS), 2021
A. Laptev
A. Andrusenko
Ivan Podluzhny
Anton Mitrofanov
Ivan Medennikov
Yuri N. Matveev
VLM
134
15
0
12 Mar 2021
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative
  Adversarial Networks
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Md. Akmal Haidar
Mehdi Rezagholizadeh
221
9
0
10 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers
End-to-end acoustic modelling for phone recognition of young readersSpeech Communication (Speech Commun.), 2021
Lucile Gelin
Morgane Daniel
J. Pinquier
Thomas Pellegrini
191
17
0
04 Mar 2021
Alignment Knowledge Distillation for Online Streaming Attention-based
  Speech Recognition
Alignment Knowledge Distillation for Online Streaming Attention-based Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Hirofumi Inaguma
Tatsuya Kawahara
343
19
0
28 Feb 2021
MixSpeech: Data Augmentation for Low-resource Automatic Speech
  Recognition
MixSpeech: Data Augmentation for Low-resource Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Linghui Meng
Jin Xu
Xu Tan
Yongfeng Zhang
Tao Qin
Bo Xu
VLM
167
86
0
25 Feb 2021
Neural ranking models for document retrieval
Neural ranking models for document retrieval
M. Trabelsi
Zhiyu Zoey Chen
Brian D. Davison
J. Heflin
FedML
185
35
0
23 Feb 2021
Joint Intent Detection And Slot Filling Based on Continual Learning
  Model
Joint Intent Detection And Slot Filling Based on Continual Learning ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yanfei Hui
Jianzong Wang
Ning Cheng
Fengying Yu
Tianbo Wu
Jing Xiao
88
19
0
22 Feb 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets,
  Tracks, Baselines, Results and Methods
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and MethodsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Xian Shi
Fan Yu
Yizhou Lu
Yuhao Liang
Qiangze Feng
Daliang Wang
Y. Qian
Lei Xie
126
79
0
20 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An
  Empirical Study
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical StudyComputer Speech and Language (CSL), 2021
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
162
63
0
19 Feb 2021
Vision-Aided 6G Wireless Communications: Blockage Prediction and
  Proactive Handoff
Vision-Aided 6G Wireless Communications: Blockage Prediction and Proactive HandoffIEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2021
Gouranga Charan
Muhammad Alrabeiah
Ahmed Alkhateeb
184
170
0
18 Feb 2021
Do End-to-End Speech Recognition Models Care About Context?
Do End-to-End Speech Recognition Models Care About Context?Interspeech (Interspeech), 2020
Lasse Borgholt
Jakob Drachmann Havtorn
Zeljko Agic
Anders Søgaard
Lars Maaløe
Christian Igel
110
8
0
17 Feb 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for
  air traffic control systems
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systemsApplied Soft Computing (Appl Soft Comput), 2021
Yi Lin
Bo Yang
Linchao Li
Dongyue Guo
Jianwei Zhang
Hu Chen
Yi Zhang
173
32
0
17 Feb 2021
End-to-End Automatic Speech Recognition with Deep Mutual Learning
End-to-End Automatic Speech Recognition with Deep Mutual LearningAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
Ryo Masumura
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Takanori Ashihara
97
6
0
16 Feb 2021
Exploring Transformers in Natural Language Generation: GPT, BERT, and
  XLNet
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet
M. O. Topal
Anil Bas
Imke van Heerden
LLMAGAI4CE
129
105
0
16 Feb 2021
Improving speech recognition models with small samples for air traffic
  control systems
Improving speech recognition models with small samples for air traffic control systemsNeurocomputing (Neurocomputing), 2021
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
193
33
0
16 Feb 2021
Previous
123...101112...202122
Next
Page 11 of 22
Pageof 22