Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.04906
Cited By
On the Usefulness of Self-Attention for Automatic Speech Recognition with Transformers
8 November 2020
Shucong Zhang
Erfan Loweimi
P. Bell
Steve Renals
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On the Usefulness of Self-Attention for Automatic Speech Recognition with Transformers"
21 / 21 papers shown
Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
134
0
0
01 Oct 2025
Whisper Has an Internal Word Aligner
Sung-Lin Yeh
Yen Meng
Hao Tang
179
1
0
12 Sep 2025
Dynamic Acoustic Model Architecture Optimization in Training for ASR
Jingjing Xu
Zijian Yang
Albert Zeyer
Eugen Beck
Ralf Schlueter
Hermann Ney
296
2
0
16 Jun 2025
How Redundant Is the Transformer Stack in Speech Representation Models?
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Teresa Dorszewski
Albert Kjøller Jacobsen
Lenka Tětková
Lars Kai Hansen
511
3
0
20 Jan 2025
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition
Shucong Zhang
Titouan Parcollet
Rogier van Dalen
Sourav Bhattacharya
403
1
0
10 Jan 2025
Convexity-based Pruning of Speech Representation Models
International Workshop on Machine Learning for Signal Processing (MLSP), 2024
Teresa Dorszewski
Lenka Tětková
Lars Kai Hansen
313
2
0
16 Aug 2024
Linear-Complexity Self-Supervised Learning for Speech Processing
Shucong Zhang
Titouan Parcollet
Rogier van Dalen
Sourav Bhattacharya
314
1
0
18 Jul 2024
Multi-Convformer: Extending Conformer with Multiple Convolution Kernels
Darshan Prabhu
Yifan Peng
Preethi Jyothi
Shinji Watanabe
337
7
0
04 Jul 2024
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization
Jianzong Wang
Ziqi Liang
Xulong Zhang
Ning Cheng
Jing Xiao
205
1
0
30 Apr 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Hamza Kheddar
Mustapha Hemis
Yassine Himeur
OffRL
306
174
0
02 Mar 2024
SpeechAlign: a Framework for Speech Translation Alignment Evaluation
International Conference on Language Resources and Evaluation (LREC), 2023
Belen Alastruey
Aleix Sant
Gerard I. Gállego
David Dale
Marta R. Costa-jussá
AuLLM
354
3
0
20 Sep 2023
SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding
Interspeech (Interspeech), 2023
Titouan Parcollet
Rogier van Dalen
Shucong Zhang
S. Bhattacharya
303
16
0
12 Jul 2023
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
Interspeech (Interspeech), 2023
Ziao Yang
Samridhi Choudhary
Siegfried Kunzmann
Zheng Zhang
MQ
318
6
0
01 Jun 2023
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Interspeech (Interspeech), 2023
Yifan Peng
Yui Sudo
Muhammad Shakeel
Shinji Watanabe
303
59
0
28 May 2023
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yifan Peng
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Shinji Watanabe
229
56
0
27 Feb 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
351
6
0
19 Dec 2022
Is Smaller Always Faster? Tradeoffs in Compressing Self-Supervised Speech Transformers
Tzu-Quan Lin
Tsung-Huan Yang
Chun-Yao Chang
Kuang-Ming Chen
Tzu-hsun Feng
Hung-yi Lee
Hao Tang
336
6
0
17 Nov 2022
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding
International Conference on Machine Learning (ICML), 2022
Yifan Peng
Siddharth Dalmia
Ian Lane
Shinji Watanabe
306
198
0
06 Jul 2022
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Neural Information Processing Systems (NeurIPS), 2022
Sehoon Kim
A. Gholami
Albert Eaton Shaw
Nicholas Lee
K. Mangalam
Jitendra Malik
Michael W. Mahoney
Kurt Keutzer
404
135
0
02 Jun 2022
On the Locality of Attention in Direct Speech Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Belen Alastruey
Javier Ferrando
Gerard I. Gállego
Marta R. Costa-jussá
284
8
0
19 Apr 2022
Similarity and Content-based Phonetic Self Attention for Speech Recognition
Interspeech (Interspeech), 2022
Kyuhong Shim
Wonyong Sung
370
8
0
19 Mar 2022
1
Page 1 of 1