Attention-Based Models for Speech Recognition

24 June 2015
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio

Papers citing "Attention-Based Models for Speech Recognition"

50 / 299 papers shown
Enhancing short-term traffic prediction by integrating trends and fluctuations with attention mechanism
A. Das
Agnimitra Sengupta
S. I. Guler
AI4TS
26
0
0
28 Apr 2025
A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network
Chih-Chyau Yang
Tian-Sheuan Chang
60
1
0
27 Mar 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Adam Stooke
Rohit Prabhavalkar
K. Sim
P. M. Mengibar
37
0
0
06 Feb 2025
emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography
Viswanath Sivakumar
Jeffrey Seely
Alan Du
Sean R Bittner
Adam Berenzweig
Anuoluwapo Bolarinwa
Alexandre Gramfort
Michael I Mandel
13
3
0
26 Oct 2024
HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR
Hainan Xu
Travis M. Bartley
Vladimir Bataev
Boris Ginsburg
141
0
0
03 Oct 2024
The Conformer Encoder May Reverse the Time Dimension
Robin Schmitt
Albert Zeyer
Mohammad Zeineldeen
Ralf Schluter
Hermann Ney
31
0
0
01 Oct 2024
Lightweight Transducer Based on Frame-Level Criterion
Genshun Wan
Mengzhi Wang
Tingzhi Mao
Hang Chen
Z. Ye
44
1
0
05 Sep 2024
GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Jun Li
Jinying Wu
Qiming Li
Feifei Guo
39
0
0
31 Aug 2024
Improving Prediction of Need for Mechanical Ventilation using Cross-Attention
Anwesh Mohanty
S. Shashikumar
Jonathan Y. Lam
S. Nemati
AI4CE
24
0
0
21 Jul 2024
An efficient text augmentation approach for contextualized Mandarin speech recognition
Naijun Zheng
Xucheng Wan
Kai Liu
Ziqing Du
Zhou Huan
40
1
0
14 Jun 2024
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
Xuankai Chang
Jiatong Shi
Jinchuan Tian
Yuning Wu
Yuxun Tang
Yihan Wu
Shinji Watanabe
Yossi Adi
Xie Chen
Qin Jin
43
15
0
11 Jun 2024
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation
Keqi Deng
Philip C. Woodland
31
4
0
06 Jun 2024
"You should probably read this": Hedge Detection in Text
Denys Katerenchuk
Rivka Levitan
41
1
0
22 May 2024
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images
Chengxi Han
Chen Wu
Haonan Guo
Meiqi Hu
Hongruixuan Chen
28
89
0
14 Apr 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition
Hainan Xu
Zhehuai Chen
Fei Jia
Boris Ginsburg
33
0
0
04 Apr 2024
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in Sorani Kurdish
Sana Isam
Hossein Hassani
23
0
0
29 Mar 2024
An Attention Long Short-Term Memory based system for automatic classification of speech intelligibility
Miguel Fernández-Díaz
A. Gallardo-Antolín
14
40
0
05 Feb 2024
On Speaker Attribution with SURT
Desh Raj
Matthew Wiesner
Matthew Maciejewski
Leibny Paola García-Perera
Daniel Povey
Sanjeev Khudanpur
24
3
0
28 Jan 2024
Improving ASR Contextual Biasing with Guided Attention
Jiyang Tang
Kwangyoun Kim
Suwon Shon
Felix Wu
Prashant Sridhar
Shinji Watanabe
19
8
0
16 Jan 2024
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition
Peng Shen
Xugang Lu
Hisashi Kawai
27
1
0
18 Dec 2023
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Shaojin Ding
David Qiu
David Rim
Yanzhang He
Oleg Rybakov
...
Tara N. Sainath
Zhonglin Han
Jian Li
Amir Yazdanbakhsh
Shivani Agrawal
MQ
26
9
0
13 Dec 2023
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
30
4
0
09 Oct 2023
Updated Corpora and Benchmarks for Long-Form Speech Recognition
Jennifer Drexler Fox
Desh Raj
Natalie Delworth
Quinn Mcnamara
Corey Miller
Miguel Jetté
AuLLM
26
7
0
26 Sep 2023
Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling
Zheng Nan
T. Dang
V. Sethu
Beena Ahmed
BDL
19
2
0
21 Sep 2023
Semi-Autoregressive Streaming ASR With Label Context
Siddhant Arora
G. Saon
Shinji Watanabe
Brian Kingsbury
AI4TS
23
5
0
19 Sep 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
22
4
0
24 Jul 2023
Arbitrary point cloud upsampling via Dual Back-Projection Network
Zhisong Liu
Zijia Wang
Zhen Jia
3DPC
19
4
0
18 Jul 2023
Machine Learning for Autonomous Vehicle's Trajectory Prediction: A comprehensive survey, Challenges, and Future Research Directions
Vibha Bharilya
Neetesh Kumar
28
47
0
12 Jul 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
Desh Raj
Daniel Povey
Sanjeev Khudanpur
VLM
26
9
0
18 Jun 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
Lu Huang
B. Li
Jun Zhang
Lu Lu
Zejun Ma
26
2
0
07 Jun 2023
Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Tianyi Xu
Zhanheng Yang
Kaixun Huang
Pengcheng Guo
Aoting Zhang
Biao Li
Changru Chen
C. Li
Linfu Xie
14
10
0
01 Jun 2023
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution
Matías P. Pizarro
D. Kolossa
Asja Fischer
AAML
35
1
0
26 May 2023
CopyNE: Better Contextual ASR by Copying Named Entities
Shilin Zhou
Zhenghua Li
Yu Hong
M. Zhang
Zhefeng Wang
Baoxing Huai
15
5
0
22 May 2023
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Kaixun Huang
Aoting Zhang
Zhanheng Yang
Pengcheng Guo
Bingshen Mu
Tianyi Xu
Linfu Xie
27
16
0
21 May 2023
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition
Xuandi Fu
Kanthashree Mysore Sathyendra
Ankur Gandhe
Jing Liu
Grant P. Strimel
Ross McGowan
Athanasios Mouchtaris
25
14
0
09 May 2023
Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Jingbei Li
Sipan Li
Ping Chen
Lu Zhang
Yi Meng
Zhiyong Wu
H. Meng
Qiao Tian
Yuping Wang
Yuxuan Wang
35
3
0
09 May 2023
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Ruiqi Li
Rongjie Huang
Lichao Zhang
Jinglin Liu
Zhou Zhao
25
4
0
08 May 2023
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition
Mohan Li
R. Doddipatla
Catalin Zorila
25
0
0
24 Apr 2023
Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Xilai Li
Goeric Huybrechts
S. Ronanki
Jeffrey J. Farris
S. Bodapati
33
6
0
18 Apr 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Hainan Xu
Fei Jia
Somshubra Majumdar
Hengguan Huang
Shinji Watanabe
Boris Ginsburg
27
17
0
13 Apr 2023
Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems
Jiajun Deng
Xurong Xie
Tianzi Wang
Mingyu Cui
Boyang Xue
Zengrui Jin
Guinan Li
Shujie Hu
Xunying Liu
26
5
0
15 Feb 2023
Sources of Richness and Ineffability for Phenomenally Conscious States
Xu Ji
Eric Elmoznino
George Deane
Axel Constant
G. Dumas
Guillaume Lajoie
Jonathan Simon
Yoshua Bengio
23
11
0
13 Feb 2023
Alien Coding
Thibault Gauthier
Miroslav Olsák
J. Urban
30
7
0
27 Jan 2023
BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition
Will Rieger
BDL
UQCV
19
0
0
16 Jan 2023
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek
Georgios Paraskevopoulos
Theodoros Kouzelis
Georgios Rouvalis
Athanasios Katsamanis
V. Katsouros
Alexandros Potamianos
VLM
23
7
0
31 Dec 2022
Training Integer-Only Deep Recurrent Neural Networks
V. Nia
Eyyub Sari
Vanessa Courville
M. Asgharian
MQ
45
2
0
22 Dec 2022
AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Xingshan Zeng
Liangyou Li
Qun Liu
24
5
0
17 Dec 2022
DDSupport: Language Learning Support System that Displays Differences and Distances from Model Speech
Kazuki Kawamura
Jun Rekimoto
26
0
0
08 Dec 2022
Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
Stefan Braun
Erik McDermott
Roger Hsiao
34
1
0
29 Nov 2022
A Machine Learning-based Framework for Predictive Maintenance of Semiconductor Laser for Optical Communication
K. Abdelli
H. Griesser
S. Pachnicke
11
11
0
05 Nov 2022