Attention-Based Models for Speech Recognition

24 June 2015

Papers citing "Attention-Based Models for Speech Recognition"

50 / 299 papers shown

Title
Enhancing short-term traffic prediction by integrating trends and fluctuations with attention mechanism A. Das Agnimitra Sengupta S. I. Guler AI4TS 26 0 0 28 Apr 2025
A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network Chih-Chyau Yang Tian-Sheuan Chang 60 1 0 27 Mar 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers Adam Stooke Rohit Prabhavalkar K. Sim P. M. Mengibar 37 0 0 06 Feb 2025
emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography Viswanath Sivakumar Jeffrey Seely Alan Du Sean R Bittner Adam Berenzweig Anuoluwapo Bolarinwa Alexandre Gramfort Michael I Mandel 13 3 0 26 Oct 2024
HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR Hainan Xu Travis M. Bartley Vladimir Bataev Boris Ginsburg 141 0 0 03 Oct 2024
The Conformer Encoder May Reverse the Time Dimension Robin Schmitt Albert Zeyer Mohammad Zeineldeen Ralf Schluter Hermann Ney 31 0 0 01 Oct 2024
Lightweight Transducer Based on Frame-Level Criterion Genshun Wan Mengzhi Wang Tingzhi Mao Hang Chen Z. Ye 44 1 0 05 Sep 2024
GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting Jun Li Jinying Wu Qiming Li Feifei Guo 39 0 0 31 Aug 2024
Improving Prediction of Need for Mechanical Ventilation using Cross-Attention Anwesh Mohanty S. Shashikumar Jonathan Y. Lam S. Nemati AI4CE 24 0 0 21 Jul 2024
An efficient text augmentation approach for contextualized Mandarin speech recognition Naijun Zheng Xucheng Wan Kai Liu Ziqing Du Zhou Huan 40 1 0 14 Jun 2024
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units Xuankai Chang Jiatong Shi Jinchuan Tian Yuning Wu Yuxun Tang Yihan Wu Shinji Watanabe Yossi Adi Xie Chen Qin Jin 43 15 0 11 Jun 2024
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation Keqi Deng Philip C. Woodland 31 4 0 06 Jun 2024
"You should probably read this": Hedge Detection in Text Denys Katerenchuk Rivka Levitan 41 1 0 22 May 2024
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images Chengxi Han Chen Wu Haonan Guo Meiqi Hu Hongruixuan Chen 28 89 0 14 Apr 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition Hainan Xu Zhehuai Chen Fei Jia Boris Ginsburg 33 0 0 04 Apr 2024
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in Sorani Kurdish Sana Isam Hossein Hassani 23 0 0 29 Mar 2024
An Attention Long Short-Term Memory based system for automatic classification of speech intelligibility Miguel Fernández-Díaz A. Gallardo-Antolín 14 40 0 05 Feb 2024
On Speaker Attribution with SURT Desh Raj Matthew Wiesner Matthew Maciejewski Leibny Paola García-Perera Daniel Povey Sanjeev Khudanpur 24 3 0 28 Jan 2024
Improving ASR Contextual Biasing with Guided Attention Jiyang Tang Kwangyoun Kim Suwon Shon Felix Wu Prashant Sridhar Shinji Watanabe 19 8 0 16 Jan 2024
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition Peng Shen Xugang Lu Hisashi Kawai 27 1 0 18 Dec 2023
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Shaojin Ding David Qiu David Rim Yanzhang He Oleg Rybakov ... Tara N. Sainath Zhonglin Han Jian Li Amir Yazdanbakhsh Shivani Agrawal MQ 26 9 0 13 Dec 2023
Adaptive Multi-head Contrastive Learning Lei Wang Piotr Koniusz Tom Gedeon Liang Zheng 30 4 0 09 Oct 2023
Updated Corpora and Benchmarks for Long-Form Speech Recognition Jennifer Drexler Fox Desh Raj Natalie Delworth Quinn Mcnamara Corey Miller Miguel Jetté AuLLM 26 7 0 26 Sep 2023
Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling Zheng Nan T. Dang V. Sethu Beena Ahmed BDL 19 2 0 21 Sep 2023
Semi-Autoregressive Streaming ASR With Label Context Siddhant Arora G. Saon Shinji Watanabe Brian Kingsbury AI4TS 23 5 0 19 Sep 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition E. Tsunoo Hayato Futami Yosuke Kashiwagi Siddhant Arora Shinji Watanabe 22 4 0 24 Jul 2023
Arbitrary point cloud upsampling via Dual Back-Projection Network Zhisong Liu Zijia Wang Zhen Jia 3DPC 19 4 0 18 Jul 2023
Machine Learning for Autonomous Vehicle's Trajectory Prediction: A comprehensive survey, Challenges, and Future Research Directions Vibha Bharilya Neetesh Kumar 28 47 0 12 Jul 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition Desh Raj Daniel Povey Sanjeev Khudanpur VLM 26 9 0 18 Jun 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer Lu Huang B. Li Jun Zhang Lu Lu Zejun Ma 26 2 0 07 Jun 2023
Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition Tianyi Xu Zhanheng Yang Kaixun Huang Pengcheng Guo Aoting Zhang Biao Li Changru Chen C. Li Linfu Xie 14 10 0 01 Jun 2023
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution Matías P. Pizarro D. Kolossa Asja Fischer AAML 35 1 0 26 May 2023
CopyNE: Better Contextual ASR by Copying Named Entities Shilin Zhou Zhenghua Li Yu Hong M. Zhang Zhefeng Wang Baoxing Huai 15 5 0 22 May 2023
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network Kaixun Huang Aoting Zhang Zhanheng Yang Pengcheng Guo Bingshen Mu Tianyi Xu Linfu Xie 27 16 0 21 May 2023
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition Xuandi Fu Kanthashree Mysore Sathyendra Ankur Gandhe Jing Liu Grant P. Strimel Ross McGowan Athanasios Mouchtaris 25 14 0 09 May 2023
Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing Jingbei Li Sipan Li Ping Chen Lu Zhang Yi Meng Zhiyong Wu H. Meng Qiao Tian Yuping Wang Yuxuan Wang 35 3 0 09 May 2023
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment Ruiqi Li Rongjie Huang Lichao Zhang Jinglin Liu Zhou Zhao 25 4 0 08 May 2023
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition Mohan Li R. Doddipatla Catalin Zorila 25 0 0 24 Apr 2023
Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR Xilai Li Goeric Huybrechts S. Ronanki Jeffrey J. Farris S. Bodapati 33 6 0 18 Apr 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations Hainan Xu Fei Jia Somshubra Majumdar Hengguan Huang Shinji Watanabe Boris Ginsburg 27 17 0 13 Apr 2023
Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems Jiajun Deng Xurong Xie Tianzi Wang Mingyu Cui Boyang Xue Zengrui Jin Guinan Li Shujie Hu Xunying Liu 26 5 0 15 Feb 2023
Sources of Richness and Ineffability for Phenomenally Conscious States Xu Ji Eric Elmoznino George Deane Axel Constant G. Dumas Guillaume Lajoie Jonathan Simon Yoshua Bengio 23 11 0 13 Feb 2023
Alien Coding Thibault Gauthier Miroslav Olsák J. Urban 30 7 0 27 Jan 2023
BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition Will Rieger BDL UQCV 19 0 0 16 Jan 2023
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek Georgios Paraskevopoulos Theodoros Kouzelis Georgios Rouvalis Athanasios Katsamanis V. Katsouros Alexandros Potamianos VLM 23 7 0 31 Dec 2022
Training Integer-Only Deep Recurrent Neural Networks V. Nia Eyyub Sari Vanessa Courville M. Asgharian MQ 45 2 0 22 Dec 2022
AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation Xingshan Zeng Liangyou Li Qun Liu 24 5 0 17 Dec 2022
DDSupport: Language Learning Support System that Displays Differences and Distances from Model Speech Kazuki Kawamura Jun Rekimoto 26 0 0 08 Dec 2022
Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation Stefan Braun Erik McDermott Roger Hsiao 34 1 0 29 Nov 2022
A Machine Learning-based Framework for Predictive Maintenance of Semiconductor Laser for Optical Communication K. Abdelli H. Griesser S. Pachnicke 11 11 0 05 Nov 2022