On Modular Training of Neural Acoustics-to-Word Model for LVCSR

3 March 2018
Zhehuai Chen
Qi Liu
Hao Li
Kai Yu

Papers citing "On Modular Training of Neural Acoustics-to-Word Model for LVCSR"

18 / 18 papers shown
Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Bingshen Mu
Kun Wei
Pengcheng Guo
Lei Xie
196
5
0
12 Jul 2025
Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Qijie Shao
Pengcheng Guo
Jinghao Yan
Pengfei Hu
Lei Xie
104
12
0
13 Nov 2023
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wei Wang
Shuo Ren
Yao Qian
Shujie Liu
Yu Shi
Y. Qian
Michael Zeng
131
21
0
23 Oct 2021
Topic Classification on Spoken Documents Using Deep Acoustic and Linguistic Features
Tan Liu
Wu Guo
Bin Gu
75
5
0
16 Jun 2021
Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Shuai Zhang
Jiangyan Yi
Zhengkun Tian
Ye Bai
Jianhua Tao
Zhengqi Wen
86
15
0
28 Oct 2020
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Qi Liu
Zhehuai Chen
Hao Li
Mingkun Huang
Yizhou Lu
Kai Yu
137
10
0
31 Jul 2020
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
Mohammad Zeineldeen
Albert Zeyer
Wei Zhou
T. Ng
Ralf Schlüter
Hermann Ney
208
2
0
19 May 2020
Audio Caption: Listen and Tell
Mengyue Wu
Heinrich Dinkel
Kai Yu
183
67
0
25 Feb 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
90
17
0
18 Feb 2019
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
142
20
0
06 Feb 2019
Speaker Adaptation for End-to-End CTC Models
Ke Li
Jinyu Li
Yong Zhao
Kshitiz Kumar
Jiawei Liu
102
25
0
04 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
124
26
0
31 Dec 2018
End-to-end contextual speech recognition using class language models and a token passing decoder
Zhehuai Chen
Mahaveer Jain
Yongqiang Wang
M. Seltzer
Christian Fuegen
167
58
0
05 Dec 2018
Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Hainan Xu
Shuoyang Ding
Shinji Watanabe
180
39
0
10 Nov 2018
Linguistic Search Optimization for Deep Learning Based LVCSR
Zhehuai Chen
127
1
0
02 Aug 2018
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting
Zhehuai Chen
Y. Qian
Kai Yu
88
20
0
02 Aug 2018
Acoustic-to-Word Recognition with Sequence-to-Sequence Models
Shruti Palaskar
Florian Metze
155
21
0
23 Jul 2018
A GPU-based WFST Decoder with Exact Lattice Generation
Zhehuai Chen
Justin Luitjens
Hainan Xu
Yiming Wang
Daniel Povey
Sanjeev Khudanpur
216
17
0
09 Apr 2018