Multi-Stream End-to-End Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019
17 June 2019
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky

Papers citing "Multi-Stream End-to-End Speech Recognition"

14 / 14 papers shown
SplashNet: Split-and-Share Encoders for Accurate and Efficient Typing with Surface Electromyography
Nima Hadidi
Jason Chan
Ebrahim Feghhi
Jonathan C. Kao
155
0
0
14 Jun 2025
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation
Hemant Yadav
Sunayana Sitaram
R. Shah
SSL
318
0
0
20 Aug 2024
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction
International Conference on Learning Representations (ICLR), 2023
Jiatong Shi
Hirofumi Inaguma
Xutai Ma
Ilia Kulikov
Anna Y. Sun
273
36
0
04 Oct 2023
Exploration on HuBERT with Multiple Resolutions
Interspeech, 2023
Jiatong Shi
Yun Tang
Hirofumi Inaguma
Hongyu Gong
J. Pino
Shinji Watanabe
366
11
0
01 Jun 2023
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Interspeech, 2022
Jiamin Xie
John H. L. Hansen
193
1
0
04 Jul 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2022
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
140
4
0
24 Jan 2022
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Tassadaq Hussain
M. Gogate
K. Dashtipour
Amir Hussain
VLM
227
19
0
18 Nov 2021
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
F. Weninger
M. Gaudesi
Ralf Leibold
R. Gemello
P. Zhan
101
4
0
17 Sep 2021
Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arrays
Interspeech, 2021
Junqi Chen
Xiao-Lei Zhang
207
12
0
29 Mar 2021
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Spoken Language Technology Workshop (SLT), 2021
Ruizhi Li
Gregory Sell
H. Hermansky
152
2
0
05 Feb 2021
Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective
Shen Chen
Mingwei Zhang
Jiamin Cui
Wei Yao
CVBM
132
0
0
31 Dec 2020
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Computing and Informatics (CAI), 2020
Wei Yao
Shen Chen
Jiamin Cui
Yaolin Lou
233
7
0
21 Dec 2020
Multistream CNN for Robust Acoustic Modeling
Kyu Jeong Han
Jing Pan
Venkata Krishna Naveen Tadala
T. Ma
Daniel Povey
184
40
0
21 May 2020
A practical two-stage training strategy for multi-stream end-to-end speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
106
7
0
23 Oct 2019