ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08041
  4. Cited By
Multi-Stream End-to-End Speech Recognition
v1v2 (latest)

Multi-Stream End-to-End Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019
17 June 2019
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky
ArXiv (abs)PDFHTML

Papers citing "Multi-Stream End-to-End Speech Recognition"

14 / 14 papers shown
SplashNet: Split-and-Share Encoders for Accurate and Efficient Typing with Surface Electromyography
SplashNet: Split-and-Share Encoders for Accurate and Efficient Typing with Surface Electromyography
Nima Hadidi
Jason Chan
Ebrahim Feghhi
Jonathan C. Kao
148
0
0
14 Jun 2025
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation
Hemant Yadav
Sunayana Sitaram
R. Shah
SSL
306
0
0
20 Aug 2024
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised
  Learning with Masked Unit Prediction
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit PredictionInternational Conference on Learning Representations (ICLR), 2023
Jiatong Shi
Hirofumi Inaguma
Xutai Ma
Ilia Kulikov
Anna Y. Sun
261
36
0
04 Oct 2023
Exploration on HuBERT with Multiple Resolutions
Exploration on HuBERT with Multiple ResolutionsInterspeech (Interspeech), 2023
Jiatong Shi
Yun Tang
Hirofumi Inaguma
Hongyu Gong
J. Pino
Shinji Watanabe
358
11
0
01 Jun 2023
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech RecognitionInterspeech (Interspeech), 2022
Jiamin Xie
John H. L. Hansen
183
1
0
04 Jul 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent
  Architecture for Acoustic Signal Enhancement
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal EnhancementIEEE Transactions on Artificial Intelligence (IEEE TAI), 2022
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
134
4
0
24 Jan 2022
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Tassadaq Hussain
M. Gogate
K. Dashtipour
Amir Hussain
VLM
219
19
0
18 Nov 2021
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk
  and Far-Talk Speech Recognition
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
F. Weninger
M. Gaudesi
Ralf Leibold
R. Gemello
P. Zhan
91
4
0
17 Sep 2021
Scaling sparsemax based channel selection for speech recognition with
  ad-hoc microphone arrays
Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arraysInterspeech (Interspeech), 2021
Junqi Chen
Xiao-Lei Zhang
204
12
0
29 Mar 2021
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness
  of Multi-Stream End-to-End ASR
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASRSpoken Language Technology Workshop (SLT), 2021
Ruizhi Li
Gregory Sell
H. Hermansky
143
2
0
05 Feb 2021
Generalized Operating Procedure for Deep Learning: an Unconstrained
  Optimal Design Perspective
Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective
Shen Chen
Mingwei Zhang
Jiamin Cui
Wei Yao
CVBM
125
0
0
31 Dec 2020
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker VerificationComputing and informatics (CAI), 2020
Wei Yao
Shen Chen
Jiamin Cui
Yaolin Lou
218
7
0
21 Dec 2020
Multistream CNN for Robust Acoustic Modeling
Multistream CNN for Robust Acoustic Modeling
Kyu Jeong Han
Jing Pan
Venkata Krishna Naveen Tadala
T. Ma
Daniel Povey
158
40
0
21 May 2020
A practical two-stage training strategy for multi-stream end-to-end
  speech recognition
A practical two-stage training strategy for multi-stream end-to-end speech recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
102
7
0
23 Oct 2019
1
Page 1 of 1