Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1906.08041
Cited By
v1
v2 (latest)
Multi-Stream End-to-End Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019
17 June 2019
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Shinji Watanabe
Takaaki Hori
H. Hermansky
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Stream End-to-End Speech Recognition"
14 / 14 papers shown
SplashNet: Split-and-Share Encoders for Accurate and Efficient Typing with Surface Electromyography
Nima Hadidi
Jason Chan
Ebrahim Feghhi
Jonathan C. Kao
148
0
0
14 Jun 2025
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation
Hemant Yadav
Sunayana Sitaram
R. Shah
SSL
306
0
0
20 Aug 2024
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction
International Conference on Learning Representations (ICLR), 2023
Jiatong Shi
Hirofumi Inaguma
Xutai Ma
Ilia Kulikov
Anna Y. Sun
261
36
0
04 Oct 2023
Exploration on HuBERT with Multiple Resolutions
Interspeech (Interspeech), 2023
Jiatong Shi
Yun Tang
Hirofumi Inaguma
Hongyu Gong
J. Pino
Shinji Watanabe
358
11
0
01 Jun 2023
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition
Interspeech (Interspeech), 2022
Jiamin Xie
John H. L. Hansen
183
1
0
04 Jul 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2022
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
134
4
0
24 Jan 2022
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Tassadaq Hussain
M. Gogate
K. Dashtipour
Amir Hussain
VLM
219
19
0
18 Nov 2021
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition
F. Weninger
M. Gaudesi
Ralf Leibold
R. Gemello
P. Zhan
91
4
0
17 Sep 2021
Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arrays
Interspeech (Interspeech), 2021
Junqi Chen
Xiao-Lei Zhang
204
12
0
29 Mar 2021
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR
Spoken Language Technology Workshop (SLT), 2021
Ruizhi Li
Gregory Sell
H. Hermansky
143
2
0
05 Feb 2021
Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective
Shen Chen
Mingwei Zhang
Jiamin Cui
Wei Yao
CVBM
125
0
0
31 Dec 2020
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Computing and informatics (CAI), 2020
Wei Yao
Shen Chen
Jiamin Cui
Yaolin Lou
218
7
0
21 Dec 2020
Multistream CNN for Robust Acoustic Modeling
Kyu Jeong Han
Jing Pan
Venkata Krishna Naveen Tadala
T. Ma
Daniel Povey
158
40
0
21 May 2020
A practical two-stage training strategy for multi-stream end-to-end speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Ruizhi Li
Gregory Sell
Xiaofei Wang
Shinji Watanabe
H. Hermansky
102
7
0
23 Oct 2019
1
Page 1 of 1