Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.00091
Cited By
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization
30 October 2020
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization"
8 / 8 papers shown
Title
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
128
7
0
23 Oct 2023
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
85
12
0
29 Oct 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Wolfgang Mack
Julian Wechsler
Emanuel Habets
124
11
0
03 Jan 2022
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
88
15
0
22 Nov 2021
FAST-RIR: Fast neural diffuse room impulse response generator
Anton Ratnarajah
Shi-Xiong Zhang
Meng Yu
Zhenyu Tang
Tianyi Zhou
Dong Yu
71
56
0
07 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
115
80
0
16 Feb 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
108
38
0
23 Dec 2020
1