ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.00091
  4. Cited By
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition
  with Source Localization

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization

30 October 2020
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
ArXiv (abs)PDFHTML

Papers citing "Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization"

8 / 8 papers shown
Title
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
128
7
0
23 Oct 2023
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source
  Separation
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
85
12
0
29 Oct 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Wolfgang Mack
Julian Wechsler
Emanuel Habets
124
11
0
03 Jan 2022
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
88
15
0
22 Nov 2021
FAST-RIR: Fast neural diffuse room impulse response generator
FAST-RIR: Fast neural diffuse room impulse response generator
Anton Ratnarajah
Shi-Xiong Zhang
Meng Yu
Zhenyu Tang
Tianyi Zhou
Dong Yu
71
56
0
07 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
80
257
0
08 Sep 2021
Deep Learning based Multi-Source Localization with Source Splitting and
  its Effectiveness in Multi-Talker Speech Recognition
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
115
80
0
16 Feb 2021
The 2020 ESPnet update: new features, broadened applications,
  performance improvements, and future plans
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
108
38
0
23 Dec 2020
1