ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.00091
  4. Cited By
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition
  with Source Localization

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
30 October 2020
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Yong-mei Xu
Shi-Xiong Zhang
Dong Yu
ArXiv (abs)PDFHTML

Papers citing "Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization"

9 / 9 papers shown
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
781
10
0
23 Oct 2023
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source
  Separation
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source SeparationNeural Information Processing Systems (NeurIPS), 2022
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
222
14
0
29 Oct 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and
  Recognition in Real Multiparty Conversational Environments
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational EnvironmentsInterspeech (Interspeech), 2022
Yicheng Du
Aditya Arie Nugraha
Kouhei Sekiguchi
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
148
0
0
15 Jul 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Signal-Aware Direction-of-Arrival Estimation Using Attention MechanismsComputer Speech and Language (CSL), 2022
Wolfgang Mack
Julian Wechsler
Emanuel Habets
319
16
0
03 Jan 2022
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Multi-Channel Multi-Speaker ASR Using 3D Spatial FeatureIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
196
18
0
22 Nov 2021
FAST-RIR: Fast neural diffuse room impulse response generator
FAST-RIR: Fast neural diffuse room impulse response generator
Anton Ratnarajah
Shi-Xiong Zhang
Meng Yu
Zhenyu Tang
Tianyi Zhou
Dong Yu
217
67
0
07 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
A Survey of Sound Source Localization with Deep Learning MethodsJournal of the Acoustical Society of America (JASA), 2021
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
372
340
0
08 Sep 2021
Deep Learning based Multi-Source Localization with Source Splitting and
  its Effectiveness in Multi-Talker Speech Recognition
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech RecognitionComputer Speech and Language (CSL), 2021
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
304
91
0
16 Feb 2021
The 2020 ESPnet update: new features, broadened applications,
  performance improvements, and future plans
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
241
39
0
23 Dec 2020
1
Page 1 of 1