Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2011.00091
Cited By

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition
with Source Localization

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

30 October 2020

Aswin Shanmugam Subramanian

Shinji Watanabe

Shi-Xiong Zhang

ArXiv (abs)PDF HTML

Papers citing "Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization"

9 / 9 papers shown

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Philip J. B. Jackson

794

10

0

23 Oct 2023

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source
Separation

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source SeparationNeural Information Processing Systems (NeurIPS), 2022

Moitreya Chatterjee

223

14

0

29 Oct 2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and
Recognition in Real Multiparty Conversational Environments

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational EnvironmentsInterspeech (Interspeech), 2022

Aditya Arie Nugraha

Kouhei Sekiguchi

Mathieu Fontaine

Kazuyoshi Yoshii

148

0

0

15 Jul 2022

Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms

Signal-Aware Direction-of-Arrival Estimation Using Attention MechanismsComputer Speech and Language (CSL), 2022

Julian Wechsler

325

16

0

03 Jan 2022

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature

Multi-Channel Multi-Speaker ASR Using 3D Spatial FeatureIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Shi-Xiong Zhang

196

18

0

22 Nov 2021

FAST-RIR: Fast neural diffuse room impulse response generator

FAST-RIR: Fast neural diffuse room impulse response generator

Anton Ratnarajah

Shi-Xiong Zhang

218

68

0

07 Oct 2021

A Survey of Sound Source Localization with Deep Learning Methods

A Survey of Sound Source Localization with Deep Learning MethodsJournal of the Acoustical Society of America (JASA), 2021

Pierre-Amaury Grumiaux

Alexandre Guérin

388

340

0

08 Sep 2021

Deep Learning based Multi-Source Localization with Source Splitting and
its Effectiveness in Multi-Talker Speech Recognition

Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech RecognitionComputer Speech and Language (CSL), 2021

Aswin Shanmugam Subramanian

Shinji Watanabe

309

91

0

16 Feb 2021

The 2020 ESPnet update: new features, broadened applications,
performance improvements, and future plans

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Shinji Watanabe

...

Aswin Shanmugam Subramanian

Wangyou Zhang

249

39

0

23 Dec 2020

Page 1 of 1