1906.10876
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
26 June 2019
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
Papers citing "Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition"
11 / 11 papers shown
Target Speaker ASR with Whisper
Alexander Polok
Dominik Klement
Sanjeev Khudanpur
Kevin Duh
J. Černocký
L. Burget
181
5
0
17 Jan 2025
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Alexander Polok
Dominik Klement
M. Kocour
Jiangyu Han
Federico Landini
Bolaji Yusuf
Sanjeev Khudanpur
Kevin Duh
J. Černocký
L. Burget
52
0
0
03 Jan 2025
Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning
Amit Kumar Bhuyan
H. Dutta
Subir Biswas
FedML
71
2
0
16 Apr 2024
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
73
14
0
06 Jul 2021
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Yuki Takashima
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Leibny Paola García-Perera
Kenji Nagamatsu
60
26
0
08 Jun 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Takafumi Moriya
Naoyuki Kamo
68
23
0
02 Jun 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
382
337
0
24 Jan 2021
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Ivan Medennikov
M. Korenevsky
Tatiana Prisyach
Yuri Y. Khokhlov
Mariya Korenevskaya
...
Anton Mitrofanov
A. Andrusenko
Ivan Podluzhny
A. Laptev
A. Romanenko
61
205
0
14 May 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
83
122
0
28 Mar 2020
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
Naoyuki Kanda
Shota Horiguchi
Yusuke Fujita
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
58
36
0
17 Sep 2019