Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.06220
Cited By
Speaker Diarization with Region Proposal Network
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
14 February 2020
Zili Huang
Shinji Watanabe
Yusuke Fujita
Leibny Paola García-Perera
Yiwen Shao
Daniel Povey
Sanjeev Khudanpur
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Speaker Diarization with Region Proposal Network"
36 / 36 papers shown
Domain-Aware Speaker Diarization On African-Accented English
Chibuzor Okocha
Kelechi Ezema
Christan Grant
155
1
0
25 Sep 2025
An Investigation Into Explainable Audio Hate Speech Detection
SIGDIAL Conferences (SIGDIAL), 2024
Jinmyeong An
Wonjun Lee
Yejin Jeon
Jungseul Ok
Yunsu Kim
Gary Geunbae Lee
258
5
0
12 Aug 2024
Online speaker diarization of meetings guided by speech separation
Elio Gruttadauria
Mathieu Fontaine
S. Essid
260
8
0
30 Jan 2024
Multi-channel Conversational Speaker Separation via Neural Diarization
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
H. Taherian
DeLiang Wang
BDL
262
25
0
15 Nov 2023
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
401
8
0
12 Oct 2023
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Gaobin Yang
Maokui He
Shutong Niu
Ruoyu Wang
Yanyan Yue
Shuangqing Qian
Shilong Wu
Jun Du
Chin-Hui Lee
312
17
0
17 Sep 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
Computer Speech and Language (CSL), 2023
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
274
17
0
29 May 2023
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator
Interspeech (Interspeech), 2023
Lingwei Meng
Jiawen Kang
Mingyu Cui
Haibin Wu
Xixin Wu
Helen M. Meng
208
13
0
25 May 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Speech Communication (Speech Commun.), 2023
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
364
5
0
21 Mar 2023
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Gaofeng Cheng
Yifan Chen
Runyan Yang
Qingxu Li
Zehui Yang
...
Qingqing Zhang
Linfu Xie
Y. Qian
Kong Aik Lee
Yonghong Yan
185
9
0
17 Aug 2022
Multi-target Extractor and Detector for Unknown-number Speaker Diarization
IEEE Signal Processing Letters (SPL), 2022
Chin-Yi Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
275
12
0
30 Mar 2022
Using Active Speaker Faces for Diarization in TV shows
Rahul Sharma
Shrikanth Narayanan
CVBM
214
11
0
30 Mar 2022
AVA-AVD: Audio-Visual Speaker Diarization in the Wild
ACM Multimedia (MM), 2021
Eric Z. Xu
Zeyang Song
Satoshi Tsutsui
C. Feng
Mang Ye
Mike Zheng Shou
VGen
551
59
0
29 Nov 2021
Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity
You Jin Kim
Hee-Soo Heo
Jee-weon Jung
Youngki Kwon
Bong-Jin Lee
Joon Son Chung
304
3
0
07 Oct 2021
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Xiong Xiao
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
307
57
0
07 Oct 2021
Localization Based Sequential Grouping for Continuous Speech Separation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Zhong-Qiu Wang
DeLiang Wang
283
13
0
14 Jul 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
252
85
0
20 Jun 2021
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
Interspeech (Interspeech), 2021
K. Kinoshita
Marc Delcroix
Naohiro Tawara
319
84
0
19 May 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
254
23
0
05 May 2021
Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings
Interspeech (Interspeech), 2021
Kiran Karra
A. McCree
258
2
0
06 Apr 2021
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem
Interspeech (Interspeech), 2021
Desh Raj
Sanjeev Khudanpur
337
3
0
05 Apr 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
174
7
0
02 Mar 2021
Contrastive Separative Coding for Self-supervised Representation Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
SSL
188
3
0
01 Mar 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Computer Speech and Language (CSL), 2021
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
869
411
0
24 Jan 2021
Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Yawen Xue
Shota Horiguchi
Yusuke Fujita
Yuki Takashima
Shinji Watanabe
Leibny Paola García-Perera
Kenji Nagamatsu
283
6
0
21 Jan 2021
Speaker activity driven neural speech extraction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Marc Delcroix
Kateřina Žmolíková
Tsubasa Ochiai
K. Kinoshita
Tomohiro Nakatani
340
39
0
14 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Computer Speech and Language (CSL), 2020
Federico Landini
Jan Profant
Mireia Díez
L. Burget
521
250
0
29 Dec 2020
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Computing and informatics (CAI), 2020
Wei Yao
Shen Chen
Jiamin Cui
Yaolin Lou
304
7
0
21 Dec 2020
End-to-End Speaker Diarization as Post-Processing
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Shota Horiguchi
Leibny Paola García-Perera
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
298
45
0
18 Dec 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
252
37
0
05 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
291
115
0
03 Nov 2020
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs
Desh Raj
Leibny Paola García-Perera
Zili Huang
Shinji Watanabe
Daniel Povey
A. Stolcke
Sanjeev Khudanpur
251
81
0
03 Nov 2020
Online Speaker Diarization with Relation Network
Xiang Li
Yucheng Zhao
Chong Luo
Wenjun Zeng
196
2
0
17 Sep 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Spoken Language Technology Workshop (SLT), 2020
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
270
52
0
11 Aug 2020
Neural Speaker Diarization with Speaker-Wise Chain Rule
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Jing Shi
Kenji Nagamatsu
244
51
0
02 Jun 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
432
226
0
20 May 2020
1
Page 1 of 1