ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.02405
  4. Cited By
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized
  Maximum Eigengap

Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

IEEE Signal Processing Letters (IEEE SPL), 2020
5 March 2020
Tae Jin Park
Kyu Jeong Han
Manoj Kumar
Shrikanth Narayanan
ArXiv (abs)PDFHTML

Papers citing "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

50 / 66 papers shown
Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric
Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric
Nikita Neveditsin
Pawan Lingras
V. Mago
180
0
0
24 Nov 2025
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Alexander Polok
Dominik Klement
M. Kocour
Jiangyu Han
Federico Landini
Bolaji Yusuf
Sanjeev Khudanpur
Sanjeev Khudanpur
J. Černocký
L. Burget
298
0
0
03 Jan 2025
Joint Training of Speaker Embedding Extractor, Speech and Overlap
  Detection for Diarization
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Petr Pálka
Federico Landini
Dominik Klement
Mireia Díez
Anna Silnova
Marc Delcroix
L. Burget
VLM
217
1
0
04 Nov 2024
Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization
Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization
Mao-Kui He
Jun Du
Shu-Tong Niu
Qing-Feng Liu
Chin-Hui Lee
230
2
0
15 Oct 2024
Self-Tuning Spectral Clustering for Speaker Diarization
Self-Tuning Spectral Clustering for Speaker DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Nikhil Raghav
Avisek Gupta
Md Sahidullah
Swagatam Das
491
5
0
16 Sep 2024
Leveraging Self-Supervised Learning for Speaker Diarization
Leveraging Self-Supervised Learning for Speaker DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiangyu Han
Federico Landini
Johan Rohdin
Anna Silnova
Mireia Díez
Lukas Burget
371
38
0
14 Sep 2024
Integrating Audio, Visual, and Semantic Information for Enhanced
  Multimodal Speaker Diarization
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization
Luyao Cheng
Hui Wang
Siqi Zheng
Yafeng Chen
Rongjie Huang
Qinglin Zhang
Qian Chen
Xihao Li
255
5
0
22 Aug 2024
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant
  Automatic Speech Recognition and Diarization
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization
Samuele Cornell
Taejin Park
Steve Huang
Christoph Boeddeker
Xuankai Chang
Matthew Maciejewski
Sanjeev Khudanpur
Paola García
Shinji Watanabe
234
27
0
23 Jul 2024
Assessing the Robustness of Spectral Clustering for Deep Speaker
  Diarization
Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization
Nikhil Raghav
Md Sahidullah
219
4
0
21 Mar 2024
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant
  Meeting Transcription
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting TranscriptionInterspeech (Interspeech), 2024
Alon Vinnikov
Amir Ivry
Aviv Hurvitz
Igor Abramovski
S. Koubi
...
S. Sivasankaran
Yifan Gong
Min Tang
Huaming Wang
Eyal Krupka
242
50
0
16 Jan 2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language
  Models
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Quan Wang
Yiling Huang
Guanlong Zhao
Evan Clark
Wei Xia
Hank Liao
AuLLM
721
23
0
07 Jan 2024
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Federico Landini
Mireia Díez
Themos Stafylakis
Lukávs Burget
445
24
0
07 Dec 2023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's
  DASR System
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
T. Park
He Huang
Ante Jukić
Kunal Dhawan
Krishna C. Puvvada
Nithin Rao Koluguri
Nikolay Karpov
A. Laptev
Jagadeesh Balam
Boris Ginsburg
237
11
0
18 Oct 2023
Discriminative Training of VBx Diarization
Discriminative Training of VBx DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Dominik Klement
Mireia Díez
Federico Landini
Lukávs Burget
Anna Silnova
Marc Delcroix
Naohiro Tawara
455
5
0
04 Oct 2023
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription
  System
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription SystemAutomatic Speech Recognition & Understanding (ASRU), 2023
Xiang Lyu
Yuhang Cao
Qing Wang
Jingjing Yin
Yuguang Yang
Pengpeng Zou
G. Zachmann
Heng Lu
VLM
200
4
0
28 Sep 2023
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Profile-Error-Tolerant Target-Speaker Voice Activity DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Midia Yousefi
Takuya Yoshioka
Jian Wu
271
7
0
21 Sep 2023
DiariST: Streaming Speech Translation with Speaker Diarization
DiariST: Streaming Speech Translation with Speaker DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muqiao Yang
Naoyuki Kanda
Xiaofei Wang
Junkun Chen
Peidong Wang
Jian Xue
Jinyu Li
Takuya Yoshioka
285
7
0
14 Sep 2023
Enhancing Speaker Diarization with Large Language Models: A Contextual
  Beam Search Approach
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search ApproachIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
T. Park
Kunal Dhawan
Nithin Rao Koluguri
Jagadeesh Balam
279
25
0
11 Sep 2023
Affinity Clustering Framework for Data Debiasing Using Pairwise
  Distribution Discrepancy
Affinity Clustering Framework for Data Debiasing Using Pairwise Distribution Discrepancy
Siamak Ghodsi
Eirini Ntoutsi
132
2
0
02 Jun 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordingsComputer Speech and Language (CSL), 2023
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
259
17
0
29 May 2023
Exploring Speaker-Related Information in Spoken Language Understanding
  for Better Speaker Diarization
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker DiarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Luyao Cheng
Siqi Zheng
Zhang Qinglin
Haibo Wang
Yafeng Chen
Qian Chen
201
6
0
22 May 2023
End-to-End Integration of Speech Separation and Voice Activity Detection
  for Low-Latency Diarization of Telephone Conversations
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone ConversationsSpeech Communication (Speech Commun.), 2023
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
Alessio Brutti
S. Squartini
348
5
0
21 Mar 2023
TS-SEP: Joint Diarization and Separation Conditioned on Estimated
  Speaker Embeddings
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker EmbeddingsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Christoph Boeddeker
Aswin Shanmugam Subramanian
Gordon Wichern
Reinhold Haeb-Umbach
Jonathan Le Roux
347
34
0
07 Mar 2023
GPU-accelerated Guided Source Separation for Meeting Transcription
GPU-accelerated Guided Source Separation for Meeting TranscriptionInterspeech (Interspeech), 2022
Desh Raj
Daniel Povey
Sanjeev Khudanpur
388
47
0
10 Dec 2022
TSUP Speaker Diarization System for Conversational Short-phrase Speaker
  Diarization Challenge
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization ChallengeInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Bowen Pang
Huan Zhao
Gaosheng Zhang
Xiaoyue Yang
Yanguo Sun
Li Zhang
Qing Wang
Linfu Xie
BDL
178
3
0
26 Oct 2022
Speaker Diarization Based on Multi-channel Microphone Array in
  Small-scale Meeting
Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Yu Du
R. Zhou
136
1
0
26 Oct 2022
Highly Efficient Real-Time Streaming and Fully On-Device Speaker
  Diarization with Multi-Stage Clustering
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Quan Wang
Yiling Huang
Han Lu
Guanlong Zhao
Ignacio López Moreno
283
12
0
25 Oct 2022
Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation
Spectral Clustering-aware Learning of Embeddings for Speaker DiarisationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Evonne Lee
Guangzhi Sun
Chuxu Zhang
P. Woodland
218
1
0
24 Oct 2022
Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting
Spatial-aware Speaker Diarization for Multi-channel Multi-party MeetingInterspeech (Interspeech), 2022
Jie Wang
Yuji Liu
Binling Wang
Yiming Zhi
Song Li
Shipeng Xia
Jiayang Zhang
Feng Tong
Lin Li
Q. Hong
174
11
0
24 Sep 2022
The BUCEA Speaker Diarization System for the VoxCeleb Speaker
  Recognition Challenge 2022
The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
R. Zhou
Yu Du
Che-Ming Hu
158
0
0
20 Sep 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global
  and Local Attractors
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local AttractorsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
298
30
0
06 Jun 2022
Reformulating Speaker Diarization as Community Detection With Emphasis
  On Topological Structure
Reformulating Speaker Diarization as Community Detection With Emphasis On Topological StructureIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Siqi Zheng
Hongbin Suo
172
8
0
26 Apr 2022
Improving the Naturalness of Simulated Conversations for End-to-End
  Neural Diarization
Improving the Naturalness of Simulated Conversations for End-to-End Neural DiarizationThe Speaker and Language Recognition Workshop (Odyssey), 2022
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
272
23
0
24 Apr 2022
Multimodal Clustering with Role Induced Constraints for Speaker
  Diarization
Multimodal Clustering with Role Induced Constraints for Speaker DiarizationInterspeech (Interspeech), 2022
Nikolaos Flemotomos
Shrikanth Narayanan
269
7
0
01 Apr 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
Streaming Speaker-Attributed ASR with Token-Level Speaker EmbeddingsInterspeech (Interspeech), 2022
Naoyuki Kanda
Jian Wu
Yu Wu
Xiong Xiao
Zhong Meng
Xiaofei Wang
Yashesh Gaur
Zhuo Chen
Jinyu Li
Takuya Yoshioka
269
38
0
30 Mar 2022
Multi-scale Speaker Diarization with Dynamic Scale Weighting
Multi-scale Speaker Diarization with Dynamic Scale WeightingInterspeech (Interspeech), 2022
Tae Jin Park
Nithin Rao Koluguri
Jagadeesh Balam
Boris Ginsburg
242
28
0
30 Mar 2022
Using Active Speaker Faces for Diarization in TV shows
Using Active Speaker Faces for Diarization in TV shows
Rahul Sharma
Shrikanth Narayanan
CVBM
201
11
0
30 Mar 2022
The xmuspeech system for multi-channel multi-party meeting transcription
  challenge
The xmuspeech system for multi-channel multi-party meeting transcription challenge
Jie Wang
Yuji Liu
Binling Wang
Yiming Zhi
Song Li
Shipeng Xia
Jiayang Zhang
Lin Li
Q. Hong
Feng Tong
192
0
0
11 Feb 2022
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party
  meeting transcription (M2MeT) challenge
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Maokui He
Xiang Lv
Weilin Zhou
Jingjing Yin
Xiaoqi Zhang
...
Shutong Niu
Yuhang Cao
Heng Lu
Jun Du
Chin-Hui Lee
224
8
0
10 Feb 2022
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel
  Multi-party Meeting Transcription Challenge
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge
Jingguang Tian
Xinhui Hu
Xinkang Xu
218
9
0
10 Feb 2022
The Volcspeech system for the ICASSP 2022 multi-channel multi-party
  meeting transcription challenge
The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chen Shen
Yi Y. Liu
Wenzhi Fan
Bin Wang
Shi-Xue Wen
Yao Tian
Jun Zhang
Jingsheng Yang
Zejun Ma
180
5
0
09 Feb 2022
Multi-Channel End-to-End Neural Diarization with Distributed Microphones
Multi-Channel End-to-End Neural Diarization with Distributed MicrophonesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shota Horiguchi
Yuki Takashima
Leibny Paola García-Perera
Shinji Watanabe
Yohei Kawaguchi
345
26
0
10 Oct 2021
TitaNet: Neural Model for speaker representation with 1D Depth-wise
  separable convolutions and global context
TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global contextIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Nithin Rao Koluguri
Taejin Park
Boris Ginsburg
ViT
276
156
0
08 Oct 2021
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number
  of Speakers using End-to-End Speaker-Attributed ASR
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Xiong Xiao
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
262
54
0
07 Oct 2021
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer
  Transducer Speaker Turn Detection
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wei Xia
Han Lu
Quan Wang
Anshuman Tripathi
Yiling Huang
Ignacio López Moreno
Hasim Sak
235
58
0
23 Sep 2021
Joint speaker diarisation and tracking in switching state-space model
Joint speaker diarisation and tracking in switching state-space modelSpoken Language Technology Workshop (SLT), 2021
J. H. M. Wong
Yifan Gong
141
0
0
23 Sep 2021
Diarisation using location tracking with agglomerative clustering
Diarisation using location tracking with agglomerative clusteringSpoken Language Technology Workshop (SLT), 2021
J. H. M. Wong
Igor Abramovski
Xiong Xiao
Yifan Gong
224
1
0
22 Sep 2021
Overlap-aware low-latency online speaker diarization based on end-to-end
  local segmentation
Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation
Juan Manuel Coria
H. Bredin
Sahar Ghannay
Sophie Rosset
292
37
0
14 Sep 2021
XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021
XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021
Jie Wang
Fuchuan Tong
Zhi-Cong Chen
Lin Li
Q. Hong
Haodong Zhou
231
1
0
06 Sep 2021
The ByteDance Speaker Diarization System for the VoxCeleb Speaker
  Recognition Challenge 2021
The ByteDance Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021
Keke Wang
Xudong Mao
Hao Wu
Chen Ding
Chuxiang Shang
Rui Xia
Yuxuan Wang
212
13
0
05 Sep 2021
12
Next
Page 1 of 2