Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

IEEE Signal Processing Letters (IEEE SPL), 2020

5 March 2020

Shrikanth Narayanan

Papers citing "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

50 / 66 papers shown

Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric

Nikita Neveditsin

Pawan Lingras

V. Mago

180

24 Nov 2025

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

298

03 Jan 2025

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization

217

04 Nov 2024

Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization

230

15 Oct 2024

Self-Tuning Spectral Clustering for Speaker Diarization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

491

16 Sep 2024

Leveraging Self-Supervised Learning for Speaker Diarization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Lukas Burget

371

14 Sep 2024

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization

Luyao Cheng

Hui Wang

Siqi Zheng

Yafeng Chen

Rongjie Huang

Qinglin Zhang

Qian Chen

Xihao Li

255

22 Aug 2024

The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization

Shinji Watanabe

234

23 Jul 2024

Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization

Nikhil Raghav

Md Sahidullah

219

21 Mar 2024

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription

Interspeech (Interspeech), 2024

...

242

16 Jan 2024

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

721

07 Jan 2024

DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

445

07 Dec 2023

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

T. Park

He Huang

Ante Jukić

Kunal Dhawan

Krishna C. Puvvada

Nithin Rao Koluguri

Nikolay Karpov

A. Laptev

Jagadeesh Balam

Boris Ginsburg

237

18 Oct 2023

Discriminative Training of VBx Diarization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Dominik Klement

455

04 Oct 2023

PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System

Automatic Speech Recognition & Understanding (ASRU), 2023

Qing Wang

Pengpeng Zou

Heng Lu

200

28 Sep 2023

Profile-Error-Tolerant Target-Speaker Voice Activity Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

271

21 Sep 2023

DiariST: Streaming Speech Translation with Speaker Diarization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

285

14 Sep 2023

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

T. Park

Kunal Dhawan

Nithin Rao Koluguri

Jagadeesh Balam

279

11 Sep 2023

Affinity Clustering Framework for Data Debiasing Using Pairwise Distribution Discrepancy

Siamak Ghodsi

Eirini Ntoutsi

132

02 Jun 2023

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

Computer Speech and Language (CSL), 2023

259

29 May 2023

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Siqi Zheng

Qian Chen

201

22 May 2023

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Speech Communication (Speech Commun.), 2023

348

21 Mar 2023

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Christoph Boeddeker

Aswin Shanmugam Subramanian

Gordon Wichern

Reinhold Haeb-Umbach

Jonathan Le Roux

347

07 Mar 2023

GPU-accelerated Guided Source Separation for Meeting Transcription

Interspeech (Interspeech), 2022

Desh Raj

Daniel Povey

Sanjeev Khudanpur

388

10 Dec 2022

TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

Xiaoyue Yang

178

26 Oct 2022

Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting

Yu Du

R. Zhou

136

26 Oct 2022

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

283

25 Oct 2022

Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Evonne Lee

Guangzhi Sun

Chuxu Zhang

P. Woodland

218

24 Oct 2022

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting

Interspeech (Interspeech), 2022

174

24 Sep 2022

The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

R. Zhou

Yu Du

Che-Ming Hu

158

20 Sep 2022

Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Shota Horiguchi

Shinji Watanabe

Leibny Paola García-Perera

Yuki Takashima

Yohei Kawaguchi

298

06 Jun 2022

Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Siqi Zheng

Hongbin Suo

172

26 Apr 2022

Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization

The Speaker and Language Recognition Workshop (Odyssey), 2022

Natsuo Yamashita

Shota Horiguchi

Takeshi Homma

272

24 Apr 2022

Multimodal Clustering with Role Induced Constraints for Speaker Diarization

Interspeech (Interspeech), 2022

Nikolaos Flemotomos

Shrikanth Narayanan

269

01 Apr 2022

Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings

Interspeech (Interspeech), 2022

269

30 Mar 2022

Multi-scale Speaker Diarization with Dynamic Scale Weighting

Interspeech (Interspeech), 2022

Tae Jin Park

Nithin Rao Koluguri

Jagadeesh Balam

Boris Ginsburg

242

30 Mar 2022

Using Active Speaker Faces for Diarization in TV shows

Rahul Sharma

Shrikanth Narayanan

CVBM

201

30 Mar 2022

The xmuspeech system for multi-channel multi-party meeting transcription challenge

192

11 Feb 2022

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

...

Shutong Niu

Yuhang Cao

Heng Lu

Jun Du

Chin-Hui Lee

224

10 Feb 2022

Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge

Jingguang Tian

Xinhui Hu

Xinkang Xu

218

10 Feb 2022

The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

180

09 Feb 2022

Multi-Channel End-to-End Neural Diarization with Distributed Microphones

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Shota Horiguchi

Yuki Takashima

Leibny Paola García-Perera

Shinji Watanabe

Yohei Kawaguchi

345

10 Oct 2021

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Nithin Rao Koluguri

Taejin Park

Boris Ginsburg

ViT

276

156

08 Oct 2021

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

262

07 Oct 2021

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

235

23 Sep 2021

Joint speaker diarisation and tracking in switching state-space model

Spoken Language Technology Workshop (SLT), 2021

J. H. M. Wong

Yifan Gong

141

23 Sep 2021

Diarisation using location tracking with agglomerative clustering

Spoken Language Technology Workshop (SLT), 2021

224

22 Sep 2021

Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation

292

14 Sep 2021

XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021

231

06 Sep 2021

The ByteDance Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021

Xudong Mao

Yuxuan Wang

212

05 Sep 2021