Advances in integration of end-to-end neural and clustering-based
diarization for real conversational speech

19 May 2021

Papers citing "Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech"

Pretraining Multi-Speaker Identification for Neural Speaker Diarization Shota Horiguchi Atsushi Ando Marc Delcroix Naohiro Tawara 24 0 0 30 May 2025
Summary of the NOTSOFAR-1 Challenge: Highlights and Learnings Igor Abramovski Alon Vinnikov Shalev Shaer Naoyuki Kanda Xiaofei Wang Amir Ivry Eyal Krupka 140 1 0 28 Jan 2025
USED: Universal Speaker Extraction and Diarization Junyi Ao Mehmet Sinan Yildirim Ruijie Tao Mengyao Ge Shuai Wang Yan-min Qian Haizhou Li 101 6 0 17 Jan 2025
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition Alexander Polok Dominik Klement M. Kocour Jiangyu Han Federico Landini Bolaji Yusuf Sanjeev Khudanpur Kevin Duh J. Černocký L. Burget 61 0 0 03 Jan 2025
Guided Speaker Embedding Shota Horiguchi Takafumi Moriya Atsushi Ando Takanori Ashihara Hiroshi Sato Naohiro Tawara Marc Delcroix 124 1 0 03 Jan 2025
Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization Xiang Li Vivek Govindan Rohit Paturi S. Srinivasan 46 1 0 26 Jun 2024
Online speaker diarization of meetings guided by speech separation Elio Gruttadauria Mathieu Fontaine S. Essid 30 5 0 30 Jan 2024
Improving End-to-End Neural Diarization Using Conversational Summary Representations Samuel J. Broughton Lahiru Samarakoon 47 7 0 24 Jun 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings L. Serafini Samuele Cornell Giovanni Morrone Enrico Zovato Alessio Brutti S. Squartini 83 9 0 29 May 2023
TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization Jiaming Wang Zhihao Du Shiliang Zhang 45 6 0 08 Mar 2023
BER: Balanced Error Rate For Speaker Diarization Tao Liu K. Yu 39 4 0 08 Nov 2022
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization Dongmei Wang Xiong Xiao Naoyuki Kanda Takuya Yoshioka Jian Wu 83 28 0 27 Aug 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yuki Takashima Yohei Kawaguchi 98 24 0 06 Jun 2022
Robust End-to-end Speaker Diarization with Generic Neural Clustering Chenyu Yang Yu Wang 109 2 0 18 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization Federico Landini Alicia Lozano-Diez Mireia Díez Lukáš Burget 60 37 0 02 Apr 2022
Multimodal Clustering with Role Induced Constraints for Speaker Diarization Nikolaos Flemotomos Shrikanth Narayanan 50 4 0 01 Apr 2022
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model K. Kinoshita Marc Delcroix Tomoharu Iwata BDL 61 19 0 14 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Michael Zeng Xiangzhan Yu Furu Wei SSL 294 1,911 0 26 Oct 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yawen Xue Yuki Takashima Yohei Kawaguchi 79 38 0 04 Jul 2021