ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.09040
  4. Cited By
Advances in integration of end-to-end neural and clustering-based
  diarization for real conversational speech
v1v2 (latest)

Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech

19 May 2021
K. Kinoshita
Marc Delcroix
Naohiro Tawara
ArXiv (abs)PDFHTMLGithub (78★)

Papers citing "Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech"

19 / 19 papers shown
Title
Pretraining Multi-Speaker Identification for Neural Speaker Diarization
Pretraining Multi-Speaker Identification for Neural Speaker Diarization
Shota Horiguchi
Atsushi Ando
Marc Delcroix
Naohiro Tawara
24
0
0
30 May 2025
Summary of the NOTSOFAR-1 Challenge: Highlights and Learnings
Summary of the NOTSOFAR-1 Challenge: Highlights and Learnings
Igor Abramovski
Alon Vinnikov
Shalev Shaer
Naoyuki Kanda
Xiaofei Wang
Amir Ivry
Eyal Krupka
140
1
0
28 Jan 2025
USED: Universal Speaker Extraction and Diarization
USED: Universal Speaker Extraction and Diarization
Junyi Ao
Mehmet Sinan Yildirim
Ruijie Tao
Mengyao Ge
Shuai Wang
Yan-min Qian
Haizhou Li
101
6
0
17 Jan 2025
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Alexander Polok
Dominik Klement
M. Kocour
Jiangyu Han
Federico Landini
Bolaji Yusuf
Sanjeev Khudanpur
Kevin Duh
J. Černocký
L. Burget
61
0
0
03 Jan 2025
Guided Speaker Embedding
Guided Speaker Embedding
Shota Horiguchi
Takafumi Moriya
Atsushi Ando
Takanori Ashihara
Hiroshi Sato
Naohiro Tawara
Marc Delcroix
124
1
0
03 Jan 2025
Speakers Unembedded: Embedding-free Approach to Long-form Neural
  Diarization
Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Xiang Li
Vivek Govindan
Rohit Paturi
S. Srinivasan
46
1
0
26 Jun 2024
Online speaker diarization of meetings guided by speech separation
Online speaker diarization of meetings guided by speech separation
Elio Gruttadauria
Mathieu Fontaine
S. Essid
30
5
0
30 Jan 2024
Improving End-to-End Neural Diarization Using Conversational Summary
  Representations
Improving End-to-End Neural Diarization Using Conversational Summary Representations
Samuel J. Broughton
Lahiru Samarakoon
47
7
0
24 Jun 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
Alessio Brutti
S. Squartini
83
9
0
29 May 2023
TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization
Jiaming Wang
Zhihao Du
Shiliang Zhang
45
6
0
08 Mar 2023
BER: Balanced Error Rate For Speaker Diarization
BER: Balanced Error Rate For Speaker Diarization
Tao Liu
K. Yu
39
4
0
08 Nov 2022
Target Speaker Voice Activity Detection with Transformers and Its
  Integration with End-to-End Neural Diarization
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
83
28
0
27 Aug 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global
  and Local Attractors
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
98
24
0
06 Jun 2022
Robust End-to-end Speaker Diarization with Generic Neural Clustering
Robust End-to-end Speaker Diarization with Generic Neural Clustering
Chenyu Yang
Yu Wang
109
2
0
18 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for
  End-to-End Neural Diarization
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization
Federico Landini
Alicia Lozano-Diez
Mireia Díez
Lukávs Burget
60
37
0
02 Apr 2022
Multimodal Clustering with Role Induced Constraints for Speaker
  Diarization
Multimodal Clustering with Role Induced Constraints for Speaker Diarization
Nikolaos Flemotomos
Shrikanth Narayanan
50
4
0
01 Apr 2022
Tight integration of neural- and clustering-based diarization through
  deep unfolding of infinite Gaussian mixture model
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model
K. Kinoshita
Marc Delcroix
Tomoharu Iwata
BDL
61
19
0
14 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
294
1,911
0
26 Oct 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using
  Global and Local Attractors
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Yohei Kawaguchi
79
38
0
04 Jul 2021
1