Speaker Diarization with Region Proposal Network

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

14 February 2020

Zili Huang

Shinji Watanabe

Yusuke Fujita

Leibny Paola García-Perera

Yiwen Shao

Daniel Povey

Sanjeev Khudanpur

Papers citing "Speaker Diarization with Region Proposal Network"

36 / 36 papers shown

Domain-Aware Speaker Diarization On African-Accented English

Chibuzor Okocha

Kelechi Ezema

Christan Grant

155

25 Sep 2025

An Investigation Into Explainable Audio Hate Speech Detection

SIGDIAL Conferences (SIGDIAL), 2024

Wonjun Lee

Gary Geunbae Lee

258

12 Aug 2024

Online speaker diarization of meetings guided by speech separation

Elio Gruttadauria

Mathieu Fontaine

S. Essid

260

30 Jan 2024

Multi-channel Conversational Speaker Separation via Neural Diarization

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

H. Taherian

DeLiang Wang

262

15 Nov 2023

End-to-end Online Speaker Diarization with Target Speaker Tracking

Weiqing Wang

Ming Li

401

12 Oct 2023

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Gaobin Yang

Maokui He

Shutong Niu

Ruoyu Wang

312

17 Sep 2023

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

Computer Speech and Language (CSL), 2023

274

29 May 2023

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator

Interspeech (Interspeech), 2023

Lingwei Meng

Haibin Wu

208

25 May 2023

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

Speech Communication (Speech Commun.), 2023

364

21 Mar 2023

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

...

Kong Aik Lee

185

17 Aug 2022

Multi-target Extractor and Detector for Unknown-number Speaker Diarization

IEEE Signal Processing Letters (SPL), 2022

Chin-Yi Cheng

Hung-Shin Lee

Yu Tsao

Hsin-Min Wang

275

30 Mar 2022

Using Active Speaker Faces for Diarization in TV shows

Rahul Sharma

Shrikanth Narayanan

214

30 Mar 2022

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

ACM Multimedia (MM), 2021

551

29 Nov 2021

Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity

304

07 Oct 2021

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

307

07 Oct 2021

Localization Based Sequential Grouping for Continuous Speech Separation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Zhong-Qiu Wang

DeLiang Wang

283

14 Jul 2021

Encoder-Decoder Based Attractors for End-to-End Neural Diarization

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Leibny Paola García-Perera

252

20 Jun 2021

Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech

Interspeech (Interspeech), 2021

K. Kinoshita

Marc Delcroix

Naohiro Tawara

319

19 May 2021

End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

254

05 May 2021

Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings

Interspeech (Interspeech), 2021

Kiran Karra

A. McCree

258

06 Apr 2021

Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem

Interspeech (Interspeech), 2021

Desh Raj

Sanjeev Khudanpur

337

05 Apr 2021

Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect

AAAI Conference on Artificial Intelligence (AAAI), 2021

174

02 Mar 2021

Contrastive Separative Coding for Self-supervised Representation Learning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

188

01 Mar 2021

A Review of Speaker Diarization: Recent Advances with Deep Learning

Computer Speech and Language (CSL), 2021

Tae Jin Park

Naoyuki Kanda

Dimitrios Dimitriadis

Kyu Jeong Han

Shinji Watanabe

Shrikanth Narayanan

869

411

24 Jan 2021

Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers

Leibny Paola García-Perera

Kenji Nagamatsu

283

21 Jan 2021

Speaker activity driven neural speech extraction

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

340

14 Jan 2021

Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks

Computer Speech and Language (CSL), 2020

521

250

29 Dec 2020

Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification

Computing and informatics (CAI), 2020

304

21 Dec 2020

End-to-End Speaker Diarization as Post-Processing

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Shota Horiguchi

Leibny Paola García-Perera

Yusuke Fujita

Shinji Watanabe

Kenji Nagamatsu

298

18 Dec 2020

Multi-class Spectral Clustering with Overlaps for Speaker Diarization

Desh Raj

Zili Huang

Sanjeev Khudanpur

252

05 Nov 2020

Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis

...

291

115

03 Nov 2020

DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs

Desh Raj

Leibny Paola García-Perera

Sanjeev Khudanpur

251

03 Nov 2020

Online Speaker Diarization with Relation Network

Xiang Li

Yucheng Zhao

Chong Luo

Wenjun Zeng

196

17 Sep 2020

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

Spoken Language Technology Workshop (SLT), 2020

270

11 Aug 2020

Neural Speaker Diarization with Speaker-Wise Chain Rule

244

02 Jun 2020

End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors

432

226

20 May 2020