Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection

25 October 2019

Papers citing "Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection"

44 / 44 papers shown

Title
UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation Weiguang Chen Junjie Zhang Jielong Yang Eng Siong Chng Xionghu Zhong 60 0 0 07 Mar 2025
Conversational Rubert for Detecting Competitive Interruptions in ASR-Transcribed Dialogues Dmitrii Galimzianov Viacheslav Vyshegorodtsev 24 0 0 20 Jul 2024
ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings Théo Mariotte Anthony Larcher Silvio Montrésor Jean-Hugh Thomas 25 0 0 05 Jun 2024
A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification Rémi Uro D. Doukhan Albert Rilliard Laëtitia Larcher Anissa-Claire Adgharouamane Marie Tahon Antoine Laurent 31 4 0 26 Apr 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection Théo Mariotte Anthony Larcher Silvio Montrésor Jean-Hugh Thomas 16 2 0 13 Feb 2024
An Explainable Proxy Model for Multiabel Audio Segmentation Théo Mariotte Antonio Almudévar Marie Tahon Alfonso Ortega Giménez 21 1 0 16 Jan 2024
Powerset multi-class cross entropy loss for neural speaker diarization Alexis Plaquet H. Bredin 99 91 0 19 Oct 2023
Profile-Error-Tolerant Target-Speaker Voice Activity Detection Dongmei Wang Xiong Xiao Naoyuki Kanda Midia Yousefi Takuya Yoshioka Jian Wu 19 3 0 21 Sep 2023
Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains Martin Lebourdais Théo Mariotte Marie Tahon Anthony Larcher Antoine Laurent Silvio Montrésor S. Meignier Jean-Hugh Thomas VLM 25 5 0 24 Jul 2023
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features Théo Mariotte Anthony Larcher Silvio Montrésor Jean-Hugh Thomas 17 1 0 07 Jun 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings L. Serafini Samuele Cornell Giovanni Morrone Enrico Zovato A. Brutti S. Squartini 34 9 0 29 May 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations Giovanni Morrone Samuele Cornell L. Serafini Enrico Zovato A. Brutti S. Squartini 21 4 0 21 Mar 2023
Towards Measuring and Scoring Speaker Diarization Fairness Yannis Tevissen Jérôme Boudy Gérard Chollet Frédéric Petitpont 15 2 0 20 Feb 2023
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description Yannis Tevissen Jérôme Boudy Frédéric Petitpont 14 1 0 17 Jan 2023
GPU-accelerated Guided Source Separation for Meeting Transcription Desh Raj Daniel Povey Sanjeev Khudanpur 11 34 0 10 Dec 2022
Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0 Marie Kunesova Zbynek Zajíc SSL VLM 13 15 0 26 Oct 2022
In search of strong embedding extractors for speaker diarisation Jee-weon Jung Hee-Soo Heo Bong-Jin Lee Jaesung Huh A. Brown Youngki Kwon Shinji Watanabe Joon Son Chung 40 16 0 26 Oct 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture Ziqing Du Kai Liu Xucheng Wan Huan Zhou 19 0 0 24 Sep 2022
Overlapped speech and gender detection with WavLM pre-trained features Martin Lebourdais Marie Tahon Antoine Laurent S. Meignier 27 17 0 09 Sep 2022
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization Dongmei Wang Xiong Xiao Naoyuki Kanda Takuya Yoshioka Jian Wu 28 25 0 27 Aug 2022
Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free Md. Iftekhar Tanveer Diego Casabuena Jussi Karlgren Rosie Jones BDL 4 3 0 25 Jul 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yuki Takashima Y. Kawaguchi 39 23 0 06 Jun 2022
Multi-target Extractor and Detector for Unknown-number Speaker Diarization Chin-Yi Cheng Hung-Shin Lee Yu Tsao Hsin-Min Wang 11 8 0 30 Mar 2022
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR Naoyuki Kanda Xiong Xiao Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 17 34 0 07 Oct 2021
Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation Juan Manuel Coria H. Bredin Sahar Ghannay Sophie Rosset 41 30 0 14 Sep 2021
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification Zeqian Li Xinlu He Jacob Whitehill 11 5 0 09 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection Siqi Zheng Shiliang Zhang Weilong Huang Qian Chen Hongbin Suo Ming Lei Jinwei Feng Zhijie Yan 19 7 0 09 Sep 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yawen Xue Yuki Takashima Y. Kawaguchi 15 37 0 04 Jul 2021
End-to-end Neural Diarization: From Transformer to Conformer Yi Y. Liu Eunjung Han Chul Lee A. Stolcke 11 40 0 14 Jun 2021
End-to-end speaker segmentation for overlap-aware resegmentation H. Bredin Antoine Laurent VLM 209 162 0 08 Apr 2021
Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network Jee-weon Jung Hee-Soo Heo Youngki Kwon Joon Son Chung Bong-Jin Lee 18 18 0 07 Apr 2021
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem Desh Raj Sanjeev Khudanpur 22 3 0 05 Apr 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning Tae Jin Park Naoyuki Kanda Dimitrios Dimitriadis Kyu Jeong Han Shinji Watanabe Shrikanth Narayanan VLM 269 325 0 24 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks Federico Landini Jan Profant Mireia Díez L. Burget 208 198 0 29 Dec 2020
End-to-End Speaker Diarization as Post-Processing Shota Horiguchi Leibny Paola García-Perera Yusuke Fujita Shinji Watanabe Kenji Nagamatsu 17 41 0 18 Dec 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization Desh Raj Zili Huang Sanjeev Khudanpur 23 30 0 05 Nov 2020
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers Eunjung Han Chul Lee A. Stolcke 11 42 0 05 Nov 2020
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs Desh Raj Leibny Paola García-Perera Zili Huang Shinji Watanabe Daniel Povey A. Stolcke Sanjeev Khudanpur 11 64 0 03 Nov 2020
Combination of Deep Speaker Embeddings for Diarisation Guangzhi Sun Chao Zhang P. Woodland 9 20 0 22 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers Zeqian Li Jacob Whitehill 17 10 0 22 Oct 2020
Analysis of the BUT Diarization System for VoxConverse Challenge Federico Landini O. Glembek P. Matejka Johan Rohdin L. Burget Mireia Díez Anna Silnova 8 32 0 22 Oct 2020
Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews Bo Wang Yue Wu Niall Taylor Terry Lyons M. Liakata A. Nevado-Holgado K. Saunders 14 13 0 08 Aug 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 28 9 0 14 Jun 2020
pyannote.audio: neural building blocks for speaker diarization H. Bredin Ruiqing Yin Juan Manuel Coria G. Gelly Pavel Korshunov Marvin Lavechin D. Fustes Hadrien Titeux Wassim Bouaziz Marie-Philippe Gill 183 312 0 04 Nov 2019