Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection

25 October 2019
Latané Bullock
H. Bredin
Leibny Paola García-Perera

Papers citing "Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection"

44 / 44 papers shown
Title
UniArray: Unified Spectral-Spatial Modeling for Array-Geometry-Agnostic Speech Separation
Weiguang Chen
Junjie Zhang
Jielong Yang
Eng Siong Chng
Xionghu Zhong
60
0
0
07 Mar 2025
Conversational Rubert for Detecting Competitive Interruptions in ASR-Transcribed Dialogues
Dmitrii Galimzianov
Viacheslav Vyshegorodtsev
24
0
0
20 Jul 2024
ASoBO: Attentive Beamformer Selection for Distant Speaker Diarization in Meetings
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
25
0
0
05 Jun 2024
A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification
Rémi Uro
D. Doukhan
Albert Rilliard
Laëtitia Larcher
Anissa-Claire Adgharouamane
Marie Tahon
Antoine Laurent
31
4
0
26 Apr 2024
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
16
2
0
13 Feb 2024
An Explainable Proxy Model for Multilabel Audio Segmentation
Théo Mariotte
Antonio Almudévar
Marie Tahon
Alfonso Ortega Giménez
21
1
0
16 Jan 2024
Powerset multi-class cross entropy loss for neural speaker diarization
Alexis Plaquet
H. Bredin
99
91
0
19 Oct 2023
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Midia Yousefi
Takuya Yoshioka
Jian Wu
19
3
0
21 Sep 2023
Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Martin Lebourdais
Théo Mariotte
Marie Tahon
Anthony Larcher
Antoine Laurent
Silvio Montrésor
S. Meignier
Jean-Hugh Thomas
VLM
25
5
0
24 Jul 2023
Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features
Théo Mariotte
Anthony Larcher
Silvio Montrésor
Jean-Hugh Thomas
17
1
0
07 Jun 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
34
9
0
29 May 2023
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
Giovanni Morrone
Samuele Cornell
L. Serafini
Enrico Zovato
A. Brutti
S. Squartini
21
4
0
21 Mar 2023
Towards Measuring and Scoring Speaker Diarization Fairness
Yannis Tevissen
Jérôme Boudy
Gérard Chollet
Frédéric Petitpont
15
2
0
20 Feb 2023
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description
Yannis Tevissen
Jérôme Boudy
Frédéric Petitpont
14
1
0
17 Jan 2023
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj
Daniel Povey
Sanjeev Khudanpur
11
34
0
10 Dec 2022
Multitask Detection of Speaker Changes, Overlapping Speech and Voice Activity Using wav2vec 2.0
Marie Kunesova
Zbynek Zajíc
SSL
VLM
13
15
0
26 Oct 2022
In search of strong embedding extractors for speaker diarisation
Jee-weon Jung
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
A. Brown
Youngki Kwon
Shinji Watanabe
Joon Son Chung
40
16
0
26 Oct 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
19
0
0
24 Sep 2022
Overlapped speech and gender detection with WavLM pre-trained features
Martin Lebourdais
Marie Tahon
Antoine Laurent
S. Meignier
27
17
0
09 Sep 2022
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
28
25
0
27 Aug 2022
Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free
Md. Iftekhar Tanveer
Diego Casabuena
Jussi Karlgren
Rosie Jones
BDL
4
3
0
25 Jul 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Y. Kawaguchi
39
23
0
06 Jun 2022
Multi-target Extractor and Detector for Unknown-number Speaker Diarization
Chin-Yi Cheng
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
11
8
0
30 Mar 2022
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Xiong Xiao
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
17
34
0
07 Oct 2021
Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation
Juan Manuel Coria
H. Bredin
Sahar Ghannay
Sophie Rosset
41
30
0
14 Sep 2021
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification
Zeqian Li
Xinlu He
Jacob Whitehill
11
5
0
09 Sep 2021
BeamTransformer: Microphone Array-based Overlapping Speech Detection
Siqi Zheng
Shiliang Zhang
Weilong Huang
Qian Chen
Hongbin Suo
Ming Lei
Jinwei Feng
Zhijie Yan
19
7
0
09 Sep 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Y. Kawaguchi
15
37
0
04 Jul 2021
End-to-end Neural Diarization: From Transformer to Conformer
Yi Y. Liu
Eunjung Han
Chul Lee
A. Stolcke
11
40
0
14 Jun 2021
End-to-end speaker segmentation for overlap-aware resegmentation
H. Bredin
Antoine Laurent
VLM
209
162
0
08 Apr 2021
Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network
Jee-weon Jung
Hee-Soo Heo
Youngki Kwon
Joon Son Chung
Bong-Jin Lee
18
18
0
07 Apr 2021
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem
Desh Raj
Sanjeev Khudanpur
22
3
0
05 Apr 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
269
325
0
24 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
208
198
0
29 Dec 2020
End-to-End Speaker Diarization as Post-Processing
Shota Horiguchi
Leibny Paola García-Perera
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
17
41
0
18 Dec 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
23
30
0
05 Nov 2020
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers
Eunjung Han
Chul Lee
A. Stolcke
11
42
0
05 Nov 2020
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs
Desh Raj
Leibny Paola García-Perera
Zili Huang
Shinji Watanabe
Daniel Povey
A. Stolcke
Sanjeev Khudanpur
11
64
0
03 Nov 2020
Combination of Deep Speaker Embeddings for Diarisation
Guangzhi Sun
Chao Zhang
P. Woodland
9
20
0
22 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
17
10
0
22 Oct 2020
Analysis of the BUT Diarization System for VoxConverse Challenge
Federico Landini
O. Glembek
P. Matejka
Johan Rohdin
L. Burget
Mireia Díez
Anna Silnova
8
32
0
22 Oct 2020
Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews
Bo Wang
Yue Wu
Niall Taylor
Terry Lyons
M. Liakata
A. Nevado-Holgado
K. Saunders
14
13
0
08 Aug 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
28
9
0
14 Jun 2020
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
183
312
0
04 Nov 2019