ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.07272
  4. Cited By
Target-Speaker Voice Activity Detection: a Novel Approach for
  Multi-Speaker Diarization in a Dinner Party Scenario
v1v2 (latest)

Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario

14 May 2020
Ivan Medennikov
M. Korenevsky
Tatiana Prisyach
Yuri Y. Khokhlov
Mariya Korenevskaya
Ivan Sorokin
Tatiana Timofeeva
Anton Mitrofanov
A. Andrusenko
Ivan Podluzhny
A. Laptev
A. Romanenko
ArXiv (abs)PDFHTML

Papers citing "Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario"

24 / 124 papers shown
Title
A Comparative Study of Modular and Joint Approaches for
  Speaker-Attributed ASR on Monaural Long-Form Audio
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
73
14
0
06 Jul 2021
Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Shu-Tong Niu
Jun Du
Lei Sun
Chin-Hui Lee
41
5
0
06 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using
  Global and Local Attractors
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Yohei Kawaguchi
79
38
0
04 Jul 2021
Enrollment-less training for personalized voice activity detection
Enrollment-less training for personalized voice activity detection
Naoki Makishima
Mana Ihori
Tomohiro Tanaka
Akihiko Takashima
Shota Orihashi
Ryo Masumura
43
10
0
23 Jun 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
74
68
0
20 Jun 2021
Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural
  Diarization
Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization
Yuki Takashima
Yusuke Fujita
Shota Horiguchi
Shinji Watanabe
Paola García
Kenji Nagamatsu
90
15
0
09 Jun 2021
End-to-End Speaker Diarization Conditioned on Speech Activity and
  Overlap Detection
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Yuki Takashima
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Leibny Paola García-Perera
Kenji Nagamatsu
62
26
0
08 Jun 2021
End-to-End Diarization for Variable Number of Speakers with Local-Global
  Networks and Discriminative Speaker Embeddings
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Soumi Maiti
Hakan Erdogan
K. Wilson
Scott Wisdom
Shinji Watanabe
J. Hershey
72
22
0
05 May 2021
Adapting Speaker Embeddings for Speaker Diarisation
Adapting Speaker Embeddings for Speaker Diarisation
Youngki Kwon
Jee-weon Jung
Hee-Soo Heo
You Jin Kim
Bong-Jin Lee
Joon Son Chung
38
13
0
07 Apr 2021
Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA
  Clustering of DNN Embeddings
Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings
Kiran Karra
A. McCree
14
2
0
06 Apr 2021
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem
Desh Raj
Sanjeev Khudanpur
95
3
0
05 Apr 2021
Target Speaker Verification with Selective Auditory Attention for Single
  and Multi-talker Speech
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech
Chenglin Xu
Wei Rao
Jibin Wu
Haizhou Li
68
32
0
30 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
68
30
0
19 Mar 2021
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech
  Diarization Challenge
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge
Weiqing Wang
Qingjian Lin
Danwei Cai
Lin Yang
Ming Li
23
8
0
06 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
384
337
0
24 Jan 2021
Speaker activity driven neural speech extraction
Speaker activity driven neural speech extraction
Marc Delcroix
Kateřina Žmolíková
Tsubasa Ochiai
K. Kinoshita
Tomohiro Nakatani
106
35
0
14 Jan 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker
  diarization: theory, implementation and analysis on standard tasks
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
Federico Landini
Jan Profant
Mireia Díez
L. Burget
287
209
0
29 Dec 2020
End-to-End Speaker Diarization as Post-Processing
End-to-End Speaker Diarization as Post-Processing
Shota Horiguchi
Leibny Paola García-Perera
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
93
42
0
18 Dec 2020
Block-Online Guided Source Separation
Block-Online Guided Source Separation
Shota Horiguchi
Yusuke Fujita
Kenji Nagamatsu
50
4
0
16 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
105
31
0
05 Nov 2020
Integration of speech separation, diarization, and recognition for
  multi-speaker meetings: System description, comparison, and analysis
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
63
88
0
03 Nov 2020
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs
Desh Raj
Leibny Paola García-Perera
Zili Huang
Shinji Watanabe
Daniel Povey
A. Stolcke
Sanjeev Khudanpur
117
68
0
03 Nov 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous
  Multi-Talker Recordings
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
74
49
0
11 Aug 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for
  Mixture Signals
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
96
22
0
25 Jun 2020
Previous
123