Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization | The Speaker and Language Recognition Workshop (Odyssey), 2024
TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Content-Aware Speaker Embeddings for Speaker Diarisation | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
U-vectors: Generating clusterable speaker embedding from unlabeled data | Applied Sciences (AS), 2021
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks | Computer Speech and Language (CSL), 2020