Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.03486
Cited By
v1
v2 (latest)
Self-supervised speaker embeddings
6 April 2019
Themos Stafylakis
Johan Rohdin
Oldrich Plchot
Petr Mizera
L. Burget
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Self-supervised speaker embeddings"
32 / 32 papers shown
Enhancing Self-Supervised Speaker Verification Using Similarity-Connected Graphs and GCN
Zhaorui Sun
Yihao Chen
Jialong Wang
Minqiang Xu
Lei Fang
Sian Fang
Lin Liu
SSL
149
1
0
04 Sep 2025
Towards Low-Latency Tracking of Multiple Speakers With Short-Context Speaker Embeddings
Taous Iatariene
Alexandre Guérin
Romain Serizel
110
0
0
18 Aug 2025
Investigation of Speaker Representation for Target-Speaker Speech Processing
Spoken Language Technology Workshop (SLT), 2024
Takanori Ashihara
Takafumi Moriya
Shota Horiguchi
Junyi Peng
Tsubasa Ochiai
Marc Delcroix
Kohei Matsuura
Hiroshi Sato
231
2
0
15 Oct 2024
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features
Satvik Dixit
Massa Baali
Rita Singh
Bhiksha Raj
323
1
0
07 Oct 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
345
24
0
21 Jul 2024
Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Varun Krishna
Sriram Ganapathy
SSL
270
2
0
02 Jul 2024
Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations
Theo Lepage
Reda Dehak
SSL
248
4
0
23 Apr 2024
What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis
Takanori Ashihara
Marc Delcroix
Takafumi Moriya
Kohei Matsuura
Taichi Asami
Yusuke Ijima
SSL
272
16
0
31 Jan 2024
Experimenting with Additive Margins for Contrastive Self-Supervised Speaker Verification
Interspeech (Interspeech), 2023
Theo Lepage
Reda Dehak
SSL
188
6
0
06 Jun 2023
Self-Supervised Learning with Cluster-Aware-DINO for High-Performance Robust Speaker Verification
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Bing Han
Zhengyang Chen
Y. Qian
150
36
0
12 Apr 2023
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
Spandan Dey
Md. Sahidullah
G. Saha
180
31
0
30 Nov 2022
A comprehensive study on self-supervised distillation for speaker representation learning
Spoken Language Technology Workshop (SLT), 2022
Zhengyang Chen
Yao Qian
Bing Han
Y. Qian
Michael Zeng
SSL
345
23
0
28 Oct 2022
Anchored Speech Recognition with Neural Transducers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Desh Raj
Junteng Jia
Jay Mahadeokar
Chunyang Wu
Niko Moritz
Xiaohui Zhang
Ozlem Kalinli
241
2
0
20 Oct 2022
C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker Verification
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Chunlei Zhang
Dong Yu
196
22
0
15 Aug 2022
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Jaejin Cho
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
SSL
205
22
0
10 Aug 2022
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations
Interspeech (Interspeech), 2022
Jaejin Cho
R. Pappagari
Piotr Żelasko
Laureano Moro-Velazquez
Jesús Villalba
Najim Dehak
SSL
198
14
0
10 Aug 2022
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification
Sung Hwan Mun
Min Hyun Han
Dongjune Lee
Jihwan Kim
N. Kim
SSL
258
3
0
16 Dec 2021
The JHU submission to VoxSRC-21: Track 3
Jejin Cho
Jesus Villalba
Najim Dehak
205
30
0
28 Sep 2021
Speaker embeddings by modeling channel-wise correlations
Interspeech (Interspeech), 2021
Themos Stafylakis
Johan Rohdin
L. Burget
195
10
0
06 Apr 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
150
8
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
Speech Communication (Speech Commun.), 2020
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
Tianshi Zheng
Dong Wang
231
143
0
23 Dec 2020
Self-supervised Text-independent Speaker Verification using Prototypical Momentum Contrastive Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wei Xia
Chunlei Zhang
Chao Weng
Meng Yu
Dong Yu
SSL
165
91
0
13 Dec 2020
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
Arsha Nagrani
Joon Son Chung
Jaesung Huh
Andrew Brown
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
163
76
0
12 Dec 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
213
6
0
30 Oct 2020
An iterative framework for self-supervised deep speaker representation learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Danwei Cai
Weiqing Wang
Ming Li
SSL
131
43
0
25 Oct 2020
Learning Speaker Embedding from Text-to-Speech
Jaejin Cho
Piotr Żelasko
Jesus Villalba
Shinji Watanabe
Najim Dehak
123
13
0
21 Oct 2020
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings
Interspeech (Interspeech), 2020
Florian Kreyssig
P. Woodland
114
7
0
09 Aug 2020
Semi-Supervised Contrastive Learning with Generalized Contrastive Loss and Its Application to Speaker Recognition
Nakamasa Inoue
Keita Goto
SSL
178
61
0
08 Jun 2020
Deep Normalization for Speaker Vectors
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
256
28
0
07 Apr 2020
Deep learning methods in speaker recognition: a review
Periodica Polytechnica Electrical Engineering and Computer Science (PEECS), 2019
Dávid Sztahó
György Szaszák
A. Beke
VLM
137
52
0
14 Nov 2019
Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Zhiyuan Peng
Siyuan Feng
Tan Lee
160
6
0
30 Oct 2019
Self-supervised pre-training with acoustic configurations for replay spoofing detection
Interspeech (Interspeech), 2019
Hye-jin Shim
Hee-Soo Heo
Jee-weon Jung
Ha-Jin Yu
134
8
0
22 Oct 2019
1
Page 1 of 1