What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis

31 January 2024
Takanori Ashihara
Marc Delcroix
Takafumi Moriya
Kohei Matsuura
Taichi Asami
Yusuke Ijima
    SSL

Papers citing "What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis"

DELULU: Discriminative Embedding Learning Using Latent Units for Speaker-Aware Self-Supervised Speech Foundational Model
Massa Baali
Rita Singh
Bhiksha Raj
SSL
20 Oct 2025
Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
Yuxin Li
Eng Siong Chng
Cuntai Guan
05 Oct 2025
Investigation of Speaker Representation for Target-Speaker Speech Processing
Spoken Language Technology Workshop (SLT), 2024
Takanori Ashihara
Takafumi Moriya
Shota Horiguchi
Junyi Peng
Tsubasa Ochiai
Marc Delcroix
Kohei Matsuura
Hiroshi Sato
15 Oct 2024
Property Neurons in Self-Supervised Speech Transformers
Spoken Language Technology Workshop (SLT), 2024
Tzu-Quan Lin
Guan-Ting Lin
Hung-yi Lee
Hao Tang
MILM
07 Sep 2024
Disentangling segmental and prosodic factors to non-native speech comprehensibility
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024
Waris Quamer
Ricardo Gutierrez-Osuna
20 Aug 2024
SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection
Yi Zhu
Surya Koppisetti
Trang Tran
Gaurav Bharaj
26 Jul 2024
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning
Shuai Wang
Zheng-Shou Chen
Kong Aik Lee
Yan-min Qian
Haizhou Li
21 Jul 2024
WavRx: a Disease-Agnostic, Generalizable, and Privacy-Preserving Speech Health Diagnostic Model
Yi Zhu
Tiago H. Falk
MedIm
26 Jun 2024