Revisiting Self-supervised Learning of Speech Representation from a
Mutual Information PerspectiveIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Self-supervised Fine-tuning for Improved Content Representations by
Speaker-invariant ClusteringInterspeech (Interspeech), 2023 |
DinoSR: Self-Distillation and Online Clustering for Self-supervised
Speech Representation LearningNeural Information Processing Systems (NeurIPS), 2023 |
MelHuBERT: A simplified HuBERT on Mel spectrogramsAutomatic Speech Recognition & Understanding (ASRU), 2022 |
Learning Dependencies of Discrete Speech Representations with Neural
Hidden Markov ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 Sung-Lin Yeh Hao Tang |