v1v2 (latest)

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

20 May 2020

Wei Zou

Xiangang Li

SSL

ArXiv (abs)PDF HTML

Papers citing "A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition"

21 / 21 papers shown

NeurIPT: Foundation Model for Neural Interfaces

156

18 Oct 2025

SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR

Natarajan Balaji Shankar

Ruchao Fan

Abeer Alwan

287

15 Jun 2024

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature ExtractorsInternational Conference on Natural Language and Speech Processing (ICNLSP), 2023

Xiangyu Zhang

248

27 Nov 2023

On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge DistillationInterspeech (Interspeech), 2023

204

06 Jul 2023

Cross-Modal Fine-Tuning: Align then RefineInternational Conference on Machine Learning (ICML), 2023

Graham Neubig

303

11 Feb 2023

SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation LearningSpoken Language Technology Workshop (SLT), 2022

Tzu-Quan Lin

...

332

16 Oct 2022

CTCBERT: Advancing Hidden-unit BERT with CTC ObjectivesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

349

16 Oct 2022

DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASRInterspeech (Interspeech), 2022

Ruchao Fan

Abeer Alwan

286

16 Jun 2022

Self-Supervised Speech Representation Learning: A ReviewIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Abdel-rahman Mohamed

Hung-yi Lee

Lasse Borgholt

Jakob Drachmann Havtorn

...

796

475

21 May 2022

Speech Pre-training with Acoustic PieceInterspeech (Interspeech), 2022

248

07 Apr 2022

A Brief Overview of Unsupervised Neural Speech Representation Learning

Lasse Borgholt

Jakob Drachmann Havtorn

266

01 Mar 2022

MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition

Haizhou Li

172

27 Oct 2021

Don't speak too fast: The impact of data bias on self-supervised speech models

303

15 Oct 2021

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Rui Wang

...

463

268

14 Oct 2021

Decoupling recognition and transcription in Mandarin ASR

254

02 Aug 2021

BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataFrontiers in Human Neuroscience (Front Hum Neurosci), 2021

Demetres Kostas

Stephane Aroca-Ouellette

Frank Rudzicz

SSL

287

316

28 Jan 2021

Stochastic Attention Head Removal: A simple and effective method for improving Transformer Based ASR Models

Shucong Zhang

Erfan Loweimi

P. Bell

Steve Renals

275

08 Nov 2020

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation LearningInterspeech (Interspeech), 2020

Dongwei Jiang

Wubo Li

Miao Cao

Wei Zou

Xiangang Li

SSL

425

27 Oct 2020

Similarity Analysis of Self-Supervised Speech Representations

428

22 Oct 2020

TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

691

402

12 Jul 2020

Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

649

394

25 Oct 2019