ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.09862
  4. Cited By
A Further Study of Unsupervised Pre-training for Transformer Based
  Speech Recognition
v1v2 (latest)

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

20 May 2020
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
    SSL
ArXiv (abs)PDFHTML

Papers citing "A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition"

21 / 21 papers shown
NeurIPT: Foundation Model for Neural Interfaces
NeurIPT: Foundation Model for Neural Interfaces
Zitao Fang
Chenxuan Li
Hongting Zhou
Shuyang Yu
Guodong DU
Ashwaq Qasem
Yang Lu
Jing Li
J. Zhang
Sim Kuan Goh
156
7
0
18 Oct 2025
SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation
  for Low Resource ASR
SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR
Natarajan Balaji Shankar
Ruchao Fan
Abeer Alwan
287
1
0
15 Jun 2024
A Quantitative Approach to Understand Self-Supervised Models as
  Cross-lingual Feature Extractors
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature ExtractorsInternational Conference on Natural Language and Speech Processing (ICNLSP), 2023
Shuyue Stella Li
Beining Xu
Xiangyu Zhang
Hexin Liu
Wen-Han Chao
Leibny Paola García
SSL
248
5
0
27 Nov 2023
On-Device Constrained Self-Supervised Speech Representation Learning for
  Keyword Spotting via Knowledge Distillation
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge DistillationInterspeech (Interspeech), 2023
Gene-Ping Yang
Yue Gu
Qingming Tang
Dongsu Du
Yuzong Liu
204
7
0
06 Jul 2023
Cross-Modal Fine-Tuning: Align then Refine
Cross-Modal Fine-Tuning: Align then RefineInternational Conference on Machine Learning (ICML), 2023
Junhong Shen
Liam Li
Lucio Dery
Corey Staten
M. Khodak
Graham Neubig
Ameet Talwalkar
303
63
0
11 Feb 2023
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of
  Self-Supervised Speech Representation Learning
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation LearningSpoken Language Technology Workshop (SLT), 2022
Tzu-hsun Feng
Annie Dong
Ching-Feng Yeh
Shu-Wen Yang
Tzu-Quan Lin
...
Xuankai Chang
Shinji Watanabe
Abdel-rahman Mohamed
Shang-Wen Li
Hung-yi Lee
ELMSSL
332
38
0
16 Oct 2022
CTCBERT: Advancing Hidden-unit BERT with CTC Objectives
CTCBERT: Advancing Hidden-unit BERT with CTC ObjectivesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ruchao Fan
Yiming Wang
Yashesh Gaur
Jinyu Li
349
8
0
16 Oct 2022
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised
  Learning and Its Application to Children's ASR
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASRInterspeech (Interspeech), 2022
Ruchao Fan
Abeer Alwan
286
39
0
16 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A ReviewIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
796
475
0
21 May 2022
Speech Pre-training with Acoustic Piece
Speech Pre-training with Acoustic PieceInterspeech (Interspeech), 2022
Shuo Ren
Shujie Liu
Yu Wu
Long Zhou
Furu Wei
SSL
248
19
0
07 Apr 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDLAI4TSSSL
266
13
0
01 Mar 2022
MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal
  Emotion Recognition
MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition
Jinming Zhao
Ruichen Li
Qin Jin
Xinchao Wang
Haizhou Li
172
42
0
27 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech
  models
Don't speak too fast: The impact of data bias on self-supervised speech models
Yen Meng
Yi-Hui Chou
Andy T. Liu
Hung-yi Lee
303
35
0
15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language
  Processing
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
463
268
0
14 Oct 2021
Decoupling recognition and transcription in Mandarin ASR
Decoupling recognition and transcription in Mandarin ASR
Jiahong Yuan
Xingyu Cai
Dongji Gao
Renjie Zheng
Liang Huang
Kenneth Church
254
14
0
02 Aug 2021
BENDR: using transformers and a contrastive self-supervised learning
  task to learn from massive amounts of EEG data
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataFrontiers in Human Neuroscience (Front Hum Neurosci), 2021
Demetres Kostas
Stephane Aroca-Ouellette
Frank Rudzicz
SSL
287
316
0
28 Jan 2021
Stochastic Attention Head Removal: A simple and effective method for
  improving Transformer Based ASR Models
Stochastic Attention Head Removal: A simple and effective method for improving Transformer Based ASR Models
Shucong Zhang
Erfan Loweimi
P. Bell
Steve Renals
275
0
0
08 Nov 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for
  Self-supervised Speech Representation Learning
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation LearningInterspeech (Interspeech), 2020
Dongwei Jiang
Wubo Li
Miao Cao
Wei Zou
Xiangang Li
SSL
425
74
0
27 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations
Similarity Analysis of Self-Supervised Speech Representations
Yu-An Chung
Yonatan Belinkov
James R. Glass
SSL
428
45
0
22 Oct 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for
  Speech
TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
691
402
0
12 Jul 2020
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
649
394
0
25 Oct 2019
1
Page 1 of 1