1904.03240
An Unsupervised Autoregressive Model for Speech Representation Learning
5 April 2019
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
SSL
Papers citing
"An Unsupervised Autoregressive Model for Speech Representation Learning"
50 / 269 papers shown
Title
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
Henry Gouk
Chen Change Loy
Timothy M. Hospedales
SSL
OOD
AI4TS
194
334
0
18 Oct 2021
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning
Yi-Chen Chen
Shu-Wen Yang
Cheng-Kuang Lee
Simon See
Hung-yi Lee
SSL
127
13
0
18 Oct 2021
DECAR: Deep Clustering for learning general-purpose Audio Representations
Sreyan Ghosh
Sandesh V Katta
Ashish Seth
S. Umesh
SSL
153
12
0
17 Oct 2021
Word Order Does Not Matter For Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Vineel Pratap
Qiantong Xu
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
229
4
0
12 Oct 2021
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sanyuan Chen
Yu Wu
Chengyi Wang
Zhengyang Chen
Zhuo Chen
...
Jian Wu
Yao Qian
Furu Wei
Jinyu Li
Xiangzhan Yu
SSL
166
119
0
12 Oct 2021
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Matthew Wiesner
Desh Raj
Sanjeev Khudanpur
171
8
0
10 Oct 2021
Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Joel Shor
A. Jansen
Wei Han
Daniel S. Park
Yu Zhang
SSL
AI4TS
313
65
0
09 Oct 2021
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Xuankai Chang
Takashi Maekaku
Pengcheng Guo
Jing Shi
Yen-Ju Lu
...
Tianzi Wang
Shu-Wen Yang
Yu Tsao
Hung-yi Lee
Shinji Watanabe
SSL
AI4TS
128
85
0
09 Oct 2021
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
Interspeech (Interspeech), 2021
Li Fu
Xiaoxiao Li
Runyu Wang
Lu Fan
Zhengchen Zhang
Meng Chen
Youzheng Wu
Xiaodong He
SSL
140
3
0
08 Oct 2021
Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shaoshi Ling
Chen Shen
Meng Cai
Zejun Ma
VLM
SSL
127
10
0
08 Oct 2021
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Liang-Hsuan Tseng
Yu-Kuan Fu
Heng-Jui Chang
Hung-yi Lee
SSL
114
17
0
07 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
549
198
0
05 Oct 2021
Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Toshiko Shibano
Xinyi Zhang
Miao Li
Haejin Cho
Peter Sullivan
Muhammad Abdul-Mageed
VLM
212
18
0
01 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
119
3
0
29 Sep 2021
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2021
Yu Zhang
Daniel S. Park
Wei Han
James Qin
Anmol Gulati
...
Zhifeng Chen
Quoc V. Le
Chung-Cheng Chiu
Ruoming Pang
Yonghui Wu
SSL
192
196
0
27 Sep 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Interspeech (Interspeech), 2021
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
301
116
0
23 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Chenyu You
Polydoros Giannouris
Yuexian Zou
SSL
187
64
0
08 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Eugene Kharitonov
Ann Lee
Adam Polyak
Yossi Adi
Jade Copet
...
Tu Nguyen
M. Rivière
Abdel-rahman Mohamed
Emmanuel Dupoux
Wei-Ning Hsu
214
136
0
07 Sep 2021
Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Zane Durante
Leena Mathur
Eric Ye
Sichong Zhao
Tejas Ramdas
Khalil Iskarous
217
0
0
27 Aug 2021
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition
Krishna D N
122
14
0
22 Aug 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
Automatic Speech Recognition & Understanding (ASRU), 2021
Yu-An Chung
Yu Zhang
Wei Han
Chung-Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
SSL
VLM
207
486
0
07 Aug 2021
Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing
Benjamin van Niekerk
Leanne Nortje
Matthew Baas
Herman Kamper
SSL
204
34
0
02 Aug 2021
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Samuel Kessler
Bethan Thomas
S. Karout
SSL
120
31
0
26 Jul 2021
ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track, 2021 edition
Afra Alishahi
Grzegorz Chrupała
Alejandrina Cristià
Emmanuel Dupoux
Bertrand Higy
Marvin Lavechin
Okko Räsänen
Chen Yu
129
7
0
14 Jul 2021
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation
Interspeech (Interspeech), 2021
Jian Luo
Jianzong Wang
Ning Cheng
Jing Xiao
SSL
135
6
0
09 Jul 2021
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
218
13
0
01 Jul 2021
As easy as APC: overcoming missing data and class imbalance in time series with self-supervised learning
Fiorella Wever
Thomas Anderson Keller
L. Symul
Victor Garcia
SSL
AI4TS
185
1
0
29 Jun 2021
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition
Interspeech (Interspeech), 2021
Ruirui Li
C. Ju
Zeya Chen
Hongda Mao
Oguz H. Elibol
A. Stolcke
114
4
0
18 Jun 2021
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System
Interspeech (Interspeech), 2021
Jinhan Wang
Yunzheng Zhu
Ruchao Fan
Wei Chu
Abeer Alwan
90
8
0
18 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Wei-Ning Hsu
Benjamin Bolte
Yifan Hao
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
472
3,879
0
14 Jun 2021
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Cheng-I Jeff Lai
Yang Zhang
Alexander H. Liu
Shiyu Chang
Yi-Lun Liao
Yung-Sung Chuang
Kaizhi Qian
Sameer Khurana
David D. Cox
James R. Glass
VLM
266
86
0
10 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Speech Communication (Speech Commun.), 2021
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
111
63
0
09 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
SSL
200
36
0
01 Jun 2021
Unsupervised Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
341
292
0
24 May 2021
SUPERB: Speech processing Universal PERformance Benchmark
Interspeech (Interspeech), 2021
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
409
1,065
0
03 May 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Interspeech (Interspeech), 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
213
70
0
23 Apr 2021
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency
Jinchuan Tian
Rongzhi Gu
Helin Wang
Yuexian Zou
129
0
0
08 Apr 2021
Utilizing Self-supervised Representations for MOS Prediction
Interspeech (Interspeech), 2021
Wei-Cheng Tseng
Chien-yu Huang
Wei-Tsung Kao
Yist Y. Lin
Hung-yi Lee
SSL
332
71
0
07 Apr 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
Interspeech (Interspeech), 2021
Jheng-hao Lin
Yist Y. Lin
C. Chien
Hung-yi Lee
365
62
0
07 Apr 2021
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model
Interspeech (Interspeech), 2021
Apoorv Vyas
S. Madikeri
H. Bourlard
107
16
0
06 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Interspeech (Interspeech), 2021
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
309
256
0
02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Interspeech (Interspeech), 2021
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
155
4
0
02 Apr 2021
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Computer Speech and Language (CSL), 2021
Haoqi Li
Brian R. Baucom
Shrikanth Narayanan
P. Georgiou
117
2
0
01 Apr 2021
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Interspeech (Interspeech), 2021
Jingsong Wang
Yuxuan He
Chunyu Zhao
Qijie Shao
Wei-Wei Tu
Tom Ko
Hung-yi Lee
Lei Xie
106
5
0
31 Mar 2021
Self-supervised representation learning from 12-lead ECG data
Temesgen Mehari
Nils Strodthoff
SSL
243
176
0
23 Mar 2021
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Jama Hussein Mohamud
Lloyd Thompson
A. Ndoye
Laurent Besacier
158
7
0
16 Mar 2021
XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition
Zi-qiang Zhang
Yan Song
Ming Wu
Xin Fang
Lirong Dai
SSL
108
21
0
15 Mar 2021
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ruchao Fan
Amber Afshan
Abeer Alwan
124
14
0
12 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
116
6
0
03 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Yuan Gong
Yu-An Chung
James R. Glass
VLM
335
161
0
02 Feb 2021