ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.03240
  4. Cited By
An Unsupervised Autoregressive Model for Speech Representation Learning
v1v2 (latest)

An Unsupervised Autoregressive Model for Speech Representation Learning

5 April 2019
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
    SSL
ArXiv (abs)PDFHTML

Papers citing "An Unsupervised Autoregressive Model for Speech Representation Learning"

50 / 269 papers shown
Title
Self-Supervised Representation Learning: Introduction, Advances and
  Challenges
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
Henry Gouk
Chen Change Loy
Timothy M. Hospedales
SSLOODAI4TS
194
334
0
18 Oct 2021
Speech Representation Learning Through Self-supervised Pretraining And
  Multi-task Finetuning
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning
Yi-Chen Chen
Shu-Wen Yang
Cheng-Kuang Lee
Simon See
Hung-yi Lee
SSL
127
13
0
18 Oct 2021
DECAR: Deep Clustering for learning general-purpose Audio
  Representations
DECAR: Deep Clustering for learning general-purpose Audio Representations
Sreyan Ghosh
Sandesh V Katta
Ashish Seth
S. Umesh
SSL
153
12
0
17 Oct 2021
Word Order Does Not Matter For Speech Recognition
Word Order Does Not Matter For Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Vineel Pratap
Qiantong Xu
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
229
4
0
12 Oct 2021
UniSpeech-SAT: Universal Speech Representation Learning with Speaker
  Aware Pre-Training
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-TrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sanyuan Chen
Yu Wu
Chengyi Wang
Zhengyang Chen
Zhuo Chen
...
Jian Wu
Yao Qian
Furu Wei
Jinyu Li
Xiangzhan Yu
SSL
166
119
0
12 Oct 2021
Injecting Text and Cross-lingual Supervision in Few-shot Learning from
  Self-Supervised Models
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sanjeev Khudanpur
Desh Raj
Sanjeev Khudanpur
171
8
0
10 Oct 2021
Universal Paralinguistic Speech Representations Using Self-Supervised
  Conformers
Universal Paralinguistic Speech Representations Using Self-Supervised ConformersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Joel Shor
A. Jansen
Wei Han
Daniel S. Park
Yu Zhang
SSLAI4TS
313
65
0
09 Oct 2021
An Exploration of Self-Supervised Pretrained Representations for
  End-to-End Speech Recognition
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2021
Xuankai Chang
Takashi Maekaku
Pengcheng Guo
Jing Shi
Yen-Ju Lu
...
Tianzi Wang
Shu-Wen Yang
Yu Tsao
Hung-yi Lee
Shinji Watanabe
SSLAI4TS
128
85
0
09 Oct 2021
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition
SCaLa: Supervised Contrastive Learning for End-to-End Speech RecognitionInterspeech (Interspeech), 2021
Li Fu
Xiaoxiao Li
Runyu Wang
Lu Fan
Zhengchen Zhang
Meng Chen
Youzheng Wu
Xiaodong He
SSL
140
3
0
08 Oct 2021
Improving Pseudo-label Training For End-to-end Speech Recognition Using
  Gradient Mask
Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient MaskIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Shaoshi Ling
Chen Shen
Meng Cai
Zejun Ma
VLMSSL
127
10
0
08 Oct 2021
Mandarin-English Code-switching Speech Recognition with Self-supervised
  Speech Representation Models
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Liang-Hsuan Tseng
Yu-Kuan Fu
Heng-Jui Chang
Hung-yi Lee
SSL
114
17
0
07 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation
  of Hidden-unit BERT
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
549
198
0
05 Oct 2021
Speech Technology for Everyone: Automatic Speech Recognition for
  Non-Native English with Transfer Learning
Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Toshiko Shibano
Xinyi Zhang
Miao Li
Haejin Cho
Peter Sullivan
Muhammad Abdul-Mageed
VLM
212
18
0
01 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish
  Dutch
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
119
3
0
29 Sep 2021
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning
  for Automatic Speech Recognition
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech RecognitionIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2021
Yu Zhang
Daniel S. Park
Wei Han
James Qin
Anmol Gulati
...
Zhifeng Chen
Quoc V. Le
Chung-Cheng Chiu
Ruoming Pang
Yonghui Wu
SSL
192
196
0
27 Sep 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Simple and Effective Zero-shot Cross-lingual Phoneme RecognitionInterspeech (Interspeech), 2021
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
301
116
0
23 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for
  Spoken Question Answering
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Chenyu You
Polydoros Giannouris
Yuexian Zou
SSL
187
64
0
08 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling
Text-Free Prosody-Aware Generative Spoken Language ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Eugene Kharitonov
Ann Lee
Adam Polyak
Yossi Adi
Jade Copet
...
Tu Nguyen
M. Rivière
Abdel-rahman Mohamed
Emmanuel Dupoux
Wei-Ning Hsu
214
136
0
07 Sep 2021
Speech Representations and Phoneme Classification for Preserving the
  Endangered Language of Ladin
Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin
Zane Durante
Leena Mathur
Eric Ye
Sichong Zhao
Tejas Ramdas
Khalil Iskarous
217
0
0
27 Aug 2021
Using Large Pre-Trained Models with Cross-Modal Attention for
  Multi-Modal Emotion Recognition
Using Large Pre-Trained Models with Cross-Modal Attention for Multi-Modal Emotion Recognition
Krishna D N Freshworks
122
14
0
22 Aug 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling
  for Self-Supervised Speech Pre-Training
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingAutomatic Speech Recognition & Understanding (ASRU), 2021
Yu-An Chung
Yu Zhang
Wei Han
Chung-Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
SSLVLM
207
486
0
07 Aug 2021
Analyzing Speaker Information in Self-Supervised Models to Improve
  Zero-Resource Speech Processing
Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing
Benjamin van Niekerk
Leanne Nortje
Matthew Baas
Herman Kamper
SSL
204
34
0
02 Aug 2021
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised
  Speech Representation Learning
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Samuel Kessler
Bethan Thomas
S. Karout
SSL
120
31
0
26 Jul 2021
ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language
  Modelling track, 2021 edition
ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track, 2021 edition
Afra Alishahia
Grzegorz Chrupała
Alejandrina Cristià
Emmanuel Dupoux
Bertrand Higy
Marvin Lavechin
Okko Räsänen
Chen Yu
129
7
0
14 Jul 2021
Dropout Regularization for Self-Supervised Learning of Transformer
  Encoder Speech Representation
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech RepresentationInterspeech (Interspeech), 2021
Jian Luo
Jianzong Wang
Ning Cheng
Jing Xiao
SSL
135
6
0
09 Jul 2021
Pretext Tasks selection for multitask self-supervised speech
  representation learning
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
218
13
0
01 Jul 2021
As easy as APC: overcoming missing data and class imbalance in time
  series with self-supervised learning
As easy as APC: overcoming missing data and class imbalance in time series with self-supervised learning
Fiorella Wever
Thomas Anderson Keller
L. Symul
Victor Garcia
SSLAI4TS
185
1
0
29 Jun 2021
Fusion of Embeddings Networks for Robust Combination of Text Dependent
  and Independent Speaker Recognition
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker RecognitionInterspeech (Interspeech), 2021
Ruirui Li
C. Ju
Zeya Chen
Hongda Mao
Oguz H. Elibol
A. Stolcke
114
4
0
18 Jun 2021
Low Resource German ASR with Untranscribed Data Spoken by Non-native
  Children -- INTERSPEECH 2021 Shared Task SPAPL System
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL SystemInterspeech (Interspeech), 2021
Jinhan Wang
Yunzheng Zhu
Ruchao Fan
Wei Chu
Abeer Alwan
90
8
0
18 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden UnitsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Wei-Ning Hsu
Benjamin Bolte
Yifan Hao
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
472
3,879
0
14 Jun 2021
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech RecognitionNeural Information Processing Systems (NeurIPS), 2021
Cheng-I Jeff Lai
Yang Zhang
Alexander H. Liu
Shiyu Chang
Yi-Lun Liao
Yung-Sung Chuang
Kaizhi Qian
Sameer Khurana
David D. Cox
James R. Glass
VLM
266
86
0
10 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Unsupervised Automatic Speech Recognition: A ReviewSpeech Communication (Speech Commun.), 2021
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLMSSL
111
63
0
09 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by
  Self-Supervised Learning
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAMLSSL
200
36
0
01 Jun 2021
Unsupervised Speech Recognition
Unsupervised Speech RecognitionNeural Information Processing Systems (NeurIPS), 2021
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
341
292
0
24 May 2021
SUPERB: Speech processing Universal PERformance Benchmark
SUPERB: Speech processing Universal PERformance BenchmarkInterspeech (Interspeech), 2021
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
409
1,065
0
03 May 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised
  Representation Learning from Speech
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechInterspeech (Interspeech), 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
213
70
0
23 Apr 2021
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via
  Layer Consistency
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency
Jinchuan Tian
Rongzhi Gu
Helin Wang
Yuexian Zou
129
0
0
08 Apr 2021
Utilizing Self-supervised Representations for MOS Prediction
Utilizing Self-supervised Representations for MOS PredictionInterspeech (Interspeech), 2021
Wei-Cheng Tseng
Chien-yu Huang
Wei-Tsung Kao
Yist Y. Lin
Hung-yi Lee
SSL
332
71
0
07 Apr 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised
  Pretrained Representations
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained RepresentationsInterspeech (Interspeech), 2021
Jheng-hao Lin
Yist Y. Lin
C. Chien
Hung-yi Lee
365
62
0
07 Apr 2021
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0
  acoustic model
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic modelInterspeech (Interspeech), 2021
Apoorv Vyas
S. Madikeri
H. Bourlard
107
16
0
06 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised
  Pre-Training
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-TrainingInterspeech (Interspeech), 2021
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
309
256
0
02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a
  Language-Independent Subword Discriminative Feature Representation
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature RepresentationInterspeech (Interspeech), 2021
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
155
4
0
02 Apr 2021
Unsupervised Speech Representation Learning for Behavior Modeling using
  Triplet Enhanced Contextualized Networks
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized NetworksComputer Speech and Language (CSL), 2021
Haoqi Li
Brian R. Baucom
Shrikanth Narayanan
P. Georgiou
117
2
0
01 Apr 2021
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Auto-KWS 2021 Challenge: Task, Datasets, and BaselinesInterspeech (Interspeech), 2021
Jingsong Wang
Yuxuan He
Chunyu Zhao
Qijie Shao
Wei-Wei Tu
Tom Ko
Hung-yi Lee
Lei Xie
106
5
0
31 Mar 2021
Self-supervised representation learning from 12-lead ECG data
Self-supervised representation learning from 12-lead ECG data
Temesgen Mehari
Nils Strodthoff
SSL
243
176
0
23 Mar 2021
Fast Development of ASR in African Languages using Self Supervised
  Speech Representation Learning
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Jama Hussein Mohamud
Lloyd Thompson
A. Ndoye
Laurent Besacier
158
7
0
16 Mar 2021
XLST: Cross-lingual Self-training to Learn Multilingual Representation
  for Low Resource Speech Recognition
XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition
Zi-qiang Zhang
Yan Song
Ming Wu
Xin Fang
Lirong Dai
SSL
108
21
0
15 Mar 2021
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised
  Pre-training and Its Application to Children's ASR
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ruchao Fan
Amber Afshan
Abeer Alwan
124
14
0
12 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised
  Multi-Granularity Framework
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
116
6
0
03 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and
  Aggregation
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and AggregationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Yuan Gong
Yu-An Chung
James R. Glass
VLM
335
161
0
02 Feb 2021
Previous
123456
Next