2004.05274
Cited By
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
11 April 2020
Yu-An Chung
James R. Glass
SSL
Papers citing "Improved Speech Representations with Multi-Target Autoregressive Predictive Coding"
37 / 37 papers shown
Title
Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Hsin-Tien Chiang
Szu-Wei Fu
Hsin-Min Wang
Yu Tsao
John H. L. Hansen
152
8
0
15 Nov 2023
Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models
Interspeech, 2023
Asad Ullah
Alessandro Ragano
Andrew Hines
352
4
0
22 Sep 2023
On the Robustness of Arabic Speech Dialect Identification
Interspeech, 2023
Peter Sullivan
AbdelRahim Elmadany
Muhammad Abdul-Mageed
132
16
0
01 Jun 2023
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Huadai Liu
Rongjie Huang
Xuan Lin
Wenqiang Xu
Maozong Zheng
Hong Chen
Jinzheng He
Zhou Zhao
DiffM
249
30
0
22 May 2023
Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition
Annual IEEE International Conference on Pervasive Computing and Communications (PerCom), 2022
H. Haresamudram
Irfan Essa
Thomas Ploetz
AI4TS
270
20
0
11 Nov 2022
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
Neural Information Processing Systems (NeurIPS), 2022
Yonggan Fu
Yang Zhang
Kaizhi Qian
Zhifan Ye
Zhongzhi Yu
Cheng-I Jeff Lai
Yingyan Lin
316
10
0
02 Nov 2022
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Spoken Language Technology Workshop (SLT), 2022
Tzu-hsun Feng
Annie Dong
Ching-Feng Yeh
Shu-Wen Yang
Tzu-Quan Lin
...
Xuankai Chang
Shinji Watanabe
Abdel-rahman Mohamed
Shang-Wen Li
Hung-yi Lee
ELM
SSL
196
38
0
16 Oct 2022
Self-Supervised Speech Representation Learning: A Review
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
574
433
0
21 May 2022
Federated Self-Supervised Learning for Acoustic Event Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Meng Feng
Chieh-Chi Kao
Qingming Tang
Ming Sun
Viktor Rozgic
Spyros Matsoukas
Chao Wang
142
14
0
22 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
207
13
0
01 Mar 2022
Assessing the State of Self-Supervised Human Activity Recognition using Wearables
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2022
H. Haresamudram
Irfan Essa
Thomas Plötz
SSL
288
112
0
22 Feb 2022
Textless Speech Emotion Conversion using Discrete and Decomposed Representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Nguyen
M. Rivière
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
317
42
0
14 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Michael Zeng
Xiangzhan Yu
Furu Wei
SSL
735
2,548
0
26 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech models
Yen Meng
Yi-Hui Chou
Andy T. Liu
Hung-yi Lee
231
31
0
15 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
541
197
0
05 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
119
3
0
29 Sep 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
Automatic Speech Recognition & Understanding (ASRU), 2021
Yu-An Chung
Yu Zhang
Wei Han
Chung-Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
SSL
VLM
199
486
0
07 Aug 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Wei-Ning Hsu
Benjamin Bolte
Yifan Hao
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
468
3,870
0
14 Jun 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Interspeech, 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
213
70
0
23 Apr 2021
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Jama Hussein Mohamud
Lloyd Thompson
A. Ndoye
Laurent Besacier
150
7
0
16 Mar 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
116
6
0
03 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Transactions of the Association for Computational Linguistics (TACL), 2021
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
407
421
0
01 Feb 2021
End2End Acoustic to Semantic Transduction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Valentin Pelloin
Nathalie Camelin
Antoine Laurent
R. Mori
Antoine Caubrière
Yannick Esteve
S. Meignier
102
15
0
01 Feb 2021
A comparison of self-supervised speech representations as input features for unsupervised acoustic word embeddings
Spoken Language Technology Workshop (SLT), 2020
Lisa van Staden
Herman Kamper
SSL
133
17
0
14 Dec 2020
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Shaoshi Ling
Yuzong Liu
140
112
0
11 Dec 2020
Contrastive Predictive Coding for Human Activity Recognition
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2020
H. Haresamudram
Irfan Essa
Thomas Ploetz
246
138
0
09 Dec 2020
End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
E. Morais
H. Kuo
Samuel Thomas
Zoltán Tüske
Brian Kingsbury
109
12
0
16 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
165
7
0
11 Nov 2020
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies
Interspeech, 2020
Alexander H. Liu
Yu-An Chung
James R. Glass
SSL
191
93
0
01 Nov 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Interspeech, 2020
Dongwei Jiang
Wubo Li
Miao Cao
Wei Zou
Xiangang Li
SSL
254
71
0
27 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
165
6
0
24 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
Tomoki Toda
BDL
223
45
0
23 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations
Yu-An Chung
Yonatan Belinkov
James R. Glass
SSL
330
44
0
22 Oct 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
309
45
0
07 Aug 2020
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
SSL
177
31
0
20 May 2020
Vector-Quantized Autoregressive Predictive Coding
Yu-An Chung
Hao Tang
James R. Glass
SSL
150
121
0
17 May 2020
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
384
388
0
25 Oct 2019