Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.03416
Cited By
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
6 April 2019
Santiago Pascual
Mirco Ravanelli
Joan Serrà
Antonio Bonafonte
Yoshua Bengio
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks"
50 / 147 papers shown
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling
Puyuan Peng
David Harwath
SSL
212
28
0
07 Feb 2022
Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
A. Shirian
Krishna Somandepalli
T. Guha
SSL
381
12
0
31 Jan 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks
Computer Vision and Pattern Recognition (CVPR), 2022
A. Vasudevan
Dengxin Dai
Luc Van Gool
SSL
210
7
0
04 Jan 2022
Self-Supervised Learning for speech recognition with Intermediate layer supervision
Chengyi Wang
Yu-Huan Wu
Sanyuan Chen
Shujie Liu
Jinyu Li
Yao Qian
Zhenglu Yang
SSL
185
35
0
16 Dec 2021
Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?
Lasse Borgholt
Jakob Drachmann Havtorn
Mostafa Abdou
Joakim Edin
Lars Maaløe
Anders Søgaard
Christian Igel
SSL
127
8
0
29 Nov 2021
Music Classification: Beyond Supervised Learning, Towards Real-world Applications
Minz Won
Janne Spijkervet
Keunwoo Choi
VLM
103
17
0
23 Nov 2021
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Suwon Shon
Ankita Pasad
Felix Wu
Pablo Brusco
Yoav Artzi
Karen Livescu
Kyu Jeong Han
AuLLM
ELM
255
90
0
19 Nov 2021
Joint Unsupervised and Supervised Training for Multilingual ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Junwen Bai
Yue Liu
Yu Zhang
Ankur Bapna
Nikhil Siddhartha
K. Sim
Tara N. Sainath
210
64
0
15 Nov 2021
Membership Inference Attacks Against Self-supervised Speech Models
Interspeech (Interspeech), 2021
Wei-Cheng Tseng
Wei-Tsung Kao
Hung-yi Lee
339
18
0
09 Nov 2021
Fusing ASR Outputs in Joint Training for Speech Emotion Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yuanchao Li
P. Bell
Catherine Lai
251
64
0
29 Oct 2021
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Hyeong-Seok Choi
Juheon Lee
W. Kim
Jie Hwan Lee
Hoon Heo
Kyogu Lee
213
177
0
27 Oct 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
1.1K
2,642
0
26 Oct 2021
SSAST: Self-Supervised Audio Spectrogram Transformer
Yuan Gong
Cheng-I Jeff Lai
Yu-An Chung
James R. Glass
ViT
347
354
0
19 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
Henry Gouk
Chen Change Loy
Timothy M. Hospedales
SSL
OOD
AI4TS
238
347
0
18 Oct 2021
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning
Yi-Chen Chen
Shu-Wen Yang
Cheng-Kuang Lee
Simon See
Hung-yi Lee
SSL
135
14
0
18 Oct 2021
Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Joel Shor
A. Jansen
Wei Han
Daniel S. Park
Yu Zhang
SSL
AI4TS
337
65
0
09 Oct 2021
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition
Interspeech (Interspeech), 2021
Hao Yen
Pin-Jui Ku
Chao-Han Huck Yang
Hu Hu
Sabato Marco Siniscalchi
Pin-Yu Chen
Yu Tsao
482
5
0
08 Oct 2021
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT
Heng-Jui Chang
Shu-Wen Yang
Hung-yi Lee
SSL
601
202
0
05 Oct 2021
Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization
Donmoon Lee
Kyogu Lee
145
3
0
29 Sep 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
144
3
0
29 Sep 2021
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Automatic Speech Recognition & Understanding (ASRU), 2021
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
122
7
0
24 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Chenyu You
Polydoros Giannouris
Yuexian Zou
SSL
196
65
0
08 Sep 2021
Fine-Grained Classroom Activity Detection from Audio with Neural Networks
Eric Slyman
Chris Daw
Morgan Skrabut
A. Usenko
Brian Hutchinson
HAI
165
7
0
29 Jul 2021
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Samuel Kessler
Bethan Thomas
S. Karout
SSL
160
31
0
26 Jul 2021
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
Takashi Maekaku
Xuankai Chang
Yuya Fujita
Li-Wei Chen
Shinji Watanabe
Alexander I. Rudnicky
243
13
0
13 Jul 2021
Layer-wise Analysis of a Self-supervised Speech Representation Model
Automatic Speech Recognition & Understanding (ASRU), 2021
Ankita Pasad
Ju-Chieh Chou
Karen Livescu
SSL
319
373
0
10 Jul 2021
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
299
13
0
01 Jul 2021
Representation based meta-learning for few-shot spoken intent recognition
Interspeech (Interspeech), 2020
Ashish R. Mittal
Samarth Bharadwaj
Shreya Khare
Saneem A. Chemmengath
Karthik Sankaranarayanan
Brian Kingsbury
142
11
0
29 Jun 2021
LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn W. Schuller
Maja Pantic
SSL
152
59
0
16 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Wei-Ning Hsu
Benjamin Bolte
Yifan Hao
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
532
3,993
0
14 Jun 2021
SUPERB: Speech processing Universal PERformance Benchmark
Interspeech (Interspeech), 2021
Shu-Wen Yang
Po-Han Chi
Yung-Sung Chuang
Cheng-I Jeff Lai
Kushal Lakhotia
...
Shuyan Dong
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
SSL
449
1,073
0
03 May 2021
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021
Rodrigo Mira
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Björn W. Schuller
Maja Pantic
255
52
0
27 Apr 2021
Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Prachi Singh
Sriram Ganapathy
SSL
103
11
0
19 Apr 2021
Conditional independence for pretext task selection in Self-supervised speech representation learning
Interspeech (Interspeech), 2021
Salah Zaiem
Titouan Parcollet
S. Essid
SSL
182
4
0
15 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers
Loren Lugosch
Piyush Papreja
Mirco Ravanelli
A. Heba
Titouan Parcollet
153
14
0
04 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Interspeech (Interspeech), 2021
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
390
257
0
02 Apr 2021
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Computer Speech and Language (CSL), 2021
Haoqi Li
Brian R. Baucom
Shrikanth Narayanan
P. Georgiou
133
2
0
01 Apr 2021
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Interspeech (Interspeech), 2021
Jingsong Wang
Yuxuan He
Chunyu Zhao
Qijie Shao
Wei-Wei Tu
Tom Ko
Hung-yi Lee
Lei Xie
118
5
0
31 Mar 2021
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation
Spoken Language Technology Workshop (SLT), 2021
C. Jacobs
Yevgen Matusevych
Herman Kamper
229
24
0
19 Mar 2021
Contrastive Learning of Musical Representations
International Society for Music Information Retrieval Conference (ISMIR), 2021
Janne Spijkervet
J. Burgoyne
364
139
0
17 Mar 2021
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning
Jama Hussein Mohamud
Lloyd Thompson
A. Ndoye
Laurent Besacier
174
7
0
16 Mar 2021
Multi-view Audio and Music Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Huy P Phan
Huy Le Nguyen
Oliver Y. Chén
L. D. Pham
P. Koch
Ian Mcloughlin
Alfred Mertins
118
18
0
03 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jun Wang
Max W. Y. Lam
Jane Polak Scowcroft
Dong Yu
125
7
0
02 Mar 2021
Improving speech recognition models with small samples for air traffic control systems
Neurocomputing (Neurocomputing), 2021
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
182
33
0
16 Feb 2021
Multichannel-based learning for audio object extraction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Daniel Arteaga
Jordi Pons
DiffM
245
3
0
11 Feb 2021
Multi-Task Self-Supervised Pre-Training for Music Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ho-Hsiang Wu
Chieh-Chi Kao
Qingming Tang
Ming Sun
Brian McFee
J. P. Bello
Chao Wang
SSL
490
39
0
05 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
132
6
0
03 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Transactions of the Association for Computational Linguistics (TACL), 2021
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
592
433
0
01 Feb 2021
On Scaling Contrastive Representations for Low-Resource Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Lasse Borgholt
T. M. S. Tax
Jakob Drachmann Havtorn
Lars Maaløe
Christian Igel
SSL
147
5
0
01 Feb 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
International Conference on Machine Learning (ICML), 2021
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
273
134
0
19 Jan 2021
Previous
1
2
3
Next