Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

6 April 2019

Mirco Ravanelli

Papers citing "Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks"

50 / 147 papers shown

Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling

Puyuan Peng

David Harwath

SSL

212

07 Feb 2022

Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

381

31 Jan 2022

Sound and Visual Representation Learning with Multiple Pretraining Tasks

Computer Vision and Pattern Recognition (CVPR), 2022

A. Vasudevan

Dengxin Dai

Luc Van Gool

SSL

210

04 Jan 2022

Self-Supervised Learning for speech recognition with Intermediate layer supervision

185

16 Dec 2021

Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?

Lasse Borgholt

Jakob Drachmann Havtorn

127

29 Nov 2021

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Minz Won

Janne Spijkervet

Keunwoo Choi

VLM

103

23 Nov 2021

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

255

19 Nov 2021

Joint Unsupervised and Supervised Training for Multilingual ASR

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

210

15 Nov 2021

Membership Inference Attacks Against Self-supervised Speech Models

Interspeech, 2021

Wei-Cheng Tseng

Wei-Tsung Kao

Hung-yi Lee

339

09 Nov 2021

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Yuanchao Li

P. Bell

Catherine Lai

251

29 Oct 2021

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations

213

177

27 Oct 2021

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

...

Jian Wu

1.1K

2,642

26 Oct 2021

SSAST: Self-Supervised Audio Spectrogram Transformer

347

354

19 Oct 2021

Self-Supervised Representation Learning: Introduction, Advances and Challenges

Linus Ericsson

Henry Gouk

Chen Change Loy

Timothy M. Hospedales

SSL OOD AI4TS

238

347

18 Oct 2021

Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning

135

18 Oct 2021

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

337

09 Oct 2021

Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition

Interspeech, 2021

Sabato Marco Siniscalchi

Pin-Yu Chen

Yu Tsao

482

08 Oct 2021

DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT

601

202

05 Oct 2021

Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization

Donmoon Lee

Kyogu Lee

145

29 Sep 2021

Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch

Jakob Poncelet

Hugo Van hamme

SSL

144

29 Sep 2021

Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification

Automatic Speech Recognition & Understanding (ASRU), 2021

Xuechen Liu

Md. Sahidullah

Tomi Kinnunen

122

24 Sep 2021

Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

196

08 Sep 2021

Fine-Grained Classroom Activity Detection from Audio with Neural Networks

165

29 Jul 2021

An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

160

26 Jul 2021

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021

Li-Wei Chen

Alexander I. Rudnicky

243

13 Jul 2021

Layer-wise Analysis of a Self-supervised Speech Representation Model

Automatic Speech Recognition & Understanding (ASRU), 2021

319

373

10 Jul 2021

Pretext Tasks selection for multitask self-supervised speech representation learning

299

01 Jul 2021

Representation based meta-learning for few-shot spoken intent recognition

Interspeech, 2020

Ashish R. Mittal

Samarth Bharadwaj

Shreya Khare

Saneem A. Chemmengath

Karthik Sankaranarayanan

Brian Kingsbury

142

29 Jun 2021

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

Björn W. Schuller

152

16 Jun 2021

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

532

3,993

14 Jun 2021

SUPERB: Speech processing Universal PERformance Benchmark

Interspeech, 2021

...

449

1,073

03 May 2021

End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021

Rodrigo Mira

Konstantinos Vougioukas

Pingchuan Ma

Stavros Petridis

Björn W. Schuller

Maja Pantic

255

27 Apr 2021

Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Prachi Singh

Sriram Ganapathy

SSL

103

19 Apr 2021

Conditional independence for pretext task selection in Self-supervised speech representation learning

Interspeech, 2021

182

15 Apr 2021

Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Mirco Ravanelli

153

04 Apr 2021

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training

Interspeech, 2021

...

390

257

02 Apr 2021

Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks

Computer Speech and Language (CSL), 2021

Haoqi Li

Brian R. Baucom

Shrikanth Narayanan

P. Georgiou

133

01 Apr 2021

Auto-KWS 2021 Challenge: Task, Datasets, and Baselines

Interspeech, 2021

Qijie Shao

Lei Xie

118

31 Mar 2021

Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation

Spoken Language Technology Workshop (SLT), 2021

C. Jacobs

Yevgen Matusevych

Herman Kamper

229

19 Mar 2021

Contrastive Learning of Musical Representations

International Society for Music Information Retrieval Conference (ISMIR), 2021

Janne Spijkervet

J. Burgoyne

364

139

17 Mar 2021

Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning

174

16 Mar 2021

Multi-view Audio and Music Classification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

118

03 Mar 2021

Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect

AAAI Conference on Artificial Intelligence (AAAI), 2021

125

02 Mar 2021

Improving speech recognition models with small samples for air traffic control systems

Neurocomputing, 2021

182

16 Feb 2021

Multichannel-based learning for audio object extraction

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Daniel Arteaga

Jordi Pons

DiffM

245

11 Feb 2021

Multi-Task Self-Supervised Pre-Training for Music Classification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Ming Sun

490

05 Feb 2021

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework

132

03 Feb 2021

Generative Spoken Language Modeling from Raw Audio

Transactions of the Association for Computational Linguistics (TACL), 2021

Yossi Adi

...

592

433

01 Feb 2021

On Scaling Contrastive Representations for Low-Resource Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Lasse Borgholt

T. M. S. Tax

Jakob Drachmann Havtorn

Lars Maaløe

Christian Igel

SSL

147

01 Feb 2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

International Conference on Machine Learning (ICML), 2021

273

134

19 Jan 2021