Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

6 April 2019

Mirco Ravanelli

Papers citing "Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks"

47 / 147 papers shown

The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks

IEEE Open Journal of Signal Processing (JOSP), 2020

Siyuan Feng

O. Scharenborg

SSL

206

17 Dec 2020

Self-Supervised Time Series Representation Learning by Inter-Intra Relational Reasoning

Haoyi Fan

Fengbin Zhang

Yue Gao

AI4TS

172

27 Nov 2020

Towards Semi-Supervised Semantics Understanding from Speech

184

11 Nov 2020

Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies

Interspeech (Interspeech), 2020

207

01 Nov 2020

Interpretable Representation Learning for Speech and Audio Signals Based on Relevance Weighting

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Purvi Agrawal

Sriram Ganapathy

147

29 Oct 2020

Robust Raw Waveform Speech Recognition Using Relevance Weighted Representations

Interspeech (Interspeech), 2020

Purvi Agrawal

Sriram Ganapathy

106

29 Oct 2020

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Interspeech (Interspeech), 2020

Dongwei Jiang

Wubo Li

Miao Cao

Wei Zou

Xiangang Li

SSL

296

27 Oct 2020

Probing Acoustic Representations for Phonetic Properties

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Danni Ma

Neville Ryant

M. Liberman

337

25 Oct 2020

Similarity Analysis of Self-Supervised Speech Representations

340

22 Oct 2020

Contrastive Learning of General-Purpose Audio Representations

253

311

21 Oct 2020

FastVC: Fast Voice Conversion with non-parallel data

Oriol Barbany

Milos Cernak

130

08 Oct 2020

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

Yingbo Zhou

195

07 Oct 2020

SESQA: semi-supervised learning for speech quality assessment

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Joan Serrà

Jordi Pons

Santiago Pascual

231

01 Oct 2020

Detecting Parkinson's Disease From an Online Speech-task

...

Stella Jensen-Roberts

M. R. Ali

Ray Dorsey

E. Hoque

141

02 Sep 2020

Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection

Soham Deshmukh

Bhiksha Raj

Rita Singh

119

17 Aug 2020

Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

201

116

15 Aug 2020

Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview

316

14 Aug 2020

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

320

07 Aug 2020

TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

532

393

12 Jul 2020

Learning Speech Representations from Raw Audio by Joint Audiovisual Self-Supervision

141

08 Jul 2020

Self-supervised Learning for Speech Enhancement

Yuchun Wang

Shrikant Venkataramani

Paris Smaragdis

SSL

160

18 Jun 2020

Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers

221

09 Jun 2020

Self-Supervised Dynamic Networks for Covariate Shift Robustness

182

06 Jun 2020

CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning

180

04 Jun 2020

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

Interspeech (Interspeech), 2020

175

03 Jun 2020

Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks

Yuichiro Koyama

Tyler Vuong

Stefan Uhlich

Bhiksha Raj

246

23 May 2020

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

Wei Zou

Xiangang Li

SSL

208

20 May 2020

Vector-Quantized Autoregressive Predictive Coding

Yu-An Chung

Hao Tang

James R. Glass

SSL

186

124

17 May 2020

Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?

IEEE Transactions on Affective Computing (IEEE TAC), 2020

417

04 May 2020

An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

Interspeech (Interspeech), 2020

...

Björn W. Schuller

259

102

30 Apr 2020

From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech

International Conference on Learning Representations (ICLR), 2020

128

13 Apr 2020

Improved Speech Representations with Multi-Target Autoregressive Predictive Coding

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

Yu-An Chung

James R. Glass

SSL

206

11 Apr 2020

A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification

International Conference on Statistical Language and Speech Processing (ICSLSP), 2020

172

31 Mar 2020

Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data

EURASIP Journal on Audio, Speech, and Music Processing (JEASMP), 2020

Vincent Roger

Jérôme Farinas

J. Pinquier

115

09 Mar 2020

Towards Learning a Universal Non-Semantic Representation of Speech

Interspeech (Interspeech), 2020

Félix de Chaumont Quitry

556

166

25 Feb 2020

Limitations of weak labels for embedding and tagging

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Nicolas Turpault

Romain Serizel

Emmanuel Vincent

317

05 Feb 2020

Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

229

28 Jan 2020

Multi-task self-supervised learning for Robust Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Mirco Ravanelli

467

303

25 Jan 2020

Visually Guided Self Supervised Learning of Speech Representations

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Abhinav Shukla

Konstantinos Vougioukas

166

13 Jan 2020

Robust Estimation of Hypernasality in Dysarthria with Acoustic Model Likelihood Features

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019

Michael Stephen Saxon

213

26 Nov 2019

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

International Conference on Learning Representations (ICLR), 2019

David Harwath

Wei-Ning Hsu

James R. Glass

170

21 Nov 2019

Speaker-invariant Affective Representation Learning via Adversarial Training

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Shrikanth Narayanan

346

04 Nov 2019

Learning audio representations via phase prediction

Félix de Chaumont Quitry

Marco Tagliasacchi

Dominik Roblek

SSL AI4TS

105

25 Oct 2019

Generative Pre-Training for Speech with Autoregressive Predictive Coding

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Yu-An Chung

James R. Glass

SSL

330

182

23 Oct 2019

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Wei Zou

Xiangang Li

263

105

22 Oct 2019

Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN

Speech Synthesis Workshop (SSW), 2019

David Álvarez

Santiago Pascual

Antonio Bonafonte

177

03 Jun 2019

Self-supervised audio representation learning for mobile devices

Marco Tagliasacchi

Beat Gfeller

Félix de Chaumont Quitry

Dominik Roblek

SSL AI4TS

157

24 May 2019