Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.07902
Cited By
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data
22 September 2017
Wei-Ning Hsu
Yu Zhang
James R. Glass
BDL
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"
47 / 47 papers shown
Title
Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning
Runwu Shi
Katsutoshi Itoyama
K. Nakadai
SSL
DRL
39
1
0
31 Dec 2024
Unsupervised Representation Learning from Sparse Transformation Analysis
Yue Song
Thomas Anderson Keller
Yisong Yue
Pietro Perona
Max Welling
DRL
29
0
0
07 Oct 2024
Cross-Utterance Conditioned VAE for Speech Generation
Y. Li
Cheng Yu
Guangzhi Sun
Weiqin Zu
Zheng Tian
...
Wei Pan
Chao Zhang
Jun Wang
Yang Yang
Fanglei Sun
11
2
0
08 Sep 2023
Multifactor Sequential Disentanglement via Structured Koopman Autoencoders
Nimrod Berman
Ilana D Naiman
Omri Azencot
CoGe
22
21
0
30 Mar 2023
Learning Interpretable Low-dimensional Representation via Physical Symmetry
Xuanjie Liu
Daniel Y. Chin
Yichen Huang
Gus Xia
24
3
0
05 Feb 2023
A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training
Yang Xiang
Jesper Lisby Højvang
M. Rasmussen
M. G. Christensen
DRL
21
5
0
16 Nov 2022
Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation
Chendong Zhao
Jianzong Wang
Xiaoyang Qu
Haoqian Wang
Jing Xiao
SSL
30
1
0
15 Oct 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
17
289
0
30 Sep 2022
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders
Lies Bollens
T. Francart
Hugo Van hamme
DRL
BDL
9
14
0
01 Jul 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
128
349
0
21 May 2022
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization
Wei Wei
Hengguan Huang
Xiangming Gu
Hao Wang
Ye Wang
BDL
22
0
0
05 May 2022
Improved far-field speech recognition using Joint Variational Autoencoder
Shashi Kumar
S. Rath
Abhishek Pandey
DRL
10
0
0
24 Apr 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
19
11
0
01 Mar 2022
Benchmarking Generative Latent Variable Models for Speech
Jakob Drachmann Havtorn
Lasse Borgholt
Søren Hauberg
J. Frellsen
Lars Maaløe
18
3
0
22 Feb 2022
Noise-robust voice conversion with domain adversarial training
Hongqiang Du
Lei Xie
Haizhou Li
11
11
0
26 Jan 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
77
1,698
0
26 Oct 2021
Contrastively Disentangled Sequential Variational Autoencoder
M. Kiener
Weiran Wang
Michael Gerndt
CoGe
DRL
19
40
0
22 Oct 2021
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
Xiaoyu Bie
Laurent Girin
Simon Leglaive
Thomas Hueber
Xavier Alameda-Pineda
18
12
0
11 Jun 2021
Learning Robust Latent Representations for Controllable Speech Synthesis
Shakti Kumar
Jithin Pradeep
Hussain Zaidi
DRL
28
6
0
10 May 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
14
235
0
02 Apr 2021
Interpretability and Explainability: A Machine Learning Zoo Mini-tour
Ricards Marcinkevics
Julia E. Vogt
XAI
18
119
0
03 Dec 2020
Mutual Information Based Method for Unsupervised Disentanglement of Video Representation
Aditya Sreekar
Ujjwal Tiwari
A. Namboodiri
DRL
15
4
0
17 Nov 2020
Unsupervised Learning of Disentangled Speech Content and Style Representation
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDL
DRL
17
15
0
24 Oct 2020
THIN: THrowable Information Networks and Application for Facial Expression Recognition In The Wild
Estèphe Arnaud
Arnaud Dapogny
Kévin Bailly
CVBM
12
23
0
15 Oct 2020
Dynamic Future Net: Diversified Human Motion Generation
Wenheng Chen
He-Nan Wang
Yi Yuan
Tianjia Shao
Kun Zhou
3DH
30
22
0
25 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
27
316
0
09 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
13
58
0
29 Jul 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations
Janek Ebbers
Michael Kuhlmann
Tobias Cord-Landwehr
Reinhold Haeb-Umbach
DRL
CoGe
SSL
23
4
0
26 May 2020
S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
Yizhe Zhu
Martin Renqiang Min
Asim Kadav
H. Graf
CoGe
DRL
19
95
0
23 May 2020
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
Mengyue Yang
Furui Liu
Zhitang Chen
Xinwei Shen
Jianye Hao
Jun Wang
OOD
CoGe
CML
41
44
0
18 Apr 2020
Variational inference formulation for a model-free simulation of a dynamical system with unknown parameters by a recurrent neural network
K. Yeo
D. E. C. Grullon
Fan-Keng Sun
Duane S. Boning
Jayant Kalagnanam
BDL
16
3
0
02 Mar 2020
Weakly-Supervised Disentanglement Without Compromises
Francesco Locatello
Ben Poole
Gunnar Rätsch
Bernhard Schölkopf
Olivier Bachem
Michael Tschannen
CoGe
OOD
DRL
178
313
0
07 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
26
81
0
02 Jan 2020
Enhancing VAEs for Collaborative Filtering: Flexible Priors & Gating Mechanisms
Daeryong Kim
B. Suh
14
50
0
03 Nov 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
15
173
0
23 Oct 2019
Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations
Jan Stühmer
Richard E. Turner
Sebastian Nowozin
DRL
BDL
CoGe
111
25
0
05 Sep 2019
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations
Mingda Chen
Qingming Tang
Sam Wiseman
Kevin Gimpel
DRL
23
76
0
02 Apr 2019
Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion
Seongkyu Mun
Suwon Shon
11
21
0
04 Dec 2018
Unsupervised learning with contrastive latent variable models
Kristen A. Severson
S. Ghosh
Kenney Ng
SSL
DRL
11
37
0
14 Nov 2018
A General Method for Amortizing Variational Filtering
Joseph Marino
Milan Cvitkovic
Yisong Yue
16
34
0
13 Nov 2018
A New Multi-vehicle Trajectory Generator to Simulate Vehicle-to-Vehicle Encounters
Wenhao Ding
Wenshuo Wang
Ding Zhao
22
13
0
15 Sep 2018
Hyperprior Induced Unsupervised Disentanglement of Latent Representations
Abdul Fatir Ansari
Harold Soh
CoGe
CML
UD
DRL
19
31
0
12 Sep 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
38
37
0
21 Jul 2018
Disentangling Controllable and Uncontrollable Factors of Variation by Interacting with the World
Yoshihide Sawada
DRL
16
10
0
19 Apr 2018
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
Ehsan Hosseini-Asl
Yingbo Zhou
Caiming Xiong
R. Socher
16
54
0
27 Mar 2018
Factorised spatial representation learning: application in semi-supervised myocardial segmentation
A. Chartsias
T. Joyce
G. Papanastasiou
S. Semple
M. Williams
D. Newby
R. Dharmakumar
Sotirios A. Tsaftaris
DRL
51
69
0
19 Mar 2018
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
233
2,545
0
25 Jan 2016
1