Scalable Factorized Hierarchical Variational Autoencoder Training

9 April 2018

Papers citing "Scalable Factorized Hierarchical Variational Autoencoder Training"

Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement

122

02 Oct 2025

Half-AVAE: Adversarial-Enhanced Factorized and Structured Encoder-Free VAE for Underdetermined Independent Component Analysis

Yuan-Hao Wei

Yan-Jie Sun

289

08 Jun 2025

Sequential Representation Learning via Static-Dynamic Conditional Disentanglement

European Conference on Computer Vision (ECCV), 2024

Mathieu Cyrille Simon

Pascal Frossard

Christophe De Vleeschouwer

CoGe CML

242

10 Aug 2024

Weak-Supervised Dysarthria-invariant Features for Spoken Language Understanding using an FHVAE and Adversarial Training

Spoken Language Technology Workshop (SLT), 2022

Jinzi Qi

Hugo Van hamme

AAML

192

24 Oct 2022

Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

263

01 Jul 2022

A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer

Hu Hu

Sabato Marco Siniscalchi

Chao-Han Huck Yang

Chin-Hui Lee

BDL

225

16 Oct 2021

Text-Free Image-to-Speech Synthesis Using Learned Segmental Units

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

226

31 Dec 2020

Disentangled Dynamic Graph Deep Generation

Wenbin Zhang

Liming Zhang

Dieter Pfoser

Bo Pan

277

14 Oct 2020

Generating Handwriting via Decoupled Style Descriptors

European Conference on Computer Vision (ECCV), 2020

Atsunobu Kotani

Stefanie Tellex

James Tompkin

321

26 Aug 2020

Solving inverse problems using conditional invertible neural networks

Journal of Computational Physics (JCP), 2020

G. A. Padmanabha

N. Zabaras

AI4CE

324

31 Jul 2020

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

International Conference on Learning Representations (ICLR), 2019

David Harwath

Wei-Ning Hsu

James R. Glass

275

21 Nov 2019

Contextual Joint Factor Acoustic Embeddings

Spoken Language Technology Workshop (SLT), 2019

Yanpei Shi

Thomas Hain

165

16 Oct 2019

Transfer Learning from Audio-Visual Grounding to Speech Recognition

Interspeech, 2019

167

09 Jul 2019

Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion

Seongkyu Mun

Suwon Shon

137

04 Dec 2018

Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition

Wei-Ning Hsu

Hao Tang

James R. Glass

SSL

168

13 Jun 2018