Scalable Factorized Hierarchical Variational Autoencoder Training
arXiv:1804.03201, v2 (latest)

9 April 2018
Wei-Ning Hsu
James R. Glass
    BDL

Papers citing "Scalable Factorized Hierarchical Variational Autoencoder Training"

15 / 15 papers shown
Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Jianing Yang
Sheng Li
Takahiro Shinozaki
Yuki Saito
Hiroshi Saruwatari
122
0
0
02 Oct 2025
Half-AVAE: Adversarial-Enhanced Factorized and Structured Encoder-Free VAE for Underdetermined Independent Component Analysis
Yuan-Hao Wei
Yan-Jie Sun
289
2
0
08 Jun 2025
Sequential Representation Learning via Static-Dynamic Conditional Disentanglement
European Conference on Computer Vision (ECCV), 2024
Mathieu Cyrille Simon
Pascal Frossard
Christophe De Vleeschouwer
CoGe, CML
242
4
0
10 Aug 2024
Weak-Supervised Dysarthria-invariant Features for Spoken Language Understanding using an FHVAE and Adversarial Training
Spoken Language Technology Workshop (SLT), 2022
Jinzi Qi
Hugo Van hamme
AAML
192
1
0
24 Oct 2022
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Lies Bollens
T. Francart
Hugo Van hamme
DRL, BDL
263
19
0
01 Jul 2022
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Hu Hu
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Chin-Hui Lee
BDL
225
7
0
16 Oct 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
226
74
0
31 Dec 2020
Disentangled Dynamic Graph Deep Generation
Wenbin Zhang
Liming Zhang
Dieter Pfoser
Bo Pan
277
46
0
14 Oct 2020
Generating Handwriting via Decoupled Style Descriptors
European Conference on Computer Vision (ECCV), 2020
Atsunobu Kotani
Stefanie Tellex
James Tompkin
321
35
0
26 Aug 2020
Solving inverse problems using conditional invertible neural networks
Journal of Computational Physics (JCP), 2020
G. A. Padmanabha
N. Zabaras
AI4CE
324
75
0
31 Jul 2020
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
International Conference on Learning Representations (ICLR), 2019
David Harwath
Wei-Ning Hsu
James R. Glass
275
88
0
21 Nov 2019
Contextual Joint Factor Acoustic Embeddings
Spoken Language Technology Workshop (SLT), 2019
Yanpei Shi
Thomas Hain
165
3
0
16 Oct 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Interspeech, 2019
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
167
32
0
09 Jul 2019
Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion
Seongkyu Mun
Suwon Shon
137
22
0
04 Dec 2018
Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
Wei-Ning Hsu
Hao Tang
James R. Glass
SSL
168
22
0
13 Jun 2018