ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.09420
  4. Cited By
Improving Zero-shot Voice Style Transfer via Disentangled Representation
  Learning

Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning

17 March 2021
Siyang Yuan
Pengyu Cheng
Ruiyi Zhang
Weituo Hao
Zhe Gan
Lawrence Carin
    DRL
ArXiv (abs)PDFHTML

Papers citing "Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning"

36 / 36 papers shown
Title
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
Taejun Bak
Youngsik Eom
SeungJae Choi
Young-Sun Joo
49
1
0
04 Oct 2024
Zero-shot Cross-lingual Voice Transfer for TTS
Zero-shot Cross-lingual Voice Transfer for TTS
Fadi Biadsy
Youzheng Chen
Isaac Elias
Kyle Kastner
Gary Wang
Andrew Rosenberg
Bhuvana Ramabhadran
75
1
0
20 Sep 2024
Operational Latent Spaces
Operational Latent Spaces
Scott H. Hawley
Austin R. Tackett
61
0
0
04 Jun 2024
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot
  Voice Conversion
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion
Pengcheng Li
Jianzong Wang
Xulong Zhang
Yong Zhang
Jing Xiao
Ning Cheng
DRL
77
2
0
02 May 2024
Voice Attribute Editing with Text Prompt
Voice Attribute Editing with Text Prompt
Zheng-Yan Sheng
Yang Ai
Li-Juan Liu
Jia Pan
Zhenhua Ling
61
5
0
13 Apr 2024
Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain
  Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery
Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery
Jingru Zhu
Ya Guo
Geng Sun
Liang Hong
Jie Chen
74
3
0
06 Mar 2024
Learning Disentangled Speech Representations with Contrastive Learning
  and Time-Invariant Retrieval
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval
Yimin Deng
Huaizhen Tang
Xulong Zhang
Ning Cheng
Jing Xiao
Jianzong Wang
DRL
82
1
0
16 Jan 2024
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control
  and Contrastive Learning with Negative Samples Augmentation
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation
Yimin Deng
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
77
3
0
15 Nov 2023
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice
  Alignment
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment
Zheng-Yan Sheng
Yang Ai
Yan-Nian Chen
Zhenhua Ling
CVBM
53
4
0
18 Sep 2023
Learning Disentangled Representation with Mutual Information
  Maximization for Real-Time UAV Tracking
Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking
Xucheng Wang
Xiangyang Yang
Hengzhou Ye
Shuiwang Li
71
6
0
20 Aug 2023
DisCover: Disentangled Music Representation Learning for Cover Song
  Identification
DisCover: Disentangled Music Representation Learning for Cover Song Identification
Jiahao Xun
Shengyu Zhang
Yanting Yang
Jieming Zhu
Liqun Deng
Zhou Zhao
Zhenhua Dong
Ruiqi Li
Lichao Zhang
Leilei Gan
AAMLDRL
46
5
0
19 Jul 2023
SLMGAN: Exploiting Speech Language Model Representations for
  Unsupervised Zero-Shot Voice Conversion in GANs
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
Yinghao Aaron Li
Cong Han
N. Mesgarani
65
5
0
18 Jul 2023
Diversifying Joint Vision-Language Tokenization Learning
Diversifying Joint Vision-Language Tokenization Learning
Vardaan Pahuja
A. Piergiovanni
A. Angelova
74
0
0
06 Jun 2023
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain
Liqiang Jing
Xuemeng Song
Xuming Lin
Zhongzhou Zhao
Wei Zhou
Liqiang Nie
91
17
0
05 May 2023
Leveraging Neural Representations for Audio Manipulation
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
65
2
0
10 Apr 2023
Speaking Style Conversion in the Waveform Domain Using Discrete
  Self-Supervised Units
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units
Gallil Maimon
Yossi Adi
104
14
0
19 Dec 2022
Replacing Language Model for Style Transfer
Replacing Language Model for Style Transfer
Peng Cheng
Rui Li
KELM
72
3
0
14 Nov 2022
Disentangled Speech Representation Learning for One-Shot Cross-lingual
  Voice Conversion Using $β$-VAE
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using βββ-VAE
Hui Lu
Disong Wang
Xixin Wu
Zhiyong Wu
Xunying Liu
Helen M. Meng
DRL
117
10
0
25 Oct 2022
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Alon Levkovitch
Eliya Nachmani
Lior Wolf
DiffM
78
29
0
05 Jun 2022
End-to-End Zero-Shot Voice Conversion with Location-Variable
  Convolutions
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
Wonjune Kang
M. Hasegawa-Johnson
D. Roy
82
8
0
19 May 2022
Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts
Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts
Paige Tuttosi
Emma Hughson
Akihiro Matsufuji
Angelica Lim
69
4
0
10 May 2022
So Different Yet So Alike! Constrained Unsupervised Text Style Transfer
So Different Yet So Alike! Constrained Unsupervised Text Style Transfer
Abhinav Ramesh Kashyap
Devamanyu Hazarika
Min-Yen Kan
Roger Zimmermann
Soujanya Poria
GAN
83
14
0
09 May 2022
Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention
  VAE
Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE
Ziang Long
Yunling Zheng
Meng Yu
Jack Xin
DRL
63
5
0
30 Mar 2022
Variational Autoencoder with Disentanglement Priors for Low-Resource
  Task-Specific Natural Language Generation
Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language Generation
Zhuang Li
Zhuang Li
Xingliang Yuan
Tongtong Wu
Tianyang Zhan
Gholamreza Haffari
CoGeUDDRL
107
4
0
27 Feb 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier
Learning the Beauty in Songs: Neural Singing Voice Beautifier
Jinglin Liu
Chengxi Li
Yi Ren
Zhiying Zhu
Zhou Zhao
DiffM
94
17
0
27 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level
  Bipartite Graph
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
114
13
0
24 Feb 2022
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised
  Learning
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning
Qiqi Wang
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DRL
116
23
0
22 Feb 2022
Tubes Among Us: Analog Attack on Automatic Speaker Identification
Tubes Among Us: Analog Attack on Automatic Speaker Identification
Shimaa Ahmed
Yash R. Wani
Ali Shahin Shamsabadi
Mohammad Yaghini
Ilia Shumailov
Nicolas Papernot
Kassem Fawaz
AAML
62
4
0
06 Feb 2022
Semantic Feature Extraction for Generalized Zero-shot Learning
Semantic Feature Extraction for Generalized Zero-shot Learning
Junhan Kim
Kyuhong Shim
B. Shim
VLM
69
33
0
29 Dec 2021
Training Robust Zero-Shot Voice Conversion Models with Self-supervised
  Features
Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features
Trung D. Q. Dang
Dung T. Tran
Peter Chin
K. Koishida
SSL
69
15
0
08 Dec 2021
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System
  for Both Human Beings and Machines
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines
Haozhe Zhang
Zexin Cai
Xiaoyi Qin
Ming Li
91
15
0
06 Nov 2021
CycleFlow: Purify Information Factors by Cycle Loss
CycleFlow: Purify Information Factors by Cycle Loss
Haoran Sun
Chen Chen
Lantian Li
Dong Wang
65
1
0
18 Oct 2021
Beyond Voice Identity Conversion: Manipulating Voice Attributes by
  Adversarial Learning of Structured Disentangled Representations
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations
L. Benaroya
Nicolas Obin
Axel Roebel
42
5
0
26 Jul 2021
Many-to-Many Voice Conversion based Feature Disentanglement using
  Variational Autoencoder
Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder
Manh Luong
Viet-Anh Tran
DRL
48
16
0
11 Jul 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised
  Speech Representation Disentanglement for One-shot Voice Conversion
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
Helen Meng
DRL
84
141
0
18 Jun 2021
Adversarially learning disentangled speech representations for robust
  multi-factor voice conversion
Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Jie Wang
Jingbei Li
Xintao Zhao
Zhiyong Wu
Shiyin Kang
Helen Meng
DRL
123
29
0
30 Jan 2021
1