Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2005.08445
Cited By

Many-to-Many Voice Transformer Network

v1v2v3v4 (latest)

Many-to-Many Voice Transformer Network

18 May 2020

Hirokazu Kameoka

Takuhiro Kaneko

ArXiv (abs)PDF HTML

Papers citing "Many-to-Many Voice Transformer Network"

11 / 11 papers shown

HierSpeech++: Bridging the Gap between Semantic and Acoustic
Representation of Speech by Hierarchical Variational Inference for Zero-shot
Speech Synthesis

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech SynthesisIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

443

69

0

21 Nov 2023

Emotion Intensity and its Control for Emotional Voice Conversion

Emotion Intensity and its Control for Emotional Voice ConversionIEEE Transactions on Affective Computing (IEEE TAC), 2022

Björn W. Schuller

Haizhou Li

414

82

0

10 Jan 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language
Processing

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Rui Wang

...

451

266

0

14 Oct 2021

Style Equalization: Unsupervised Learning of Controllable Generative
Sequence Models

Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models

Jen-Hao Rick Chang

Xiaoshuai Zhang

302

20

0

06 Oct 2021

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion

A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice ConversionInterspeech (Interspeech), 2021

Kazuhiro Kobayashi

189

14

0

02 Jun 2021

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice
Conversion

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion

Hirokazu Kameoka

Takuhiro Kaneko

273

23

0

14 Apr 2021

Non-autoregressive sequence-to-sequence voice conversion

Non-autoregressive sequence-to-sequence voice conversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Kazuhiro Kobayashi

197

26

0

14 Apr 2021

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed
Langevin Dynamics

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics

Hirokazu Kameoka

Takuhiro Kaneko

352

26

0

06 Oct 2020

Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence
Modeling

Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence ModelingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

365

121

0

06 Sep 2020

Nonparallel Voice Conversion with Augmented Classifier Star Generative
Adversarial Networks

Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Hirokazu Kameoka

Takuhiro Kaneko

437

22

0

27 Aug 2020

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

Pretraining Techniques for Sequence-to-Sequence Voice ConversionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Hirokazu Kameoka

387

48

0

07 Aug 2020

Page 1 of 1