AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018
9 November 2018
Kou Tanaka
Hirokazu Kameoka
Takuhiro Kaneko
Nobukatsu Hojo

Papers citing "AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms"

50 / 53 papers shown
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024
Hyoung-Seok Oh
Sang-Hoon Lee
Deok-Hyun Cho
Seong-Whan Lee
501
1
0
16 Jan 2024
Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning
Engineering Applications of Artificial Intelligence (EAAI), 2022
Mohamadreza Jafaryani
H. Sheikhzadeh
V. Pourahmadi
164
4
0
08 Sep 2023
All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection
Daniele Mari
Davide Salvi
Paolo Bestagini
Simone Milani
94
5
0
28 Jul 2023
Rhythm Modeling for Voice Conversion
IEEE Signal Processing Letters (IEEE SPL), 2023
Benjamin van Niekerk
M. Carbonneau
Herman Kamper
204
9
0
12 Jul 2023
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era
Proceedings of the IEEE (Proc. IEEE), 2022
Andreas Triantafyllopoulos
Björn W. Schuller
Gökçe İymen
M. Sezgin
Xiangheng He
...
Shuo Liu
Silvan Mertes
Elisabeth André
Ruibo Fu
Jianhua Tao
254
84
0
06 Oct 2022
The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection
International Workshop on Information Forensics and Security (WIFS), 2022
Daniele Mari
Federica Latora
Simone Milani
89
12
0
06 Oct 2022
ConvNeXt Based Neural Network for Audio Anti-Spoofing
Qiaowei Ma
J. Zhong
Yitao Yang
Weiheng Liu
Yingbo Gao
W. W. Ng
AAML
346
5
0
14 Sep 2022
Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion
Jianchun Ma
Zhedong Zheng
Hao Fei
Feng Zheng
Tat-Seng Chua
Yi Yang
GAN
132
0
0
13 Jul 2022
Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE
International Conference on Artificial Intelligence for Industries (ICAII), 2022
Ziang Long
Yunling Zheng
Meng Yu
Jack Xin
DRL
103
6
0
30 Mar 2022
Noise-robust voice conversion with domain adversarial training
Neural Networks (NN), 2022
Hongqiang Du
Lei Xie
Haizhou Li
117
19
0
26 Jan 2022
Emotion Intensity and its Control for Emotional Voice Conversion
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
328
73
0
10 Jan 2022
Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
K. Akuzawa
Kotaro Onishi
Keisuke Takiguchi
Kohki Mametani
K. Mori
BDL
DRL
134
9
0
06 Dec 2021
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Benjamin van Niekerk
M. Carbonneau
Julian Zaïdi
Matthew Baas
Hugo Seuté
Herman Kamper
DRL
316
152
0
03 Nov 2021
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform
Interspeech, 2021
Youxuan Ma
Zongze Ren
Shugong Xu
120
44
0
12 Aug 2021
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations
L. Benaroya
Nicolas Obin
Axel Roebel
141
5
0
26 Jul 2021
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Interspeech, 2021
Yinghao Aaron Li
A. Zare
N. Mesgarani
201
119
0
21 Jul 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Interspeech, 2021
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
122
13
0
02 Jun 2021
Emotional Voice Conversion: Theory, Databases and ESD
Speech Communication (Speech Commun.), 2021
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
304
237
0
31 May 2021
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Hirokazu Kameoka
Kou Tanaka
Takuhiro Kaneko
161
23
0
14 Apr 2021
Non-autoregressive sequence-to-sequence voice conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Tomoki Hayashi
Wen-Chin Huang
Kazuhiro Kobayashi
Tomoki Toda
92
26
0
14 Apr 2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training
Interspeech, 2021
Kun Zhou
Berrak Sisman
Haizhou Li
202
34
0
31 Mar 2021
Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward
Momina Masood
M. Nawaz
K. Malik
A. Javed
Aun Irtaza
AAML
457
392
0
25 Feb 2021
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
121
70
0
25 Feb 2021
Axial Residual Networks for CycleGAN-based Voice Conversion
J. You
Gyuhyeon Nam
Dalhyun Kim
Gyeongsu Chae
152
3
0
16 Feb 2021
Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Interspeech, 2021
Jie Wang
Jingbei Li
Xintao Zhao
Zhiyong Wu
Shiyin Kang
Helen Meng
DRL
260
32
0
30 Jan 2021
Generative Adversarial Networks in Human Emotion Synthesis: A Review
IEEE Access, 2020
Noushin Hajarolasvadi
M. A. Ramírez
H. Demirel
GAN
230
26
0
28 Oct 2020
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yist Y. Lin
C. Chien
Jheng-hao Lin
Hung-yi Lee
Lin-Shan Lee
146
82
0
27 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
Tomoki Toda
BDL
223
45
0
23 Oct 2020
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
141
89
0
22 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Toda
136
9
0
09 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
226
24
0
06 Oct 2020
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Wen-Chin Huang
Tomoki Hayashi
Shinji Watanabe
Tomoki Toda
DRL
122
41
0
06 Oct 2020
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
Songxiang Liu
Yuewen Cao
Disong Wang
Xixin Wu
Xunying Liu
Helen Meng
BDL
222
114
0
06 Sep 2020
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer
Jing-Xuan Zhang
Li-Juan Liu
Yan-Nian Chen
Ya-Jun Hu
Yuan Jiang
Zhenhua Ling
Lirong Dai
117
19
0
03 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
230
22
0
27 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
355
387
0
09 Aug 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
309
45
0
07 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
141
6
0
05 Aug 2020
NAUTILUS: a Versatile Voice Cloning System
Hieu-Thi Luong
Junichi Yamagishi
230
57
0
22 May 2020
Many-to-Many Voice Transformer Network
Hirokazu Kameoka
Wen-Chin Huang
Kou Tanaka
Takuhiro Kaneko
Nobukatsu Hojo
Tomoki Toda
ViT
271
31
0
18 May 2020
Mel-spectrogram augmentation for sequence to sequence voice conversion
Yeongtae Hwang
Hyemin Cho
Hongsun Yang
Dong-Ok Won
Insoo Oh
Seong-Whan Lee
173
20
0
06 Jan 2020
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation
Yu-Tao Chang
Yuan-Hong Yang
Yu-Huai Peng
Syu-Siang Wang
T. Chi
Yu Tsao
Hsin-Min Wang
MoE
77
0
0
27 Dec 2019
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining
Interspeech, 2019
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
126
107
0
14 Dec 2019
Voice Conversion for Whispered Speech Synthesis
IEEE Signal Processing Letters (SPL), 2019
Marius Cotescu
Thomas Drugman
Goeric Huybrechts
Jaime Lorenzo-Trueba
Alexis Moinet
103
33
0
11 Dec 2019
CycleGAN Voice Conversion of Spectral Envelopes using Adversarial Weights
Rafael Ferro
Nicolas Obin
Axel Roebel
81
0
0
22 Oct 2019
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech
Automatic Speech Recognition & Understanding (ASRU), 2019
Hieu-Thi Luong
Junichi Yamagishi
130
18
0
14 Sep 2019
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion
Interspeech, 2019
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
200
163
0
29 Jul 2019
Hierarchical Sequence to Sequence Voice Conversion with Limited Data
P. Narayanan
Punarjay Chakravarty
F. Charette
G. Puskorius
96
3
0
15 Jul 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2019
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
182
106
0
25 Jun 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
Interspeech, 2019
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
DRL
164
13
0
02 May 2019