2312.01479
OpenVoice: Versatile Instant Voice Cloning
3 December 2023
Zengyi Qin
Wenliang Zhao
Xumin Yu
Xin Sun
VLM
Papers citing
"OpenVoice: Versatile Instant Voice Cloning"
16 / 16 papers shown
Title
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
Jianping Jiang
Weiye Xiao
Zhengyu Lin
H. Zhang
Tianxiang Ren
Yang Gao
Zhiqian Lin
Zhongang Cai
Lei Yang
Ziwei Liu
81
3
0
29 Nov 2024
Zero-shot Voice Conversion with Diffusion Transformers
Songting Liu
37
2
0
15 Nov 2024
Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model
Ben Koska
Mojmír Horváth
MoE
27
1
0
08 Nov 2024
DMTG: A Human-Like Mouse Trajectory Generation Bot Based on Entropy-Controlled Diffusion Networks
Jiahua Liu
Zeyuan Cui
Wenhan Ge
Pengxiang Zhan
AAML
15
0
0
23 Oct 2024
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample
Zhiyong Chen
Zhiqi Ai
Xinnuo Li
Shugong Xu
33
0
0
24 Sep 2024
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis
Zhiyong Chen
Xinnuo Li
Zhiqi Ai
Shugong Xu
DiffM
34
1
0
24 Sep 2024
DiffSSD: A Diffusion-Based Dataset For Speech Forensics
Kratika Bhagtani
Amit Kumar Singh Yadav
Paolo Bestagini
Edward J. Delp
DiffM
18
1
0
19 Sep 2024
SpMis: An Investigation of Synthetic Spoken Misinformation Detection
Peizhuo Liu
Li Wang
Renqiang He
Haorui He
Lei Wang
Huadi Zheng
Jie Shi
Tong Xiao
Zhizheng Wu
27
1
0
17 Sep 2024
VoiceWukong: Benchmarking Deepfake Voice Detection
Ziwei Yan
Yanjie Zhao
Haoyu Wang
27
1
0
10 Sep 2024
Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems
Daniel Platnick
Bishoy Abdelnour
Eamon Earl
Rahul Kumar
Zahra Rezaei
Thomas Tsangaris
Faraj Lagum
23
0
0
18 Jul 2024
TTSDS -- Text-to-Speech Distribution Score
Christoph Minixhofer
Ondřej Klejch
Peter Bell
26
0
0
17 Jul 2024
SecureSpectra: Safeguarding Digital Identity from Deep Fake Threats via Intelligent Signatures
Oguzhan Baser
Kaan Kale
Sandeep P. Chinchali
19
0
0
01 Jul 2024
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective
Andrew Shin
Yusuke Mori
Kunitake Kaneko
VGen
EGVM
16
2
0
13 May 2024
Certification of Speaker Recognition Models to Additive Perturbations
Dmitrii Korzh
Elvir Karimov
Mikhail Aleksandrovich Pautov
Oleg Y. Rogov
Ivan V. Oseledets
36
1
0
29 Apr 2024
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Jingyi Li
Weiping Tu
Li Xiao
41
96
0
27 Oct 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
171
377
0
04 Dec 2021