v1v2 (latest)

P2VA: Converting Persona Descriptions into Voice Attributes for Fair and Controllable Text-to-Speech

21 May 2025

Papers citing "P2VA: Converting Persona Descriptions into Voice Attributes for Fair and Controllable Text-to-Speech"

21 / 21 papers shown

Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

264

22 Mar 2025

VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language ModellingACM Multimedia (MM), 2024

Yixuan Zhou

Xiaoyu Qin

Zeyu Jin

Shuoyi Zhou

Shun Lei

Songtao Zhou

Zhiyong Wu

Jia Jia

AuLLM

292

28 Aug 2024

Scaling Synthetic Data Creation with 1,000,000,000 Personas

573

273

28 Jun 2024

Controlling Emotion in Text-to-Speech with Natural Language Prompts

Thomas Bott

Florian Lux

Ngoc Thang Vu

295

10 Jun 2024

Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization

Yu-Min Tseng

882

188

03 Jun 2024

Evaluating Large Language Model Biases in Persona-Steered Generation

Andy Liu

Mona Diab

Daniel Fried

206

30 May 2024

MBIAS: Mitigating Bias in Large Language Models While Retaining ContextWorkshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2024

Shaina Raza

Ananya Raval

Maximus Powers

432

18 May 2024

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Daniel Lyth

Simon King

308

02 Feb 2024

MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis

Wenhao Guan

Jiayan Lin

286

17 Dec 2023

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Shashank Gupta

Vaishnavi Shrivastava

465

170

08 Nov 2023

Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents

Jinyoung Yeo

236

13 Oct 2023

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language DescriptionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

348

15 Sep 2023

WHAT, WHEN, and HOW to Ground: Designing User Persona-Aware Conversational Agents for Engaging DialogueAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

373

06 Jun 2023

Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zezhong Wang

Ruifeng Xu

234

19 May 2023

Accented Text-to-Speech Synthesis with Limited DataIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Haizhou Li

191

08 May 2023

InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style PromptIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Dongchao Yang

Rongjie Huang

219

139

31 Jan 2023

PromptTTS: Controllable Text-to-Speech with Text DescriptionsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

200

158

22 Nov 2022

"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor DatasetConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

316

174

18 May 2022

UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022Interspeech (Interspeech), 2022

Hiroshi Saruwatari

318

412

05 Apr 2022

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

...

665

911

12 Jun 2018

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Douwe Kiela

Jason Weston

509

1,613

22 Jan 2018