2505.17093
v1
v2 (latest)
P2VA: Converting Persona Descriptions into Voice Attributes for Fair and Controllable Text-to-Speech
21 May 2025
Yejin Lee
Jaehoon Kang
Kyuhong Shim
Papers citing
"P2VA: Converting Persona Descriptions into Voice Attributes for Fair and Controllable Text-to-Speech"
21 / 21 papers shown
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ke Ji
Yixin Lian
Linxu Li
Jingsheng Gao
Weiyuan Li
Bin Dai
264
14
0
22 Mar 2025
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
ACM Multimedia (MM), 2024
Yixuan Zhou
Xiaoyu Qin
Zeyu Jin
Shuoyi Zhou
Shun Lei
Songtao Zhou
Zhiyong Wu
Jia Jia
AuLLM
292
22
0
28 Aug 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Tao Ge
Xin Chan
Dian Yu
Haitao Mi
Dong Yu
SyDa
573
273
0
28 Jun 2024
Controlling Emotion in Text-to-Speech with Natural Language Prompts
Thomas Bott
Florian Lux
Ngoc Thang Vu
295
13
0
10 Jun 2024
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
Yu-Min Tseng
Yu-Chao Huang
Teng-Yun Hsiao
Yu-Ching Hsu
Chao-Wei Huang
Jia-Yin Foo
Yun-Nung Chen
LLMAG
882
188
0
03 Jun 2024
Evaluating Large Language Model Biases in Persona-Steered Generation
Andy Liu
Mona Diab
Daniel Fried
206
65
0
30 May 2024
MBIAS: Mitigating Bias in Large Language Models While Retaining Context
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2024
Shaina Raza
Ananya Raval
Maximus Powers
432
21
0
18 May 2024
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
Daniel Lyth
Simon King
308
93
0
02 Feb 2024
MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis
Wenhao Guan
Yishuang Li
Tao Li
Hukai Huang
Feng Wang
Jiayan Lin
Lingyan Huang
Lin Li
Q. Hong
286
24
0
17 Dec 2023
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta
Vaishnavi Shrivastava
Ameet Deshpande
Ashwin Kalyan
Peter Clark
Ashish Sabharwal
Tushar Khot
465
170
0
08 Nov 2023
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents
Hyungjoo Chae
Yongho Song
Kai Tzu-iunn Ong
Taeyoon Kwon
Minjin Kim
Youngjae Yu
Dongha Lee
Luan Tuyen Chau
Jinyoung Yeo
LRM
236
56
0
13 Oct 2023
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Reo Shimizu
Ryuichi Yamamoto
Masaya Kawamura
Yuma Shirahata
Hironori Doi
Tatsuya Komatsu
Kentaro Tachibana
DiffM
348
43
0
15 Sep 2023
WHAT, WHEN, and HOW to Ground: Designing User Persona-Aware Conversational Agents for Engaging Dialogue
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
D. Kwon
Sunwoo Lee
Ki Hyun Kim
Seojin Lee
Tae-Yoon Kim
Eric Davis
373
13
0
06 Jun 2023
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongru Wang
Rui Wang
Fei Mi
Yang Deng
Zezhong Wang
Bin Liang
Ruifeng Xu
Kam-Fai Wong
LRM
234
83
0
19 May 2023
Accented Text-to-Speech Synthesis with Limited Data
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Xuehao Zhou
Mingyang Zhang
Yi Zhou
Zhizheng Wu
Haizhou Li
191
21
0
08 May 2023
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Dongchao Yang
Songxiang Liu
Rongjie Huang
Chao Weng
Helen Meng
DiffM
VLM
219
139
0
31 Jan 2023
PromptTTS: Controllable Text-to-Speech with Text Descriptions
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhifang Guo
Yichong Leng
Yihan Wu
Sheng Zhao
Xuejiao Tan
DiffM
200
158
0
22 Nov 2022
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Eric Michael Smith
Melissa Hall
Melanie Kambadur
Eleonora Presani
Adina Williams
316
174
0
18 May 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
Interspeech (Interspeech), 2022
Takaaki Saeki
Detai Xin
Wataru Nakata
Tomoki Koriyama
Shinnosuke Takamichi
Hiroshi Saruwatari
318
412
0
05 Apr 2022
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Zhiwen Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
665
911
0
12 Jun 2018
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang
Emily Dinan
Jack Urbanek
Arthur Szlam
Douwe Kiela
Jason Weston
509
1,613
0
22 Jan 2018