2506.21619
IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech
23 June 2025
Siyi Zhou
Yiquan Zhou
Yi He
Xun Zhou
Jinchao Wang
Wei Deng
Jingchen Shu
DiffM
Papers citing "IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech"
6 / 6 papers shown
UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models
Wenming Tu
Guanrou Yang
Ruiqi Yan
Wenxi Chen
Ziyang Ma
Yipeng Kang
Kai Yu
Xie Chen
Zilong Zheng
156
0
0
26 Oct 2025
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Yixuan Zhou
Guoyang Zeng
Xin Liu
Xiang Li
Renjie Yu
...
Weiyue Sun
Jiancheng Gui
Kehan Li
Z. Wu
Zhiyuan Liu
136
3
0
29 Sep 2025
Evaluating Bias in Spoken Dialogue LLMs for Real-World Decisions and Recommendations
Y. Wu
Tianrui Wang
Yizhou Peng
Yi-Wen Chao
Xuyi Zhuang
Xinsheng Wang
Shunshun Yin
Ziyang Ma
158
0
0
27 Sep 2025
Bridging the gap between training and inference in LM-based TTS models
Ruonan Zhang
Lingzhou Mu
Xixin Wu
Kai Zhang
145
0
0
21 Sep 2025
Vevo2: A Unified and Controllable Framework for Speech and Singing Voice Generation
Xueyao Zhang
Junan Zhang
Yuancheng Wang
Chaoren Wang
Yuanzhe Chen
Dongya Jia
Zhuo Chen
Zhizheng Wu
DiffM
240
6
0
22 Aug 2025
Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis
Yifan Yang
Shixuan Liu
Jiajian Li
Yuxuan Hu
Haibin Wu
...
Haiyang Sun
Yanqing Liu
Yan Lu
Kai Yu
Xie Chen
357
6
0
14 Apr 2025