Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2406.00976
Cited By
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer
3 June 2024
Yongxin Zhu
Jane Polak Scowcroft
Liqiang He
Linli Xu
Dong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer"
4 / 4 papers shown
Title
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
Puyuan Peng
Shang-Wen Li
Abdelrahman Mohamed
David Harwath
87
0
0
26 May 2025
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Yemin Shi
Yu Shu
Siwei Dong
Guangyi Liu
Jaward Sesay
Jingwen Li
Zhiting Hu
AuLLM
VLM
124
1
0
05 May 2025
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Yongxin Zhu
Bing Li
Yifei Xin
Zhihua Xia
Linli Xu
166
23
0
04 Nov 2024
Recent Advances in Speech Language Models: A Survey
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
285
44
0
01 Oct 2024
1