Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.00976
Cited By
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer
3 June 2024
Yongxin Zhu
Jane Polak Scowcroft
Liqiang He
Linli Xu
Dong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer"
8 / 8 papers shown
Title
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Issa Sugiura
Shuhei Kurita
Yusuke Oda
Ryuichiro Higashinaka
AuLLM
50
0
0
18 Sep 2025
FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
Yiqun Yao
Xiang Li
Xin Jiang
Xuezhi Fang
N. Yu
Wenjia Ma
Aixin Sun
Yequan Wang
AuLLM
86
1
0
02 Sep 2025
Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling
Simon Lepage
Jérémie Mary
David Picard
36
1
0
12 Aug 2025
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
Puyuan Peng
Shang-Wen Li
Abdelrahman Mohamed
David Harwath
135
1
0
26 May 2025
Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space
Zhengrui Ma
Yang Feng
Chenze Shao
Fandong Meng
Jie Zhou
Min Zhang
117
3
0
19 May 2025
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Yemin Shi
Yu Shu
Siwei Dong
Guangyi Liu
Jaward Sesay
Jingwen Li
Zhiting Hu
AuLLM
VLM
184
2
0
05 May 2025
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Yongxin Zhu
Bing Li
Yifei Xin
Zhihua Xia
Linli Xu
250
31
0
04 Nov 2024
Recent Advances in Speech Language Models: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
341
56
0
01 Oct 2024
1