ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.00976
  4. Cited By
Generative Pre-trained Speech Language Model with Efficient Hierarchical
  Transformer

Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer

3 June 2024
Yongxin Zhu
Jane Polak Scowcroft
Liqiang He
Linli Xu
Dong Yu
ArXiv (abs)PDFHTML

Papers citing "Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer"

8 / 8 papers shown
Title
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Issa Sugiura
Shuhei Kurita
Yusuke Oda
Ryuichiro Higashinaka
AuLLM
50
0
0
18 Sep 2025
FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
Yiqun Yao
Xiang Li
Xin Jiang
Xuezhi Fang
N. Yu
Wenjia Ma
Aixin Sun
Yequan Wang
AuLLM
86
1
0
02 Sep 2025
Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling
Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling
Simon Lepage
Jérémie Mary
David Picard
36
1
0
12 Aug 2025
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
Puyuan Peng
Shang-Wen Li
Abdelrahman Mohamed
David Harwath
135
1
0
26 May 2025
Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space
Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space
Zhengrui Ma
Yang Feng
Chenze Shao
Fandong Meng
Jie Zhou
Min Zhang
117
3
0
19 May 2025
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Yemin Shi
Yu Shu
Siwei Dong
Guangyi Liu
Jaward Sesay
Jingwen Li
Zhiting Hu
AuLLMVLM
184
2
0
05 May 2025
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Yongxin Zhu
Bing Li
Yifei Xin
Zhihua Xia
Linli Xu
250
31
0
04 Nov 2024
Recent Advances in Speech Language Models: A Survey
Recent Advances in Speech Language Models: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
341
56
0
01 Oct 2024
1