Title |
---|
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation. Zhehuai Chen, He Huang, A. Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Chun Lok Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg |
Low-latency Speech Enhancement via Speech Token Generation. Huaying Xue, Xiulian Peng, Yan Lu |
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT. Zhihao Du, Jiaming Wang, Qian Chen, Yunfei Chu, Zhifu Gao, ... Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang |
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. Dongchao Yang, Jinchuan Tian, Xuejiao Tan, Rongjie Huang, Songxiang Liu, ... Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen M. Meng |
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. Shunwei Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, ... Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen M. Meng |