Title |
---|
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models. Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kaiwei Chang, Jiawei Du, ..., Yi-Chiao Wu, Xu Tan, James Glass, Shinji Watanabe, Hung-yi Lee |
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, ..., Rongjie Huang, Yidi Jiang, Qian Chen, Zhou Zhao |
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, ..., Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng |
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. Shengpeng Ji, Jia-li Zuo, Minghui Fang, Siqi Zheng, Qian Chen, ..., Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao |