Title |
---|
![]() MIO: A Foundation Model on Multimodal Tokens Zekun Wang King Zhu Chunpu Xu Wangchunshu Zhou Jiaheng Liu ...Yuanxing Zhang Ge Zhang Ke Xu Jie Fu Wenhao Huang |
![]() Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec
models Haibin Wu Xuanjun Chen Yi-Cheng Lin Kaiwei Chang Jiawei Du ...Yi-Chiao Wu Xu Tan James Glass Shinji Watanabe Hung-yi Lee |
![]() SpoofCeleb: Speech Deepfake Detection and SASV In The Wild Jee-weon Jung Yihan Wu Xin Wang Ji-Hoon Kim Soumi Maiti ...Joon Son Chung Wangyou Zhang Seyun Um Shinnosuke Takamichi Shinji Watanabe |
![]() Text-To-Speech Synthesis In The Wild Jee-weon Jung Wangyou Zhang Soumi Maiti Yihan Wu Xin Wang ...Hye-jin Shim Nicholas W. D. Evans Joon Son Chung Shinnosuke Takamichi Shinji Watanabe |
![]() Investigating Neural Audio Codecs for Speech Language Model-Based Speech
Generation Jiaqi Li Dongmei Wang Xiaofei Wang Yao Qian Long Zhou ...Junkun Chen Sheng Zhao Jinyu Li Zhizheng Wu Michael Zeng |