Title |
---|
![]() Investigating Neural Audio Codecs for Speech Language Model-Based Speech
Generation Jiaqi Li Dongmei Wang Xiaofei Wang Yao Qian Long Zhou ...Junkun Chen Sheng Zhao Jinyu Li Zhizheng Wu Michael Zeng |
![]() WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Shengpeng Ji Ziyue Jiang Xize Cheng Yifu Chen Minghui Fang ...Rongjie Huang Yidi Jiang Qian Chen Zhou Zhao Zhou Zhao |
![]() Does Current Deepfake Audio Detection Model Effectively Detect ALM-based
Deepfake Audio? Yuankun Xie Chenxu Xiong Xiaopeng Wang Zhiyong Wang Yi Lu ...Yukun Liu Zhengqi Wen Jianhua Tao Guanjun Li Long Ye |
![]() E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Sefik Emre Eskimez Xiaofei Wang Manthan Thakker Canrun Li Chung-Hsien Tsai ...Min Tang Xu Tan Yanqing Liu Sheng Zhao Naoyuki Kanda |