Title | Authors |
---|---|
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition | Guinan Li, Jiajun Deng, Youjun Chen, Mengzhe Geng, Shujie Hu, ..., Zengrui Jin, Tianzi Wang, Xurong Xie, Helen Meng, Xunying Liu |
On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, ..., William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe |
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask | Tianzi Wang, Xurong Xie, Zhaoqing Li, Shoukang Hu, Zengrui Jin, ..., Shujie Hu, Mengzhe Geng, Guinan Li, Helen Meng, Xunying Liu |
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment | Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei |
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units | Xuankai Chang, Jiatong Shi, Jinchuan Tian, Yuning Wu, Yuxun Tang, Yihan Wu, Shinji Watanabe, Yossi Adi, Xie Chen, Qin Jin |