Title | Authors |
---|---|
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation | Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, ..., Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu |
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, ..., Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou |
Towards Robust Speech Representation Learning for Thousands of Languages | William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe |
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, ..., Jie Fu, Tao Gui, Tianxiang Sun, Yugang Jiang, Xipeng Qiu |