Title |
---|
![]() SSDM: Scalable Speech Dysfluency Modeling Jiachen Lian Xuanru Zhou Z. Ezzes Jet M J Vonk Brittany Morin D. Baquirin Zachary Mille M. G. Tempini Gopala Anumanchipalli |
![]() WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Shengpeng Ji Ziyue Jiang Xize Cheng Yifu Chen Minghui Fang ...Rongjie Huang Yidi Jiang Qian Chen Zhou Zhao Zhou Zhao |
![]() NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech
Processing Tasks He Huang Taejin Park Kunal Dhawan Ivan Medennikov Krishna C. Puvvada Nithin Rao Koluguri Weiqing Wang Jagadeesh Balam Boris Ginsburg |
![]() BUT Systems and Analyses for the ASVspoof 5 Challenge Johan Rohdin Lin Zhang Oldřich Plchot Vojtěch Staněk David Mihola ...Themos Stafylakis Dmitriy Beveraki Anna Silnova Jan Brukner Lukáš Burget |
![]() MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and
Disentangled Multi-Modality Fusion Chencan Fu Yabiao Wang Jiangning Zhang Zhengkai Jiang Xiaofeng Mao Jiafu Wu Weijian Cao Chengjie Wang Yanhao Ge Yong Liu |