
Title |
|---|
SSDM: Scalable Speech Dysfluency Modeling. Neural Information Processing Systems (NeurIPS), 2024. Jiachen Lian, Xuanru Zhou, Z. Ezzes, Jet M J Vonk, Brittany Morin, D. Baquirin, Zachary Miller, M. G. Tempini, Gopala Anumanchipalli |
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. International Conference on Learning Representations (ICLR), 2024. Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, ..., Rongjie Huang, Yidi Jiang, Qian Chen, Zhou Zhao |
Articulatory Encodec: Coding Speech through Vocal Tract Kinematics. IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2024 |
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner. Neural Information Processing Systems (NeurIPS), 2024 |
Neural Codec-based Adversarial Sample Detection for Speaker Verification. Interspeech, 2024 |