
Title |
|---|
3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation. International Conference on 3D Vision (3DV), 2025 |
Text-Queried Audio Source Separation via Hierarchical Modeling. IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025 |
Audio Texture Manipulation by Exemplar-Based Analogy. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 |
Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 |
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models. Computer Vision and Pattern Recognition (CVPR), 2024 |
Self-Supervised Audio-Visual Soundscape Stylization. European Conference on Computer Vision (ECCV), 2024 |
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models. IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024 |
Listen, Chat, and Remix: Text-Guided Soundscape Remixing for Enhanced Auditory Experience. IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2024 |
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. Dongchao Yang, Jinchuan Tian, Xuejiao Tan, Rongjie Huang, Songxiang Liu, ... Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen M. Meng |
Vision-Infused Deep Audio Inpainting. IEEE International Conference on Computer Vision (ICCV), 2019 |