EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart GlassesIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025 |
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound GenerationInternational Conference on Learning Representations (ICLR), 2024 |
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley SoundIEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024 |