Learning to Highlight Audio by Watching Movies. Computer Vision and Pattern Recognition (CVPR), 2025
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing. Computer Vision and Pattern Recognition (CVPR), 2025
Aligned Better, Listen Better for Audio-Visual Large Language Models. International Conference on Learning Representations (ICLR), 2025