Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.03179
Cited By
v1
v2
v3 (latest)
UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
4 April 2024
Tiantian Geng
Teng Wang
Jinming Duan
Yanfu Zhang
Weili Guan
Feng Zheng
Ling Shao
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"UniAV: Unified Audio-Visual Perception for Multi-Task Video Event Localization"
5 / 5 papers shown
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios
Lu Zhu
Tiantian Geng
Yangye Chen
Teng Wang
Ping Lu
Feng Zheng
AI4TS
261
0
0
21 Nov 2025
ESG-Net: Event-Aware Semantic Guided Network for Dense Audio-Visual Event Localization
Huilai Li
Yonghao Dang
Ying Xing
Yiming Wang
Jianqin Yin
183
0
0
14 Jul 2025
PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling
X. Yu
Yan Fang
Xiaojie Jin
Yao Zhao
Yunchao Wei
285
1
0
29 May 2025
Self-supervised Transformation Learning for Equivariant Representations
Neural Information Processing Systems (NeurIPS), 2025
Jaemyung Yu
Jaehyun Choi
Dong-Jae Lee
H. Hong
Junmo Kim
283
0
0
15 Jan 2025
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
568
8
0
12 Sep 2024
1