Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.05242
Cited By
Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization
11 October 2022
Yuanyuan Jiang
Jianqin Yin
Yonghao Dang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization"
3 / 3 papers shown
Title
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering
Yuanyuan Jiang
Jianqin Yin
38
1
0
13 May 2024
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Shijian Deng
Erin E. Kosloski
Siddhi Patel
Zeke A. Barnett
Yiyang Nan
...
William T. Doan
Matthew Wang
Harsh Singh
P. Rollins
Yapeng Tian
26
4
0
22 Mar 2024
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,170
0
01 Sep 2014
1