Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.10693
Cited By
DAVE: A Deep Audio-Visual Embedding for Dynamic Saliency Prediction
25 May 2019
Hamed R. Tavakoli
Ali Borji
Esa Rahtu
Juho Kannala
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DAVE: A Deep Audio-Visual Embedding for Dynamic Saliency Prediction"
8 / 8 papers shown
Title
DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction
Kiana Hoshanfar
Alireza Hosseini
Ahmad Kalhor
Babak Nadjar Araabi
124
0
0
14 Apr 2025
How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and Model
Yuxin Zhu
Huiyu Duan
Kaiwei Zhang
Yucheng Zhu
Xilei Zhu
Long Teng
Xiongkuo Min
Guangtao Zhai
72
2
0
10 Aug 2024
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
21
9
0
25 Oct 2023
NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos
Ziyuan Yang
Sucheng Ren
Zongwei Wu
Nanxuan Zhao
Junle Wang
Jing Qin
Shengfeng He
30
2
0
23 Aug 2023
ViDaS Video Depth-aware Saliency Network
Ioanna Di̇amanti̇
A. Tsiami
Petros Koutras
Petros Maragos
MDE
29
0
0
19 May 2023
Audio-Visual Collaborative Representation Learning for Dynamic Saliency Prediction
Hailong Ning
Bin Zhao
Zhanxuan Hu
Lang He
Ercheng Pei
27
10
0
17 Sep 2021
ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction
Samyak Jain
P. Yarlagadda
Shreyank Jyoti
Shyamgopal Karthik
Subramanian Ramanathan
Vineet Gandhi
ViT
29
65
0
11 Dec 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
21
73
0
09 Jan 2020
1