Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2003.00832
Cited By
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
AAAI Conference on Artificial Intelligence (AAAI), 2020
12 February 2020
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
Re-assign community
ArXiv (abs)
PDF
HTML
Github (78★)
Papers citing
"An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos"
25 / 25 papers shown
A Multimodal Neural Network for Recognizing Subjective Self-Disclosure Towards Social Robots
Henry Powell
Guy Laban
Emily S. Cross
206
0
0
14 Aug 2025
eMotions: A Large-Scale Dataset and Audio-Visual Fusion Network for Emotion Analysis in Short-form Videos
Xuecheng Wu
Dingkang Yang
Danlei Huang
Xinyi Yin
Yifan Wang
...
Liangyu Fu
Yang Liu
Junxiao Xue
Hadi Amirpour
Wei Zhou
212
1
0
09 Aug 2025
Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion
Jun-chen Yu
Yang Zheng
Lei Wang
Yongqi Wang
Shengfan Xu
CVBM
639
0
0
15 Mar 2025
Smile upon the Face but Sadness in the Eyes: Emotion Recognition based on Facial Expressions and Eye Behaviors
Yuanyuan Liu
Lin Wei
Kejun Liu
Yibing Zhan
Zijing Chen
Zhe Chen
Shiguang Shan
347
3
0
08 Nov 2024
VEMOCLAP: A video emotion classification web application
IEEE International Symposium on Multimedia (ISM), 2024
Serkan Sulun
Paula Viana
M. Davies
VLM
297
1
0
22 Oct 2024
Dual-path Collaborative Generation Network for Emotional Video Captioning
ACM Multimedia (MM), 2024
Cheng Ye
Weidong Chen
Jingyu Li
Li Zhang
Zhendong Mao
358
16
0
06 Aug 2024
Benchmarking Micro-action Recognition: Dataset, Methods, and Applications
Dan Guo
Kun Li
Bin Hu
Yan Zhang
Meng Wang
447
116
0
08 Mar 2024
Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics
International Conference on Neural Information Processing (ICONIP), 2024
Pengcheng Zhao
Yanxiang Chen
Yang Zhao
Wei Jia
Zhao Zhang
Ronggang Wang
Richang Hong
DiffM
234
1
0
24 Jan 2024
Affective Video Content Analysis: Decade Review and New Perspectives
Big Data Mining and Analytics (BDMA), 2023
Junxiao Xue
Jie Wang
Xuecheng Wu
Qian Zhang
350
6
0
26 Oct 2023
MACP: Efficient Model Adaptation for Cooperative Perception
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Yunsheng Ma
Juanwu Lu
Can Cui
Sicheng Zhao
Xu Cao
Wenqian Ye
Ziran Wang
322
24
0
25 Oct 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
Proceedings of the IEEE (Proc. IEEE), 2023
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
404
60
0
25 Jul 2023
CORAE: A Tool for Intuitive and Continuous Retrospective Evaluation of Interactions
Affective Computing and Intelligent Interaction (ACII), 2023
Michael J. Sack
Maria Teresa Parreira
Jenny Xiyu Fu
Asher Lipman
Hifza Javed
Nawid Jamali
Malte Jung
152
4
0
29 Jun 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Lingxi Li
Kyungtae Han
Rohit Gupta
Ziran Wang
196
19
0
13 May 2023
Noise-Resistant Multimodal Transformer for Emotion Recognition
International Journal of Computer Vision (IJCV), 2023
Y. Liu
Haoyu Zhang
Yibing Zhan
Zijing Chen
Guanghao Yin
Lin Wei
Zhe Chen
ViT
219
24
0
04 May 2023
ViT-DD: Multi-Task Vision Transformer for Semi-Supervised Driver Distraction Detection
Yunsheng Ma
Ziran Wang
ViT
414
25
0
19 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
IEEE Transactions on Biometrics Behavior and Identity Science (TBBIS), 2022
R Gnana Praveen
Mohammadhadi Shateri
P. Cardinal
CVBM
198
55
0
19 Sep 2022
Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yi Zhang
Mingyuan Chen
Jundong Shen
Chongjun Wang
246
91
0
15 Jan 2022
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
271
24
0
14 Dec 2021
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2021
R Gnana Praveen
Mohammadhadi Shateri
P. Cardinal
CVBM
244
53
0
09 Nov 2021
Affective Image Content Analysis: Two Decades Review and New Perspectives
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sicheng Zhao
Xingxu Yao
Jufeng Yang
G. Jia
Guiguang Ding
Tat-Seng Chua
Björn W. Schuller
Kurt Keutzer
3DV
210
156
0
30 Jun 2021
Computational Emotion Analysis From Images: Recent Advances and Future Directions
Sicheng Zhao
Quanwei Huang
Youbao Tang
Xingxu Yao
Jufeng Yang
Guiguang Ding
Björn W. Schuller
230
21
0
19 Mar 2021
Privacy-Preserving Video Classification with Convolutional Neural Networks
IACR Cryptology ePrint Archive (IACR ePrint), 2021
Sikha Pentyala
Rafael Dowsley
Martine De Cock
PICV
292
25
0
06 Feb 2021
Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources
The Web Conference (WWW), 2020
Sicheng Zhao
Yang Xiao
Jiang Guo
Xiangyu Yue
Jufeng Yang
Ravi Krishna
Pengfei Xu
Kurt Keutzer
375
19
0
17 Nov 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2020
Amirhossein Hajavi
Ali Etemad
235
2
0
03 Sep 2020
Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space
Sicheng Zhao
Yaxian Li
Xingxu Yao
Weizhi Nie
Pengfei Xu
Jufeng Yang
Kurt Keutzer
275
37
0
22 Aug 2020
1
Page 1 of 1