A Pre-trained Audio-Visual Transformer for Emotion Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
23 January 2022
Minh Tran
M. Soleymani

Papers citing "A Pre-trained Audio-Visual Transformer for Emotion Recognition"

9 / 9 papers shown
eMotions: A Large-Scale Dataset and Audio-Visual Fusion Network for Emotion Analysis in Short-form Videos
Xuecheng Wu
Dingkang Yang
Danlei Huang
Xinyi Yin
Yifan Wang
...
Liangyu Fu
Yang Liu
Junxiao Xue
Hadi Amirpour
Wei Zhou
212
1
0
09 Aug 2025
MultiMAE-DER: Multimodal Masked Autoencoder for Dynamic Emotion Recognition
Peihao Xiang
Chaohao Lin
Kaida Wu
Ou Bai
251
9
0
28 Apr 2024
Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition
R Gnana Praveen
Jahangir Alam
445
41
0
20 Mar 2024
Joint Multimodal Transformer for Emotion Recognition in the Wild
Paul Waligora
Haseeb Aslam
Osama Zeeshan
Soufiane Belharbi
A. L. Koerich
M. Pedersoli
Simon L Bacon
Eric Granger
352
27
0
15 Mar 2024
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Information Fusion (Inf. Fusion), 2024
Guoying Zhao
Zheng Lian
Yinan Han
Jianhua Tao
333
77
0
11 Jan 2024
SVFAP: Self-supervised Video Facial Affect Perceiver
IEEE Transactions on Affective Computing (TAC), 2023
Guoying Zhao
Zheng Lian
Kexin Wang
Yu He
Ming Xu
Haiyang Sun
Yinan Han
Jianhua Tao
207
30
0
31 Dec 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
ACM Multimedia (ACM MM), 2023
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
264
85
0
05 Jul 2023
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Computer Vision and Image Understanding (CVIU), 2023
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
632
15
0
05 May 2023
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
603
394
0
25 Oct 2019