arXiv: 2501.03103

MVP: Multimodal Emotion Recognition based on Video and Physiological Signals

6 January 2025
Valeriya Strizhkova
Hadi Kachmar
Hava Chaptoukaev
Raphael Kalandadze
Natia Kukhilava
Tatia Tsmindashvili
Nibras Abo-Alzahab
Maria A. Zuluaga
Michal Balazia
A. Dantcheva
François Brémond
Laura M. Ferrari
Abstract

Human emotions entail a complex set of behavioral, physiological and cognitive changes. Current state-of-the-art models fuse the behavioral and physiological components using classic machine learning rather than recent deep learning techniques. We propose to fill this gap by designing the Multimodal for Video and Physio (MVP) architecture, streamlined to fuse video and physiological signals. Unlike other approaches, MVP exploits the benefits of attention to enable the use of long input sequences (1-2 minutes). We studied video and physiological backbones for inputting long sequences and evaluated our method against the state of the art. Our results show that MVP outperforms previous methods for emotion recognition based on facial videos, EDA, and ECG/PPG.
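
The abstract does not say how MVP's attention-based fusion is wired, so the following is only a generic illustration of the idea, not the authors' architecture: a single-head cross-attention step in which video tokens attend to physiological tokens, followed by a residual connection and mean pooling. All names (`cross_attention`, `fuse`), token counts, and feature sizes are hypothetical, and the weights are random rather than trained.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(queries, keys_values, w_q, w_k, w_v):
    # Generic scaled dot-product cross-attention (single head).
    # queries:     (n_q, d_q)   e.g. video tokens
    # keys_values: (n_kv, d_kv) e.g. physiological tokens
    q = queries @ w_q                          # (n_q, d_attn)
    k = keys_values @ w_k                      # (n_kv, d_attn)
    v = keys_values @ w_v                      # (n_kv, d_out)
    scores = q @ k.T / np.sqrt(q.shape[-1])    # (n_q, n_kv)
    weights = softmax(scores, axis=-1)         # each row sums to 1
    return weights @ v, weights


def fuse(video_tokens, physio_tokens, w_q, w_k, w_v):
    # Hypothetical fusion: video attends to physio, then residual + mean pooling.
    attended, weights = cross_attention(video_tokens, physio_tokens, w_q, w_k, w_v)
    fused = video_tokens + attended            # requires d_out == d_video
    return fused.mean(axis=0), weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Assumed sizes: 120 video tokens and 240 physio tokens for one long clip.
    d_video, d_physio, d_attn = 64, 32, 16
    video = rng.normal(size=(120, d_video))
    physio = rng.normal(size=(240, d_physio))
    w_q = rng.normal(size=(d_video, d_attn)) * 0.1
    w_k = rng.normal(size=(d_physio, d_attn)) * 0.1
    w_v = rng.normal(size=(d_physio, d_video)) * 0.1
    pooled, weights = fuse(video, physio, w_q, w_k, w_v)
    print(pooled.shape, weights.shape)
```

Because the attention matrix has one row per video token and one column per physiological token, its size grows with the product of the two sequence lengths, which is why the choice of tokenization matters for clips lasting one to two minutes.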
