HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised
Audio-Visual Emotion Recognition. Information Fusion (Inf. Fusion), 2024 |
Exploring Emotion Expression Recognition in Older Adults Interacting
with a Virtual Coach. IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023 |
3M-TRANSFORMER: A Multi-Stage Multi-Stream Multimodal Transformer for
Embodied Turn-Taking Prediction. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
A Survey on Image-text Multimodal Models. Ruifeng Guo, Jingxuan Wei, Linzhuang Sun, Khai-Nguyen Nguyen, Guiyong Chang, Dawei Liu, Sibo Zhang, Zhengbing Yao, Mingjun Xu, Liping Bu |
Efficient Multimodal Transformer with Dual-Level Feature Restoration for
Robust Multimodal Sentiment Analysis. IEEE Transactions on Affective Computing (IEEE TAC), 2022 |