ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.05561
  4. Cited By
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild

16 August 2018
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
    CVBM
ArXivPDFHTML

Papers citing "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"

31 / 31 papers shown
Title
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer
  Learning
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Yanan Wang
Donghuo Zeng
Shinya Wada
Satoshi Kurihara
32
6
0
27 Sep 2023
Teacher-Student Architecture for Knowledge Distillation: A Survey
Teacher-Student Architecture for Knowledge Distillation: A Survey
Chengming Hu
Xuan Li
Danyang Liu
Haolun Wu
Xi Chen
Ju Wang
Xue Liu
21
16
0
08 Aug 2023
Recursive Joint Attention for Audio-Visual Fusion in Regression based
  Emotion Recognition
Recursive Joint Attention for Audio-Visual Fusion in Regression based Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
19
10
0
17 Apr 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
18
8
0
25 Feb 2023
Audio Representation Learning by Distilling Video as Privileged
  Information
Audio Representation Learning by Distilling Video as Privileged Information
Amirhossein Hajavi
Ali Etemad
13
4
0
06 Feb 2023
Vision Transformer with Attentive Pooling for Robust Facial Expression
  Recognition
Vision Transformer with Attentive Pooling for Robust Facial Expression Recognition
Fanglei Xue
Qiangchang Wang
Zichang Tan
Zhongsong Ma
G. Guo
ViT
33
67
0
11 Dec 2022
Teacher-Student Architecture for Knowledge Learning: A Survey
Teacher-Student Architecture for Knowledge Learning: A Survey
Chengming Hu
Xuan Li
Dan Liu
Xi Chen
Ju Wang
Xue Liu
14
35
0
28 Oct 2022
Learning Diversified Feature Representations for Facial Expression
  Recognition in the Wild
Learning Diversified Feature Representations for Facial Expression Recognition in the Wild
Negar Heidari
Alexandros Iosifidis
CVBM
16
3
0
17 Oct 2022
Rethinking the Learning Paradigm for Facial Expression Recognition
Rethinking the Learning Paradigm for Facial Expression Recognition
Weijie Wang
N. Sebe
Bruno Lepri
29
2
0
30 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space
  Using Joint Cross-Attention
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
48
31
0
19 Sep 2022
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised
  Contrastive Loss
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss
Riccardo Franceschini
Enrico Fini
Cigdem Beyan
Alessandro Conti
F. Arrigoni
Elisa Ricci
SSL
OffRL
34
16
0
23 Jul 2022
Deep Multimodal Guidance for Medical Image Classification
Deep Multimodal Guidance for Medical Image Classification
Mayur Mallya
Ghassan Hamarneh
30
13
0
10 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with
  Utterance-Specific Dirichlet Priors
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors
Wen Wu
C. Zhang
Xixin Wu
P. Woodland
40
14
0
08 Mar 2022
Multimodal Emotion Recognition using Transfer Learning from Speaker
  Recognition and BERT-based models
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
22
36
0
16 Feb 2022
Keyword localisation in untranscribed speech using visually grounded
  speech models
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
19
7
0
02 Feb 2022
Cross Attentional Audio-Visual Fusion for Dimensional Emotion
  Recognition
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
23
40
0
09 Nov 2021
TransFER: Learning Relation-aware Facial Expression Representations with
  Transformers
TransFER: Learning Relation-aware Facial Expression Representations with Transformers
Fanglei Xue
Qiangchang Wang
G. Guo
ViT
31
183
0
25 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and
  Spectrogram Augmentation
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
16
34
0
05 Aug 2021
Speech Emotion Recognition using Semantic Information
Speech Emotion Recognition using Semantic Information
Panagiotis Tzirakis
Anh-Tuan Nguyen
S. Zafeiriou
Björn W. Schuller
15
19
0
04 Mar 2021
Disentanglement for audio-visual emotion recognition using multitask
  setup
Disentanglement for audio-visual emotion recognition using multitask setup
Raghuveer Peri
Srinivas Parthasarathy
Charles Bradshaw
Shiva Sundaram
18
11
0
11 Feb 2021
asya: Mindful verbal communication using deep learning
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
18
1
0
20 Aug 2020
Dynamic Emotion Modeling with Learnable Graphs and Graph Inception
  Network
Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network
A. Shirian
S. Tripathi
T. Guha
13
7
0
06 Aug 2020
Knowledge Distillation: A Survey
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
19
2,832
0
09 Jun 2020
Cross-modal Speaker Verification and Recognition: A Multilingual
  Perspective
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
19
25
0
28 Apr 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
11
88
0
20 Feb 2020
An empirical analysis of information encoded in disentangled neural
  speaker representations
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
19
13
0
10 Feb 2020
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
27
251
0
10 Dec 2019
MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion
  Recognition
MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion Recognition
Didan Deng
Zhaokang Chen
Yuqian Zhou
Bertram Shi
4
45
0
21 Nov 2019
STEP: Spatial Temporal Graph Convolutional Networks for Emotion
  Perception from Gaits
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
Uttaran Bhattacharya
Trisha Mittal
Rohan Chandra
Tanmay Randhavane
Aniket Bera
Dinesh Manocha
CVBM
20
100
0
28 Oct 2019
Who Do I Sound Like? Showcasing Speaker Recognition Technology by
  YouTube Voice Search
Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search
R. Krishnan
Bilal Soomro
Mahesh Subedar
Ville Hautamaki
Tomi Kinnunen
11
5
0
08 Nov 2018
Learnable PINs: Cross-Modal Embeddings for Person Identity
Learnable PINs: Cross-Modal Embeddings for Person Identity
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
SSL
15
140
0
02 May 2018
1