Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.05561
Cited By
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
16 August 2018
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"
31 / 31 papers shown
Title
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Yanan Wang
Donghuo Zeng
Shinya Wada
Satoshi Kurihara
32
6
0
27 Sep 2023
Teacher-Student Architecture for Knowledge Distillation: A Survey
Chengming Hu
Xuan Li
Danyang Liu
Haolun Wu
Xi Chen
Ju Wang
Xue Liu
21
16
0
08 Aug 2023
Recursive Joint Attention for Audio-Visual Fusion in Regression based Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
19
10
0
17 Apr 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
18
8
0
25 Feb 2023
Audio Representation Learning by Distilling Video as Privileged Information
Amirhossein Hajavi
Ali Etemad
13
4
0
06 Feb 2023
Vision Transformer with Attentive Pooling for Robust Facial Expression Recognition
Fanglei Xue
Qiangchang Wang
Zichang Tan
Zhongsong Ma
G. Guo
ViT
33
67
0
11 Dec 2022
Teacher-Student Architecture for Knowledge Learning: A Survey
Chengming Hu
Xuan Li
Dan Liu
Xi Chen
Ju Wang
Xue Liu
14
35
0
28 Oct 2022
Learning Diversified Feature Representations for Facial Expression Recognition in the Wild
Negar Heidari
Alexandros Iosifidis
CVBM
16
3
0
17 Oct 2022
Rethinking the Learning Paradigm for Facial Expression Recognition
Weijie Wang
N. Sebe
Bruno Lepri
29
2
0
30 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
48
31
0
19 Sep 2022
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss
Riccardo Franceschini
Enrico Fini
Cigdem Beyan
Alessandro Conti
F. Arrigoni
Elisa Ricci
SSL
OffRL
34
16
0
23 Jul 2022
Deep Multimodal Guidance for Medical Image Classification
Mayur Mallya
Ghassan Hamarneh
30
13
0
10 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors
Wen Wu
C. Zhang
Xixin Wu
P. Woodland
40
14
0
08 Mar 2022
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
22
36
0
16 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
19
7
0
02 Feb 2022
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
23
40
0
09 Nov 2021
TransFER: Learning Relation-aware Facial Expression Representations with Transformers
Fanglei Xue
Qiangchang Wang
G. Guo
ViT
31
183
0
25 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
16
34
0
05 Aug 2021
Speech Emotion Recognition using Semantic Information
Panagiotis Tzirakis
Anh-Tuan Nguyen
S. Zafeiriou
Björn W. Schuller
15
19
0
04 Mar 2021
Disentanglement for audio-visual emotion recognition using multitask setup
Raghuveer Peri
Srinivas Parthasarathy
Charles Bradshaw
Shiva Sundaram
18
11
0
11 Feb 2021
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
18
1
0
20 Aug 2020
Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network
A. Shirian
S. Tripathi
T. Guha
13
7
0
06 Aug 2020
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
19
2,832
0
09 Jun 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
19
25
0
28 Apr 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
11
88
0
20 Feb 2020
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
19
13
0
10 Feb 2020
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
27
251
0
10 Dec 2019
MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion Recognition
Didan Deng
Zhaokang Chen
Yuqian Zhou
Bertram Shi
4
45
0
21 Nov 2019
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
Uttaran Bhattacharya
Trisha Mittal
Rohan Chandra
Tanmay Randhavane
Aniket Bera
Dinesh Manocha
CVBM
20
100
0
28 Oct 2019
Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search
R. Krishnan
Bilal Soomro
Mahesh Subedar
Ville Hautamaki
Tomi Kinnunen
11
5
0
08 Nov 2018
Learnable PINs: Cross-Modal Embeddings for Person Identity
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
SSL
15
140
0
02 May 2018
1