ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.08708
  4. Cited By
Variational Autoencoders for Learning Latent Representations of Speech
  Emotion: A Preliminary Study
v1v2v3 (latest)

Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study

Interspeech (Interspeech), 2017
23 December 2017
S. Latif
R. Rana
Junaid Qadir
J. Epps
    SSLDRL
ArXiv (abs)PDFHTML

Papers citing "Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study"

37 / 37 papers shown
Can Large Language Models Aid in Annotating Speech Emotional Data?
  Uncovering New Frontiers
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New FrontiersIEEE Computational Intelligence Magazine (IEEE CIM), 2023
S. Latif
Muhammad Usama
Mohammad Ibrahim Malik
Björn W. Schuller
379
20
0
12 Jul 2023
Multitask Learning from Augmented Auxiliary Data for Improving Speech
  Emotion Recognition
Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion RecognitionIEEE Transactions on Affective Computing (IEEE TAC), 2022
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Björn W. Schuller
188
36
0
12 Jul 2022
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and
  Cross-Language Speech Emotion Recognition
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion RecognitionIEEE Transactions on Affective Computing (IEEE TAC), 2022
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Björn Schuller
215
68
0
19 Apr 2022
Continuous Metric Learning For Transferable Speech Emotion Recognition
  and Embedding Across Low-resource Languages
Continuous Metric Learning For Transferable Speech Emotion Recognition and Embedding Across Low-resource Languages
Sneha Das
N. Lund
N. Lønfeldt
A. Pagsberg
Line H. Clemmensen
167
6
0
28 Mar 2022
Towards Transferable Speech Emotion Representation: On loss functions
  for cross-lingual latent representations
Towards Transferable Speech Emotion Representation: On loss functions for cross-lingual latent representationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Sneha Das
N. Lønfeldt
A. Pagsberg
Line H. Clemmensen
178
9
0
28 Mar 2022
DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio
  Representation Learning
DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning
Sreyan Ghosh
Ashish Seth
and Deepak Mittal
Maneesh Singh
S. Umesh
SSL
320
6
0
25 Mar 2022
How Speech is Recognized to Be Emotional - A Study Based on Information
  Decomposition
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Haoran Sun
Lantian Li
Tianshi Zheng
Dong Wang
CVBM
134
0
0
24 Nov 2021
DECAR: Deep Clustering for learning general-purpose Audio
  Representations
DECAR: Deep Clustering for learning general-purpose Audio Representations
Sreyan Ghosh
Sandesh V Katta
Ashish Seth
S. Umesh
SSL
260
14
0
17 Oct 2021
Multi-modal Residual Perceptron Network for Audio-Video Emotion
  Recognition
Multi-modal Residual Perceptron Network for Audio-Video Emotion RecognitionItalian National Conference on Sensors (INS), 2021
Xin Chang
W. Skarbek
165
22
0
21 Jul 2021
Towards Interpretable and Transferable Speech Emotion Recognition:
  Latent Representation Based Analysis of Features, Methods and Corpora
Towards Interpretable and Transferable Speech Emotion Recognition: Latent Representation Based Analysis of Features, Methods and Corpora
Sneha Das
N. Lønfeldt
A. Pagsberg
Line H. Clemmensen
51
6
0
05 May 2021
Privacy Enhanced Speech Emotion Communication using Deep Learning Aided
  Edge Computing
Privacy Enhanced Speech Emotion Communication using Deep Learning Aided Edge Computing
Hafiz Shehbaz Ali
Fakhar ul Hassan
S. Latif
H. Manzoor
Junaid Qadir
223
17
0
31 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based ApplicationsArtificial Intelligence Review (AIR), 2021
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Xiaoshi Zhong
OffRL
414
93
0
01 Jan 2021
Generative Adversarial Networks in Human Emotion Synthesis:A Review
Generative Adversarial Networks in Human Emotion Synthesis:A ReviewIEEE Access (IEEE Access), 2020
Noushin Hajarolasvadi
M. A. Ramírez
H. Demirel
GAN
321
28
0
28 Oct 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve
  Multimodal Speech Emotion Recognition
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
220
117
0
15 Aug 2020
Expressive TTS Training with Frame and Style Reconstruction Loss
Expressive TTS Training with Frame and Style Reconstruction Loss
Rui Liu
Berrak Sisman
Guanglai Gao
Haizhou Li
312
82
0
04 Aug 2020
It's LeVAsa not LevioSA! Latent Encodings for Valence-Arousal Structure
  Alignment
It's LeVAsa not LevioSA! Latent Encodings for Valence-Arousal Structure Alignment
Surabhi S. Nath
Vishaal Udandarao
Jainendra Shukla
205
5
0
20 Jul 2020
Modality Dropout for Improved Performance-driven Talking Faces
Modality Dropout for Improved Performance-driven Talking FacesInternational Conference on Multimodal Interaction (ICMI), 2020
Ahmed Hussen Abdelaziz
B. Theobald
Paul Dixon
Reinhard Knothe
N. Apostoloff
Sachin Kajareker
219
46
0
27 May 2020
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks,
  and Cross-corpus Setting for Speech Emotion Recognition
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Björn W. Schuller
426
32
0
18 May 2020
Augmenting Generative Adversarial Networks for Speech Emotion
  Recognition
Augmenting Generative Adversarial Networks for Speech Emotion Recognition
S. Latif
Muhammad Asim
R. Rana
Sara Khalifa
Raja Jurdak
Björn W. Schuller
GAN
364
30
0
18 May 2020
Towards Learning a Universal Non-Semantic Representation of Speech
Towards Learning a Universal Non-Semantic Representation of SpeechInterspeech (Interspeech), 2020
Joel Shor
A. Jansen
Ronnie Maor
Oran Lang
Omry Tuval
Félix de Chaumont Quitry
Marco Tagliasacchi
Ira Shavitt
Dotan Emanuel
Yinnon A. Haviv
SSL
605
167
0
25 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
387
90
0
02 Jan 2020
Modeling Feature Representations for Affective Speech using Generative
  Adversarial Networks
Modeling Feature Representations for Affective Speech using Generative Adversarial NetworksIEEE Transactions on Affective Computing (TAC), 2019
Saurabh Sahu
Rahul Gupta
C. Espy-Wilson
GAN
253
16
0
31 Oct 2019
Unsupervised Adversarial Domain Adaptation for Cross-Lingual Speech
  Emotion Recognition
Unsupervised Adversarial Domain Adaptation for Cross-Lingual Speech Emotion RecognitionAffective Computing and Intelligent Interaction (ACII), 2019
S. Latif
Junaid Qadir
Muhammad Bilal
GAN
338
62
0
13 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion
  Recognition
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion RecognitionIEEE Transactions on Affective Computing (TAC), 2019
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
366
113
0
13 Jul 2019
Disentangled Representation Learning with Information Maximizing
  Autoencoder
Disentangled Representation Learning with Information Maximizing Autoencoder
Kazi Nazmul Haque
S. Latif
R. Rana
DRL
97
1
0
18 Apr 2019
Deep Generative Models for Reject Inference in Credit Scoring
Deep Generative Models for Reject Inference in Credit Scoring
R. A. Mancisidor
Michael C. Kampffmeyer
K. Aas
Robert Jenssen
BDLFaML
168
55
0
12 Apr 2019
Direct Modelling of Speech Emotion from Raw Speech
Direct Modelling of Speech Emotion from Raw Speech
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
341
117
0
08 Apr 2019
Improving Cross-Corpus Speech Emotion Recognition with Adversarial
  Discriminative Domain Generalization (ADDoG)
Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)
John Gideon
M. McInnis
E. Provost
258
125
0
28 Mar 2019
Learning Latent Representations of Bank Customers With The Variational
  Autoencoder
Learning Latent Representations of Bank Customers With The Variational Autoencoder
R. A. Mancisidor
Michael C. Kampffmeyer
K. Aas
Robert Jenssen
DRLBDL
160
30
0
14 Mar 2019
Automated Screening for Distress: A Perspective for the Future
Automated Screening for Distress: A Perspective for the Future
R. Rana
S. Latif
R. Gururajan
A. Gray
G. Mackenzie
G. Humphris
J. Dunn
189
46
0
22 Feb 2019
Motion Corrected Multishot MRI Reconstruction Using Generative Networks
  with Sensitivity Encoding
Motion Corrected Multishot MRI Reconstruction Using Generative Networks with Sensitivity Encoding
Muhammad Usman
Muhammad Asim
S. Latif
B. Lee
Junaid Qadir
MedImDiffMGAN
434
68
0
20 Feb 2019
Variational Recurrent Neural Networks for Graph Classification
Variational Recurrent Neural Networks for Graph Classification
Edouard Pineau
Nathan de Lara
GNN
244
0
0
07 Feb 2019
Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages
Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages
S. Latif
A. Qayyum
Muhammad Usman
Junaid Qadir
CVBM
237
121
0
15 Dec 2018
Adversarial Machine Learning And Speech Emotion Recognition: Utilizing
  Generative Adversarial Networks For Robustness
Adversarial Machine Learning And Speech Emotion Recognition: Utilizing Generative Adversarial Networks For Robustness
S. Latif
R. Rana
Junaid Qadir
GANAAML
182
44
0
28 Nov 2018
Predicting Expressive Speaking Style From Text In End-To-End Speech
  Synthesis
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Daisy Stanton
Yuxuan Wang
RJ Skerry-Ryan
213
126
0
04 Aug 2018
Segment-Based Credit Scoring Using Latent Clusters in the Variational
  Autoencoder
Segment-Based Credit Scoring Using Latent Clusters in the Variational Autoencoder
R. A. Mancisidor
Michael C. Kampffmeyer
K. Aas
Robert Jenssen
DRL
140
3
0
07 Jun 2018
Phonocardiographic Sensing using Deep Learning for Abnormal Heartbeat
  Detection
Phonocardiographic Sensing using Deep Learning for Abnormal Heartbeat Detection
S. Latif
Muhammad Usman
R. Rana
Junaid Qadir
277
162
0
25 Jan 2018
1
Page 1 of 1