v1v2v3v4 (latest)

Dawn of the transformer era in speech emotion recognition: closing the valence gap

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

14 March 2022

Johannes Wagner

Andreas Triantafyllopoulos

Björn W. Schuller

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Dawn of the transformer era in speech emotion recognition: closing the valence gap"

50 / 130 papers shown

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Andreas Triantafyllopoulos

403

22 Jul 2024

DISCOVER: A Data-driven Interactive System for Comprehensive Observation, Visualization, and ExploRation of Human Behaviour

Dominik Schiller

Tobias Hallmen

Daksitha Senel Withanage Don

Elisabeth André

Tobias Baur

159

18 Jul 2024

Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech

...

253

17 Jul 2024

Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training

Dingkang Yang

Lihua Zhang

196

06 Jul 2024

Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition

Björn Schuller

192

01 Jul 2024

Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment

Maurice Gerczuk

Björn W. Schuller

26 Jun 2024

This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach

Lukas Christ

Shahin Amiriparian

Friederike Hawighorst

Ann-Kathrin Schill

Angelo Boutalikakis

Lorenz Graf-Vlachy

Andreas Konig

Björn W. Schuller

136

25 Jun 2024

What Does it Take to Generalize SER Model Across Datasets? A Comprehensive BenchmarkInterspeech (Interspeech), 2024

Muhammad Abdul-Mageed

176

14 Jun 2024

EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

Deok-Hyeon Cho

Hyung-Seok Oh

Seung-Bin Kim

Sang-Hoon Lee

Seong-Whan Lee

199

12 Jun 2024

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

Yuanchao Li

Peter Bell

Catherine Lai

367

12 Jun 2024

The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition

Maurice Gerczuk

...

Björn Schuller

336

11 Jun 2024

ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets

Shahin Amiriparian

Filip Packañ

Maurice Gerczuk

Björn W. Schuller

101

11 Jun 2024

ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks

Xin Jing

Andreas Triantafyllopoulos

Björn Schuller

146

11 Jun 2024

Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition

Andreas Triantafyllopoulos

Björn Schuller

141

10 Jun 2024

INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition

Andreas Triantafyllopoulos

Björn Schuller

191

10 Jun 2024

On the social bias of speech self-supervised modelsInterspeech (Interspeech), 2024

314

07 Jun 2024

Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning

206

04 Jun 2024

Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition

286

01 May 2024

Usefulness of Emotional Prosody in Neural Machine Translation

Charles Brazier

Jean-Luc Rouas

161

27 Apr 2024

Improving Personalisation in Valence and Arousal Prediction using Data Augmentation

Munachiso Nwadike

Jialin Li

Hanan Salam

189

13 Apr 2024

The VoicePrivacy 2024 Challenge Evaluation Plan

Xin Wang

285

03 Apr 2024

Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision

254

19 Mar 2024

SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition

298

19 Mar 2024

Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction

319

18 Mar 2024

PAVITS: Exploring Prosody-aware VITS for End-to-End Emotional Voice Conversion

Wenming Zheng

152

03 Mar 2024

The AffectToolbox: Affect Analysis for Everyone

204

23 Feb 2024

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Haibin Wu

Jiawei Du

Chi-Chun Lee

Hung-Yi Lee

396

20 Feb 2024

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition

Björn W. Schuller

207

02 Feb 2024

Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Sunan Li

Wenming Zheng

165

23 Jan 2024

DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024

580

16 Jan 2024

A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions

Alex-Răzvan Ispas

Théo Deschamps-Berger

Laurence Devillers

147

31 Dec 2023

DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition

Chengxin Chen

Pengyuan Zhang

133

25 Dec 2023

Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach

Wenming Zheng

136

11 Dec 2023

Testing Correctness, Fairness, and Robustness of Speech Emotion Recognition Models

Björn W. Schuller

357

11 Dec 2023

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech SynthesisIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

389

21 Nov 2023

Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual CoachIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023

...

Dijana Petrovska – Delacretaz

M. Inés Torres

Sergio Escalera

235

09 Nov 2023

EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks

233

19 Oct 2023

Active Learning Based Fine-Tuning Framework for Speech Emotion RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

329

30 Sep 2023

Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from SpeechAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023

Bagus Tris Atmaja

A. Sasou

20 Sep 2023

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Ziyang Ma

Wen Wu

Zhisheng Zheng

Yiwei Guo

Qian Chen

Shiliang Zhang

Xie Chen

237

19 Sep 2023

EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

N. Prabhu

Bunlong Lay

Simon Welker

N. Lehmann-Willenbrock

Timo Gerkmann

DiffM

296

14 Sep 2023

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect RepresentationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Debaditya Shome

Ali Etemad

174

09 Sep 2023

Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion RecognitionInterspeech (Interspeech), 2023

Minh Tran

Yufeng Yin

M. Soleymani

176

05 Sep 2023

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Yu-Wen Chen

Julia Hirschberg

Yu Tsao

178

03 Sep 2023

Multiscale Contextual Learning for Speech Emotion Recognition in Emergency Call Center Conversations

Théo Deschamps-Berger

L. Lamel

Laurence Devillers

142

28 Aug 2023

Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech TasksACM Multimedia (ACM MM), 2023

284

28 Aug 2023

AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis

139

16 Aug 2023

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition

Heng Lu

241

08 Aug 2023

Elucidate Gender Fairness in Singing Voice TranscriptionACM Multimedia (ACM MM), 2023

Xiangming Gu

Weizhen Zeng

Ye Wang

222

05 Aug 2023

CFN-ESA: A Cross-Modal Fusion Network with Emotion-Shift Awareness for Dialogue Emotion RecognitionIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023

325

28 Jul 2023