Dawn of the transformer era in speech emotion recognition: closing the valence gap

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
14 March 2022
Johannes Wagner
Andreas Triantafyllopoulos
H. Wierstorf
Maximilian Schmitt
Felix Burkhardt
F. Eyben
Björn W. Schuller

Papers citing "Dawn of the transformer era in speech emotion recognition: closing the valence gap"

50 / 130 papers shown
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
A. Mesaros
Maria Sandsten
B. Schuller
403
7
0
22 Jul 2024
DISCOVER: A Data-driven Interactive System for Comprehensive Observation, Visualization, and ExploRation of Human Behaviour
Dominik Schiller
Tobias Hallmen
Daksitha Senel Withanage Don
Elisabeth André
Tobias Baur
159
7
0
18 Jul 2024
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
Haibin Wu
Xiaofei Wang
Sefik Emre Eskimez
Manthan Thakker
Daniel Tompkins
...
Canrun Li
Zhen Xiao
Sheng Zhao
Jinyu Li
Naoyuki Kanda
253
19
0
17 Jul 2024
Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training
Dingkang Yang
Kun Yang
Haopeng Kuang
Zhaoyu Chen
Yuzheng Wang
Lihua Zhang
CML
196
14
0
06 Jul 2024
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition
Oliver Schrufer
M. Milling
Felix Burkhardt
F. Eyben
Björn Schuller
192
5
0
01 Jul 2024
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment
Maurice Gerczuk
Shahin Amiriparian
Justina Lutz
W. Strube
I. Papazova
Alkomiet Hasan
Björn W. Schuller
42
6
0
26 Jun 2024
This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach
Lukas Christ
Shahin Amiriparian
Friederike Hawighorst
Ann-Kathrin Schill
Angelo Boutalikakis
Lorenz Graf-Vlachy
Andreas Konig
Björn W. Schuller
136
1
0
25 Jun 2024
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
Interspeech (Interspeech), 2024
Adham Ibrahim
Shady Shehata
Ajinkya Kulkarni
Mukhtar Mohamed
Muhammad Abdul-Mageed
176
7
0
14 Jun 2024
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech
Deok-Hyeon Cho
Hyung-Seok Oh
Seung-Bin Kim
Sang-Hoon Lee
Seong-Whan Lee
199
31
0
12 Jun 2024
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
Yuanchao Li
Peter Bell
Catherine Lai
367
20
0
12 Jun 2024
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition
Shahin Amiriparian
Lukas Christ
Alexander Kathan
Maurice Gerczuk
Niklas Muller
...
Lukas Stappen
Andreas Konig
Xiaoshi Zhong
Björn Schuller
Simone Eulitz
336
15
0
11 Jun 2024
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets
Shahin Amiriparian
Filip Packañ
Maurice Gerczuk
Björn W. Schuller
101
18
0
11 Jun 2024
ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks
Xin Jing
Andreas Triantafyllopoulos
Björn Schuller
146
9
0
11 Jun 2024
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition
Andreas Triantafyllopoulos
Björn Schuller
141
2
0
10 Jun 2024
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition
Andreas Triantafyllopoulos
A. Batliner
Simon Rampp
M. Milling
Björn Schuller
VLM
191
3
0
10 Jun 2024
On the social bias of speech self-supervised models
Interspeech (Interspeech), 2024
Yi-Cheng Lin
Tzu-Quan Lin
Hsi-Che Lin
Andy T. Liu
Hung-yi Lee
314
11
0
07 Jun 2024
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Lukas Christ
Shahin Amiriparian
M. Milling
Ilhan Aslan
B. Schuller
206
1
0
04 Jun 2024
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
Dongyuan Li
Ying Zhang
Yusong Wang
Funakoshi Kataro
Manabu Okumura
286
3
0
01 May 2024
Usefulness of Emotional Prosody in Neural Machine Translation
Charles Brazier
Jean-Luc Rouas
161
1
0
27 Apr 2024
Improving Personalisation in Valence and Arousal Prediction using Data Augmentation
Munachiso Nwadike
Jialin Li
Hanan Salam
189
0
0
13 Apr 2024
The VoicePrivacy 2024 Challenge Evaluation Plan
N. Tomashenko
Xiaoxiao Miao
Pierre Champion
Sarina Meyer
Xin Wang
Emmanuel Vincent
Michele Panariello
Nicholas W. D. Evans
Junichi Yamagishi
Massimiliano Todisco
285
58
0
03 Apr 2024
Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision
E. Ryumina
M. Markitantov
D. Ryumin
Heysem Kaya
Alexey Karpov
254
8
0
19 Mar 2024
SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition
D. Dresvyanskiy
M. Markitantov
Jiawei Yu
Peitong Li
Heysem Kaya
Alexey Karpov
298
8
0
19 Mar 2024
Unimodal Multi-Task Fusion for Emotional Mimicry Intensity Prediction
Tobias Hallmen
Fabian Deuser
Norbert Oswald
Elisabeth André
319
5
0
18 Mar 2024
PAVITS: Exploring Prosody-aware VITS for End-to-End Emotional Voice Conversion
Tianhua Qi
Wenming Zheng
Cheng Lu
Yuan Zong
Hailun Lian
152
14
0
03 Mar 2024
The AffectToolbox: Affect Analysis for Everyone
Silvan Mertes
Dominik Schiller
Michael Dietz
Elisabeth André
Florian Lingenfelser
204
7
0
23 Feb 2024
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition
Haibin Wu
Huang-Cheng Chou
Kai-Wei Chang
Lucas Goncalves
Jiawei Du
Jyh-Shing Roger Jang
Chi-Chun Lee
Hung-Yi Lee
396
19
0
20 Feb 2024
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Yi Chang
Zhao Ren
Zixing Zhang
Xin Jing
Kun Qian
Xi Shao
Bin Hu
Tanja Schultz
Björn W. Schuller
AAML
207
5
0
02 Feb 2024
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yan Zhao
Jincen Wang
Cheng Lu
Sunan Li
Bjorn Schuller
Yuan Zong
Wenming Zheng
165
4
0
23 Jan 2024
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024
Hyoung-Seok Oh
Sang-Hoon Lee
Deok-Hyun Cho
Seong-Whan Lee
580
1
0
16 Jan 2024
A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions
Alex-Răzvan Ispas
Théo Deschamps-Berger
Laurence Devillers
147
4
0
31 Dec 2023
DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
133
1
0
25 Dec 2023
Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach
Yan Zhao
Yuan Zong
Hailun Lian
Cheng Lu
Jingang Shi
Wenming Zheng
136
1
0
11 Dec 2023
Testing Correctness, Fairness, and Robustness of Speech Emotion Recognition Models
Anna Derington
H. Wierstorf
Ali Özkil
F. Eyben
Felix Burkhardt
Björn W. Schuller
357
2
0
11 Dec 2023
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Sang-Hoon Lee
Haram Choi
Seung-Bin Kim
Seong-Whan Lee
BDL
389
60
0
21 Nov 2023
Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Cristina Palmero
Mikel de Velasco
Mohamed Amine Hmani
Aymen Mtibaa
Leila Ben Letaifa
...
Anna Esposito
M. El-Yacoubi
Dijana Petrovska-Delacretaz
M. Inés Torres
Sergio Escalera
235
8
0
09 Nov 2023
EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks
Hanan Hamza
Fiza Gafoor
Fathima Sithara
Gayathri Anil
V. Anoop
233
1
0
19 Oct 2023
Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Dongyuan Li
Yusong Wang
Kotaro Funakoshi
Manabu Okumura
329
5
0
30 Sep 2023
Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
Bagus Tris Atmaja
A. Sasou
95
4
0
20 Sep 2023
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ziyang Ma
Wen Wu
Zhisheng Zheng
Yiwei Guo
Qian Chen
Shiliang Zhang
Xie Chen
237
28
0
19 Sep 2023
EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
N. Prabhu
Bunlong Lay
Simon Welker
N. Lehmann-Willenbrock
Timo Gerkmann
DiffM
296
8
0
14 Sep 2023
Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Debaditya Shome
Ali Etemad
174
9
0
09 Sep 2023
Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Interspeech (Interspeech), 2023
Minh Tran
Yufeng Yin
M. Soleymani
176
8
0
05 Sep 2023
Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Yu-Wen Chen
Julia Hirschberg
Yu Tsao
178
9
0
03 Sep 2023
Multiscale Contextual Learning for Speech Emotion Recognition in Emergency Call Center Conversations
Théo Deschamps-Berger
L. Lamel
Laurence Devillers
142
2
0
28 Aug 2023
Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks
ACM Multimedia (ACM MM), 2023
Payal Mohapatra
Akash Pandey
Yueyuan Sui
Qi Zhu
284
6
0
28 Aug 2023
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis
Hrishikesh Viswanath
Aneesh Bhattacharya
Pascal Jutras-Dubé
Prerit Gupta
Mridu Prashanth
Yashvardhan Khaitan
Aniket Bera
139
2
0
16 Aug 2023
MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition
Yu Pan
Yuguang Yang
Yuheng Huang
Jixun Yao
Jingjing Yin
Yanni Hu
Heng Lu
Lei Ma
Jianjun Zhao
241
7
0
08 Aug 2023
Elucidate Gender Fairness in Singing Voice Transcription
ACM Multimedia (ACM MM), 2023
Xiangming Gu
Weizhen Zeng
Ye Wang
222
3
0
05 Aug 2023
CFN-ESA: A Cross-Modal Fusion Network with Emotion-Shift Awareness for Dialogue Emotion Recognition
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Jiang Li
Xiaoping Wang
Yingjian Liu
Zhigang Zeng
325
53
0
28 Jul 2023