SERAB: A multi-lingual benchmark for speech emotion recognition

7 October 2021

Papers citing "SERAB: A multi-lingual benchmark for speech emotion recognition"

30 / 30 papers shown

Title
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition Mohamed Osman Daniel Z. Kaplan Tamer Nadeem 24 1 0 14 Aug 2024
Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning Arnav Goel Medha Hira Anubha Gupta 19 0 0 13 Jun 2024
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark Ziyang Ma Mingjie Chen Hezhao Zhang Zhisheng Zheng Wenxi Chen Xiquan Li Jiaxin Ye Xie Chen Thomas Hain 25 12 0 11 Jun 2024
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features Gasser Elbanna Z. Mostaani Mathew Magimai.-Doss SSL 30 0 0 10 Jun 2024
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework Daisuke Niizumi Daiki Takeuchi Yasunori Ohishi Noboru Harada K. Kashino 29 10 0 09 Apr 2024
Are Paralinguistic Representations all that is needed for Speech Emotion Recognition? Orchid Chetia Phukan Gautam Siddharth Kashyap Arun Balaji Buduru Rajesh Sharma 23 0 0 02 Feb 2024
EnCodecMAE: Leveraging neural codecs for universal audio representation learning L. Pepino Pablo Riera Luciana Ferrer 14 4 0 14 Sep 2023
Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition Anant Singh Akshat Gupta 26 4 0 17 Aug 2023
Capturing Spectral and Long-term Contextual Information for Speech Emotion Recognition Using Deep Learning Techniques Samiul Islam Md. Maksudul Haque Abu Md. Sadat 11 2 0 04 Aug 2023
Speaker Embeddings as Individuality Proxy for Voice Stress Detection Zihan Wu Neil Scheidwasser Karl El Hajal Milos Cernak 24 3 0 09 Jun 2023
audb -- Sharing and Versioning of Audio and Annotation Data in Python H. Wierstorf Johannes Wagner F. Eyben Felix Burkhardt Björn W. Schuller 20 1 0 01 Mar 2023
cross-modal fusion techniques for utterance-level emotion recognition from text and speech Jiacheng Luo Huy P Phan Joshua Reiss 8 10 0 05 Feb 2023
deep learning of segment-level feature representation for speech emotion recognition in conversations Jiacheng Luo Huy P Phan Joshua Reiss 10 3 0 05 Feb 2023
A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora A. Yazdani Y. Shekofteh 9 2 0 18 Nov 2022
Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings Karl El Hajal Zihan Wu Neil Scheidwasser Gasser Elbanna Milos Cernak 18 9 0 12 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis Bryce Irvin Marko Stamenovic M. Kegler Li-Chia Yang 27 18 0 04 Nov 2022
Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search Zihan Wang Qianyu Meng HaiFeng Lan Xinrui Zhang Kehao Guo Akshat Gupta 6 3 0 31 Oct 2022
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input Daisuke Niizumi Daiki Takeuchi Yasunori Ohishi Noboru Harada K. Kashino SSL 16 29 0 26 Oct 2022
The Efficacy of Self-Supervised Speech Models for Audio Representations Tung-Yu Wu Chen An Li Tzu-Han Lin Tsung-Yuan Hsu Hung-yi Lee 19 5 0 26 Sep 2022
Semi-supervised cross-lingual speech emotion recognition Mirko Agarla Simone Bianco Luigi Celona Paolo Napoletano A. Petrovsky Flavio Piccoli Raimondo Schettini I. Shanin 11 14 0 14 Jul 2022
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping Gasser Elbanna Neil Scheidwasser M. Kegler P. Beckmann Karl El Hajal Milos Cernak SSL 24 21 0 24 Jun 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation Daisuke Niizumi Daiki Takeuchi Yasunori Ohishi N. Harada K. Kashino 13 65 0 26 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations Daisuke Niizumi Daiki Takeuchi Yasunori Ohishi N. Harada K. Kashino SSL 26 53 0 15 Apr 2022
Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load Gasser Elbanna A. Biryukov Neil Scheidwasser Lara Orlandic Pablo Mainar M. Kegler P. Beckmann Milos Cernak 4 11 0 30 Mar 2022
Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data A. Shirian Krishna Somandepalli T. Guha SSL 38 10 0 31 Jan 2022
An Ensemble 1D-CNN-LSTM-GRU Model with Data Augmentation for Speech Emotion Recognition Md. Rayhan Ahmed Salekul Islam Ph. D A. Muzahidul Islam Ph. D Swakkhar Shatabda Ph. D 8 105 0 10 Dec 2021
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates Björn W. Schuller A. Batliner Christian Bergler Cecilia Mascolo Jing Han ... Pietro Cicuta L. Rothkrantz J. Zwerts Jelle Treep Casper S. Kaandorp 47 111 0 24 Feb 2021
LSSED: a large-scale dataset and benchmark for speech emotion recognition Weiquan Fan Xiangmin Xu Xiaofen Xing Weidong Chen Dongyan Huang 51 32 0 30 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 294 6,927 0 20 Apr 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications Andrew G. Howard Menglong Zhu Bo Chen Dmitry Kalenichenko Weijun Wang Tobias Weyand M. Andreetto Hartwig Adam 3DH 948 20,214 0 17 Apr 2017