ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.03414
  4. Cited By
SERAB: A multi-lingual benchmark for speech emotion recognition

SERAB: A multi-lingual benchmark for speech emotion recognition

7 October 2021
Neil Scheidwasser
M. Kegler
P. Beckmann
Milos Cernak
ArXivPDFHTML

Papers citing "SERAB: A multi-lingual benchmark for speech emotion recognition"

30 / 30 papers shown
Title
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion
  Recognition
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition
Mohamed Osman
Daniel Z. Kaplan
Tamer Nadeem
24
1
0
14 Aug 2024
Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging
  Co-Attention Cues in Multitask Learning
Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning
Arnav Goel
Medha Hira
Anubha Gupta
19
0
0
13 Jun 2024
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and
  Benchmark
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
Ziyang Ma
Mingjie Chen
Hezhao Zhang
Zhisheng Zheng
Wenxi Chen
Xiquan Li
Jiaxin Ye
Xie Chen
Thomas Hain
25
12
0
11 Jun 2024
Predicting Heart Activity from Speech using Data-driven and
  Knowledge-based features
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features
Gasser Elbanna
Z. Mostaani
Mathew Magimai.-Doss
SSL
30
0
0
10 Jun 2024
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
29
10
0
09 Apr 2024
Are Paralinguistic Representations all that is needed for Speech Emotion
  Recognition?
Are Paralinguistic Representations all that is needed for Speech Emotion Recognition?
Orchid Chetia Phukan
Gautam Siddharth Kashyap
Arun Balaji Buduru
Rajesh Sharma
23
0
0
02 Feb 2024
EnCodecMAE: Leveraging neural codecs for universal audio representation
  learning
EnCodecMAE: Leveraging neural codecs for universal audio representation learning
L. Pepino
Pablo Riera
Luciana Ferrer
14
4
0
14 Sep 2023
Decoding Emotions: A comprehensive Multilingual Study of Speech Models
  for Speech Emotion Recognition
Decoding Emotions: A comprehensive Multilingual Study of Speech Models for Speech Emotion Recognition
Anant Singh
Akshat Gupta
26
4
0
17 Aug 2023
Capturing Spectral and Long-term Contextual Information for Speech
  Emotion Recognition Using Deep Learning Techniques
Capturing Spectral and Long-term Contextual Information for Speech Emotion Recognition Using Deep Learning Techniques
Samiul Islam
Md. Maksudul Haque
Abu Md. Sadat
11
2
0
04 Aug 2023
Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Speaker Embeddings as Individuality Proxy for Voice Stress Detection
Zihan Wu
Neil Scheidwasser
Karl El Hajal
Milos Cernak
24
3
0
09 Jun 2023
audb -- Sharing and Versioning of Audio and Annotation Data in Python
audb -- Sharing and Versioning of Audio and Annotation Data in Python
H. Wierstorf
Johannes Wagner
F. Eyben
Felix Burkhardt
Björn W. Schuller
20
1
0
01 Mar 2023
cross-modal fusion techniques for utterance-level emotion recognition
  from text and speech
cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Jiacheng Luo
Huy P Phan
Joshua Reiss
8
10
0
05 Feb 2023
deep learning of segment-level feature representation for speech emotion
  recognition in conversations
deep learning of segment-level feature representation for speech emotion recognition in conversations
Jiacheng Luo
Huy P Phan
Joshua Reiss
10
3
0
05 Feb 2023
A Persian ASR-based SER: Modification of Sharif Emotional Speech
  Database and Investigation of Persian Text Corpora
A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora
A. Yazdani
Y. Shekofteh
9
2
0
18 Nov 2022
Efficient Speech Quality Assessment using Self-supervised Framewise
  Embeddings
Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
Karl El Hajal
Zihan Wu
Neil Scheidwasser
Gasser Elbanna
Milos Cernak
18
9
0
12 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis
Self-Supervised Learning for Speech Enhancement through Synthesis
Bryce Irvin
Marko Stamenovic
M. Kegler
Li-Chia Yang
27
18
0
04 Nov 2022
Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and
  Neural Architecture Search
Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Zihan Wang
Qianyu Meng
HaiFeng Lan
Xinrui Zhang
Kehao Guo
Akshat Gupta
6
3
0
31 Oct 2022
Masked Modeling Duo: Learning Representations by Encouraging Both
  Networks to Model the Input
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
16
29
0
26 Oct 2022
The Efficacy of Self-Supervised Speech Models for Audio Representations
The Efficacy of Self-Supervised Speech Models for Audio Representations
Tung-Yu Wu
Chen An Li
Tzu-Han Lin
Tsung-Yuan Hsu
Hung-yi Lee
19
5
0
26 Sep 2022
Semi-supervised cross-lingual speech emotion recognition
Semi-supervised cross-lingual speech emotion recognition
Mirko Agarla
Simone Bianco
Luigi Celona
Paolo Napoletano
A. Petrovsky
Flavio Piccoli
Raimondo Schettini
I. Shanin
11
14
0
14 Jul 2022
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Gasser Elbanna
Neil Scheidwasser
M. Kegler
P. Beckmann
Karl El Hajal
Milos Cernak
SSL
24
21
0
24 Jun 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning
  General-purpose Audio Representation
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
13
65
0
26 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio
  Representations
BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
N. Harada
K. Kashino
SSL
26
53
0
15 Apr 2022
Hybrid Handcrafted and Learnable Audio Representation for Analysis of
  Speech Under Cognitive and Physical Load
Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load
Gasser Elbanna
A. Biryukov
Neil Scheidwasser
Lara Orlandic
Pablo Mainar
M. Kegler
P. Beckmann
Milos Cernak
4
11
0
30 Mar 2022
Self-supervised Graphs for Audio Representation Learning with Limited
  Labeled Data
Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
A. Shirian
Krishna Somandepalli
T. Guha
SSL
38
10
0
31 Jan 2022
An Ensemble 1D-CNN-LSTM-GRU Model with Data Augmentation for Speech
  Emotion Recognition
An Ensemble 1D-CNN-LSTM-GRU Model with Data Augmentation for Speech Emotion Recognition
Md. Rayhan Ahmed
Salekul Islam
Ph. D
A. Muzahidul Islam
Ph. D
Swakkhar Shatabda
Ph. D
8
105
0
10 Dec 2021
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19
  Cough, COVID-19 Speech, Escalation & Primates
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates
Björn W. Schuller
A. Batliner
Christian Bergler
Cecilia Mascolo
Jing Han
...
Pietro Cicuta
L. Rothkrantz
J. Zwerts
Jelle Treep
Casper S. Kaandorp
47
111
0
24 Feb 2021
LSSED: a large-scale dataset and benchmark for speech emotion
  recognition
LSSED: a large-scale dataset and benchmark for speech emotion recognition
Weiquan Fan
Xiangmin Xu
Xiaofen Xing
Weidong Chen
Dongyan Huang
51
32
0
30 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
  Applications
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,214
0
17 Apr 2017
1