ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.09212
  4. Cited By
On the use of Self-supervised Pre-trained Acoustic and Linguistic
  Features for Continuous Speech Emotion Recognition

On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition

Spoken Language Technology Workshop (SLT), 2020
18 November 2020
Manon Macary
Marie Tahon
Yannick Esteve
Anthony Rousseau
    SSL
ArXiv (abs)PDFHTML

Papers citing "On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition"

23 / 23 papers shown
SENSE models: an open source solution for multilingual and multimodal semantic-based tasks
SENSE models: an open source solution for multilingual and multimodal semantic-based tasks
Salima Mdhaffar
Haroun Elleuch
Chaimae Chellaf
H. Nguyen
Yannick Esteve
VLM
190
5
0
15 Sep 2025
End-to-End Integration of Speech Emotion Recognition with Voice Activity
  Detection using Self-Supervised Learning Features
End-to-End Integration of Speech Emotion Recognition with Voice Activity Detection using Self-Supervised Learning Features
Natsuo Yamashita
Masaaki Yamamoto
Yohei Kawaguchi
323
1
0
17 Oct 2024
The Unreliability of Acoustic Systems in Alzheimer's Speech Datasets
  with Heterogeneous Recording Conditions
The Unreliability of Acoustic Systems in Alzheimer's Speech Datasets with Heterogeneous Recording Conditions
L. Gauder
Pablo Riera
A. Slachevsky
G. Forno
Adolfo M. Garcia
Luciana Ferrer
262
4
0
11 Sep 2024
MSP-Podcast SER Challenge 2024: Lántenne du Ventoux Multimodal
  Self-Supervised Learning for Speech Emotion Recognition
MSP-Podcast SER Challenge 2024: Lántenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
J. Duret
Mickael Rouvier
Yannick Esteve
163
5
0
08 Jul 2024
A dual task learning approach to fine-tune a multilingual semantic
  speech encoder for Spoken Language Understanding
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding
G. Laperriere
Sahar Ghannay
Bassam Jabaian
Yannick Esteve
339
1
0
17 Jun 2024
Acoustic and linguistic representations for speech continuous emotion
  recognition in call center conversations
Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations
Manon Macary
Marie Tahon
Yannick Esteve
Daniel Luzzati
243
3
0
06 Oct 2023
Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis
Zero-Shot Emotion Transfer For Cross-Lingual Speech SynthesisAutomatic Speech Recognition & Understanding (ASRU), 2023
Yuke Li
Xinfa Zhu
Yinjiao Lei
Hai Li
Junhui Liu
Danming Xie
Lei Xie
301
6
0
06 Oct 2023
Semantic enrichment towards efficient speech representations
Semantic enrichment towards efficient speech representationsInterspeech (Interspeech), 2023
G. Laperriere
H. Nguyen
Sahar Ghannay
Bassam Jabaian
Yannick Esteve
371
2
0
03 Jul 2023
Learning Multilingual Expressive Speech Representation for Prosody
  Prediction without Parallel Data
Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel DataSpeech Synthesis Workshop (SSW), 2023
J. Duret
Titouan Parcollet
Yannick Esteve
173
4
0
29 Jun 2023
A Comparative Study of Pre-trained Speech and Audio Embeddings for
  Speech Emotion Recognition
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition
Orchid Chetia Phukan
Arun Balaji Buduru
Rajesh Sharma
262
9
0
22 Apr 2023
A vector quantized masked autoencoder for speech emotion recognition
A vector quantized masked autoencoder for speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
307
32
0
21 Apr 2023
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion
  Recognition
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition
Soumya Dutta
Sriram Ganapathy
462
25
0
14 Apr 2023
On the Use of Semantically-Aligned Speech Representations for Spoken
  Language Understanding
On the Use of Semantically-Aligned Speech Representations for Spoken Language UnderstandingSpoken Language Technology Workshop (SLT), 2022
G. Laperriere
Valentin Pelloin
Mickael Rouvier
Themos Stafylakis
Yannick Esteve
315
10
0
11 Oct 2022
Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion,
  Age, and Origin from Vocal Bursts
Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts
Atijit Anuchitanukul
Lucia Specia
VLM
233
6
0
24 Jun 2022
Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
Aziz Ur Rehman
Zhentao Liu
Min Wu
Weihua Cao
Chengchong Jia
195
13
0
25 Apr 2022
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and
  Cross-Language Speech Emotion Recognition
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion RecognitionIEEE Transactions on Affective Computing (IEEE TAC), 2022
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Björn Schuller
234
69
0
19 Apr 2022
Transformer-Based Self-Supervised Learning for Emotion Recognition
Transformer-Based Self-Supervised Learning for Emotion RecognitionInternational Conference on Pattern Recognition (ICPR), 2022
Juan Vazquez-Rodriguez
G. Lefebvre
Julien Cumin
James L. Crowley
338
44
0
08 Apr 2022
Learning Speech Emotion Representations in the Quaternion Domain
Learning Speech Emotion Representations in the Quaternion DomainIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
E. Guizzo
Tillman Weyde
Simone Scardapane
Danilo Comminiello
242
33
0
05 Apr 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark
  for Semantic and Generative Capabilities
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative CapabilitiesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Hsiang-Sheng Tsai
Heng-Jui Chang
Wen-Chin Huang
Zili Huang
Kushal Lakhotia
...
Hsuan-Jui Chen
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
332
127
0
14 Mar 2022
Multimodal Emotion Recognition with High-level Speech and Text Features
Multimodal Emotion Recognition with High-level Speech and Text Features
M. R. Makiuchi
Kuniaki Uto
Koichi Shinoda
252
85
0
29 Sep 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
Emotion Recognition from Speech Using Wav2vec 2.0 EmbeddingsInterspeech (Interspeech), 2021
L. Pepino
Pablo Riera
Luciana Ferrer
290
447
0
08 Apr 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion
  Recognition
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion RecognitionIEEE Transactions on Affective Computing (TAC), 2021
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
270
75
0
10 Mar 2021
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
613
394
0
25 Oct 2019
1
Page 1 of 1