Vesper: A Compact and Effective Pretrained Model for Speech Emotion
Recognition

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

20 July 2023

Papers citing "Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition"

16 / 16 papers shown

Title
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages Heqing Zou Fengmao Lv Desheng Zheng E. Chng D. Rajan 26 0 0 25 Mar 2025
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network Vrushank Ahire Kunal Shah Mudasir Nazir Khan Nikhil Pakhale L. Sookha M. A. Ganaie Abhinav Dhall 63 0 0 16 Mar 2025
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention Yuzhe Weng Haotian Wang Tian Gao Kewei Li Shutong Niu Jun Du 28 0 0 19 Oct 2024
End-to-End Integration of Speech Emotion Recognition with Voice Activity Detection using Self-Supervised Learning Features Natsuo Yamashita Masaaki Yamamoto Y. Kawaguchi 21 0 0 17 Oct 2024
Stimulus Modality Matters: Impact of Perceptual Evaluations from Different Modalities on Speech Emotion Recognition System Performance Huang-Cheng Chou Haibin Wu Chi-Chun Lee 24 0 0 16 Sep 2024
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers Ruchik Mishra Andrew Frye M. M. Rayguru Dan O. Popa 28 1 0 16 Sep 2024
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge Rui Liu Zening Ma SSL 29 1 0 10 Jun 2024
Adapting WavLM for Speech Emotion Recognition Daria Diatlova Anton Udalov Vitalii Shutov Egor Spirin 22 2 0 07 May 2024
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition Dongyuan Li Ying Zhang Yusong Wang Funakoshi Kataro Manabu Okumura 19 1 0 01 May 2024
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Zhihao Du Jiaming Wang Qian Chen Yunfei Chu Zhifu Gao ... Wen Wang Siqi Zheng Chang Zhou Zhijie Yan Shiliang Zhang LLMAG VLM AuLLM LM&MA 23 79 0 07 Oct 2023
Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition Dongyuan Li Yusong Wang Kotaro Funakoshi Manabu Okumura 20 3 0 30 Sep 2023
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Ziyang Ma Wen Wu Zhisheng Zheng Yiwei Guo Qian Chen Shiliang Zhang Xie Chen 8 14 0 19 Sep 2023
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization Xiaokang Zhao Qiu-shi Zhu Jie M. Zhang 17 4 0 28 Sep 2022
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 253 4,735 0 24 Feb 2021
LSSED: a large-scale dataset and benchmark for speech emotion recognition Weiquan Fan Xiangmin Xu Xiaofen Xing Weidong Chen Dongyan Huang 45 32 0 30 Jan 2021
Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation Alexander R. Fabbri Simeng Han Haoyuan Li Haoran Li Marjan Ghazvininejad Shafiq R. Joty Dragomir R. Radev Yashar Mehdad 116 93 0 24 Oct 2020