Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.10757
Cited By
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
20 July 2023
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition"
16 / 16 papers shown
Title
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
Heqing Zou
Fengmao Lv
Desheng Zheng
E. Chng
D. Rajan
26
0
0
25 Mar 2025
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire
Kunal Shah
Mudasir Nazir Khan
Nikhil Pakhale
L. Sookha
M. A. Ganaie
Abhinav Dhall
63
0
0
16 Mar 2025
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention
Yuzhe Weng
Haotian Wang
Tian Gao
Kewei Li
Shutong Niu
Jun Du
28
0
0
19 Oct 2024
End-to-End Integration of Speech Emotion Recognition with Voice Activity Detection using Self-Supervised Learning Features
Natsuo Yamashita
Masaaki Yamamoto
Y. Kawaguchi
21
0
0
17 Oct 2024
Stimulus Modality Matters: Impact of Perceptual Evaluations from Different Modalities on Speech Emotion Recognition System Performance
Huang-Cheng Chou
Haibin Wu
Chi-Chun Lee
24
0
0
16 Sep 2024
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers
Ruchik Mishra
Andrew Frye
M. M. Rayguru
Dan O. Popa
28
1
0
16 Sep 2024
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge
Rui Liu
Zening Ma
SSL
29
1
0
10 Jun 2024
Adapting WavLM for Speech Emotion Recognition
Daria Diatlova
Anton Udalov
Vitalii Shutov
Egor Spirin
22
2
0
07 May 2024
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
Dongyuan Li
Ying Zhang
Yusong Wang
Funakoshi Kataro
Manabu Okumura
19
1
0
01 May 2024
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Zhihao Du
Jiaming Wang
Qian Chen
Yunfei Chu
Zhifu Gao
...
Wen Wang
Siqi Zheng
Chang Zhou
Zhijie Yan
Shiliang Zhang
LLMAG
VLM
AuLLM
LM&MA
23
79
0
07 Oct 2023
Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition
Dongyuan Li
Yusong Wang
Kotaro Funakoshi
Manabu Okumura
20
3
0
30 Sep 2023
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Ziyang Ma
Wen Wu
Zhisheng Zheng
Yiwei Guo
Qian Chen
Shiliang Zhang
Xie Chen
8
14
0
19 Sep 2023
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie M. Zhang
17
4
0
28 Sep 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
LSSED: a large-scale dataset and benchmark for speech emotion recognition
Weiquan Fan
Xiangmin Xu
Xiaofen Xing
Weidong Chen
Dongyan Huang
45
32
0
30 Jan 2021
Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation
Alexander R. Fabbri
Simeng Han
Haoyuan Li
Haoran Li
Marjan Ghazvininejad
Shafiq R. Joty
Dragomir R. Radev
Yashar Mehdad
116
93
0
24 Oct 2020
1