ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.10757
  4. Cited By
Vesper: A Compact and Effective Pretrained Model for Speech Emotion
  Recognition

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

20 July 2023
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
    VLM
ArXivPDFHTML

Papers citing "Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition"

16 / 16 papers shown
Title
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
Heqing Zou
Fengmao Lv
Desheng Zheng
E. Chng
D. Rajan
26
0
0
25 Mar 2025
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire
Kunal Shah
Mudasir Nazir Khan
Nikhil Pakhale
L. Sookha
M. A. Ganaie
Abhinav Dhall
63
0
0
16 Mar 2025
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention
Yuzhe Weng
Haotian Wang
Tian Gao
Kewei Li
Shutong Niu
Jun Du
28
0
0
19 Oct 2024
End-to-End Integration of Speech Emotion Recognition with Voice Activity
  Detection using Self-Supervised Learning Features
End-to-End Integration of Speech Emotion Recognition with Voice Activity Detection using Self-Supervised Learning Features
Natsuo Yamashita
Masaaki Yamamoto
Y. Kawaguchi
21
0
0
17 Oct 2024
Stimulus Modality Matters: Impact of Perceptual Evaluations from
  Different Modalities on Speech Emotion Recognition System Performance
Stimulus Modality Matters: Impact of Perceptual Evaluations from Different Modalities on Speech Emotion Recognition System Performance
Huang-Cheng Chou
Haibin Wu
Chi-Chun Lee
24
0
0
16 Sep 2024
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers
Ruchik Mishra
Andrew Frye
M. M. Rayguru
Dan O. Popa
28
1
0
16 Sep 2024
Emotion-Aware Speech Self-Supervised Representation Learning with
  Intensity Knowledge
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge
Rui Liu
Zening Ma
SSL
29
1
0
10 Jun 2024
Adapting WavLM for Speech Emotion Recognition
Adapting WavLM for Speech Emotion Recognition
Daria Diatlova
Anton Udalov
Vitalii Shutov
Egor Spirin
22
2
0
07 May 2024
Active Learning with Task Adaptation Pre-training for Speech Emotion
  Recognition
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
Dongyuan Li
Ying Zhang
Yusong Wang
Funakoshi Kataro
Manabu Okumura
19
1
0
01 May 2024
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
Zhihao Du
Jiaming Wang
Qian Chen
Yunfei Chu
Zhifu Gao
...
Wen Wang
Siqi Zheng
Chang Zhou
Zhijie Yan
Shiliang Zhang
LLMAG
VLM
AuLLM
LM&MA
23
79
0
07 Oct 2023
Active Learning Based Fine-Tuning Framework for Speech Emotion
  Recognition
Active Learning Based Fine-Tuning Framework for Speech Emotion Recognition
Dongyuan Li
Yusong Wang
Kotaro Funakoshi
Manabu Okumura
20
3
0
30 Sep 2023
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion
  Recognition
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Ziyang Ma
Wen Wu
Zhisheng Zheng
Yiwei Guo
Qian Chen
Shiliang Zhang
Xie Chen
8
14
0
19 Sep 2023
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector
  Quantization
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie M. Zhang
17
4
0
28 Sep 2022
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
LSSED: a large-scale dataset and benchmark for speech emotion
  recognition
LSSED: a large-scale dataset and benchmark for speech emotion recognition
Weiquan Fan
Xiangmin Xu
Xiaofen Xing
Weidong Chen
Dongyan Huang
45
32
0
30 Jan 2021
Improving Zero and Few-Shot Abstractive Summarization with Intermediate
  Fine-tuning and Data Augmentation
Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation
Alexander R. Fabbri
Simeng Han
Haoyuan Li
Haoran Li
Marjan Ghazvininejad
Shafiq R. Joty
Dragomir R. Radev
Yashar Mehdad
116
93
0
24 Oct 2020
1