Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.14638
Cited By
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing
27 February 2023
Weidong Chen
Xiaofen Xing
Xiangmin Xu
Jianxin Pang
Lan Du
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing"
13 / 13 papers shown
Title
Enhancing Depression Detection via Question-wise Modality Fusion
Aishik Mandal
Dana Atzil-Slonim
Thamar Solorio
Iryna Gurevych
64
0
0
26 Mar 2025
Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
Heqing Zou
Fengmao Lv
Desheng Zheng
E. Chng
D. Rajan
34
0
0
25 Mar 2025
Temporal-Frequency State Space Duality: An Efficient Paradigm for Speech Emotion Recognition
Jiaqi Zhao
Fei Wang
Kun Li
Yanyan Wei
Shengeng Tang
Shu Zhao
Xiao Sun
Mamba
104
2
0
22 Dec 2024
An Empirical Analysis of Speech Self-Supervised Learning at Multiple Resolutions
Theo Clark
Benedetta Cevoli
Eloy de Jong
Timofey Abramski
Jamie Dougherty
SSL
36
0
0
31 Oct 2024
Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection
Yilin Pan
Yanpei Shi
Yijia Zhang
Mingyu Lu
26
0
0
09 Oct 2024
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech
Zhongren Dong
Zixing Zhang
Weixiang Xu
Jing Han
Jianjun Ou
Björn W. Schuller
35
1
0
07 May 2024
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition
Yong Wang
Cheng Lu
Hailun Lian
Yan Zhao
Bjorn Schuller
Yuan Zong
Wenming Zheng
21
10
0
19 Jan 2024
A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion
Zeinab Taghavi
Ali Satvaty
Hossein Sameti
17
4
0
21 Jul 2023
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
23
35
0
20 Jul 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
35
46
0
21 Mar 2023
Disentangling Prosody Representations with Unsupervised Speech Reconstruction
Leyuan Qu
Taiha Li
C. Weber
Theresa Pekarek-Rosin
F. Ren
S. Wermter
16
8
0
14 Dec 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
114
264
0
02 Feb 2022
Learning Fine-Grained Cross Modality Excitement for Speech Emotion Recognition
Hang Li
Wenbiao Ding
Zhongqin Wu
Zitao Liu
30
32
0
24 Oct 2020
1