Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision TransformersIEEE Robotics and Automation Letters (RA-L), 2024 |
Cascaded Cross-Modal Transformer for Audio-Textual ClassificationArtificial Intelligence Review (Artif Intell Rev), 2024 |
Cascaded Cross-Modal Transformer for Request and Complaint DetectionACM Multimedia (ACM MM), 2023 |
SemanticAC: Semantics-Assisted Framework for Audio ClassificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
Learning Rate CurriculumInternational Journal of Computer Vision (IJCV), 2022 |