MQAD: A Large-Scale Question Answering Dataset for Training Music Large Language ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 |
HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMsEuropean Workshop on Visual Information Processing (EUVIP), 2025 |
Describe What You See with Multimodal Large Language Models to Enhance Video RecommendationsACM Conference on Recommender Systems (RecSys), 2025 |