
Title |
|---|
Audio-Language Datasets of Scenes and Events: A Survey. IEEE Access, 2024 |
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data. International Conference on Learning Representations (ICLR), 2024 |