Audio-Language Datasets of Scenes and Events: A SurveyIEEE Access (IEEE Access), 2024 |
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio ReasoningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio
ClassificationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio CaptioningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic DataInternational Conference on Learning Representations (ICLR), 2024 |