Enriching Music Descriptions with a Finetuned-LLM and Metadata for
Text-to-Music RetrievalIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Multimodal Modeling For Spoken Language IdentificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Shikhar Bharadwaj Min Ma Shikhar Vashishth Ankur Bapna Sriram Ganapathy ...Yu Zhang D. Esch Sandy Ritchie Partha P. Talukdar Jason Riesa |