Title |
---|
![]() CLAIR-A: Leveraging Large Language Models to Judge Audio Captions Tsung-Han Wu Joseph E. Gonzalez Trevor Darrell David M. Chan |
![]() Qwen2-Audio Technical Report Yunfei Chu Jin Xu Qian Yang Haojie Wei Xipin Wei ...Yuanjun Lv Jinzheng He Junyang Lin Chang Zhou Jingren Zhou |
![]() FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion Zehan Wang Ziang Zhang Xize Cheng Rongjie Huang Luping Liu ...Haifeng Huang Yang Zhao Tao Jin Peng Gao Zhou Zhao |
![]() AIR-Bench: Benchmarking Large Audio-Language Models via Generative
Comprehension Qian Yang Jin Xu Wenrui Liu Yunfei Chu Ziyue Jiang ...Yichong Leng Yuanjun Lv Zhou Zhao Chang Zhou Jingren Zhou |