CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux ModellingInternational Conference on Learning Representations (ICLR), 2024 |
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024 |
Networking Systems for Video Anomaly Detection: A Tutorial and SurveyACM Computing Surveys (ACM CSUR), 2024 |
A Survey on Quality Metrics for Text-to-Image GenerationIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024 |
Materials science in the era of large language models: a perspectiveDigital Discovery (DD), 2024 |
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other
ModalitiesComputer Vision and Pattern Recognition (CVPR), 2024 |
Cascaded Cross-Modal Transformer for Audio-Textual ClassificationArtificial Intelligence Review (Artif Intell Rev), 2024 |