ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.17040
  4. Cited By
Multimodal Alignment and Fusion: A Survey
v1v2 (latest)

Multimodal Alignment and Fusion: A Survey

26 November 2024
Songtao Li
Hao Tang
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Multimodal Alignment and Fusion: A Survey"

21 / 21 papers shown
Title
PEAR: Planner-Executor Agent Robustness Benchmark
PEAR: Planner-Executor Agent Robustness Benchmark
Shen Dong
Mingxuan Zhang
Pengfei He
Li Ma
Bhavani Thuraisingham
Hui Liu
Yue Xing
LLMAG
223
0
0
08 Oct 2025
InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions
InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions
Junde Xu
Yapin Shi
Lijun Lang
Taoyong Cui
Z. Zhang
Guangyong Chen
Jiezhong Qiu
Pheng-Ann Heng
151
0
0
03 Oct 2025
Audio-Visual Separation with Hierarchical Fusion and Representation Alignment
Audio-Visual Separation with Hierarchical Fusion and Representation Alignment
Han Hu
Dongheng Lin
Qiming Huang
Yuqi Hou
Hyung Jin Chang
Jianbo Jiao
100
0
0
24 Sep 2025
Lost in Embeddings: Information Loss in Vision-Language Models
Lost in Embeddings: Information Loss in Vision-Language Models
Wenyan Li
Raphael Tang
Chengzu Li
Caiqi Zhang
Ivan Vulić
Anders Søgaard
VLM
119
5
0
15 Sep 2025
MVRS: The Multimodal Virtual Reality Stimuli-based Emotion Recognition Dataset
MVRS: The Multimodal Virtual Reality Stimuli-based Emotion Recognition Dataset
Seyed Muhammad Hossein Mousavi
Atiye Ilanloo
88
0
0
31 Aug 2025
Integrated Multivariate Segmentation Tree for the Analysis of Heterogeneous Credit Data in Small and Medium-Sized Enterprises
Integrated Multivariate Segmentation Tree for the Analysis of Heterogeneous Credit Data in Small and Medium-Sized Enterprises
Lu Han
Xiuying Wang
56
0
0
30 Aug 2025
Multimodal Data Storage and Retrieval for Embodied AI: A Survey
Multimodal Data Storage and Retrieval for Embodied AI: A Survey
Yihao Lu
Hao Tang
100
2
0
19 Aug 2025
MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention
MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention
Xin Du
Maoyuan Xu
Zhi Ying
100
0
0
13 Aug 2025
From Detection to Correction: Backdoor-Resilient Face Recognition via Vision-Language Trigger Detection and Noise-Based Neutralization
From Detection to Correction: Backdoor-Resilient Face Recognition via Vision-Language Trigger Detection and Noise-Based Neutralization
Farah Wahida
M. Chamikara
Yashothara Shanmugarasa
Mohan Baruwal Chhetri
Thilina Ranbaduge
Ibrahim Khalil
AAML
108
0
0
07 Aug 2025
Training-Free Multimodal Large Language Model Orchestration
Training-Free Multimodal Large Language Model Orchestration
Tianyu Xie
Yuhang Wu
Yongdong Luo
Jinfa Huang
Xiawu Zheng
124
0
0
06 Aug 2025
Explainability Through Systematicity: The Hard Systematicity Challenge for Artificial Intelligence
Explainability Through Systematicity: The Hard Systematicity Challenge for Artificial Intelligence
Matthieu Queloz
130
2
0
29 Jul 2025
M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models
M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models
Can Zheng
Jiguang He
Chung G. Kang
Guofa Cai
Zitong Yu
Merouane Debbah
MoE
140
4
0
17 Jun 2025
Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques
Jisu An
Junseok Lee
Jeoungeun Lee
Yongseok Son
412
2
0
05 Jun 2025
CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding
CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding
Wenxuan Ma
Xiaoge Cao
Yujiao Shi
Chaofan Zhang
Shaobo Yang
Peng Hao
Bin Fang
Yinghao Cai
Shaowei Cui
Shuo Wang
263
2
0
13 May 2025
Semantic-Space-Intervened Diffusive Alignment for Visual Classification
Semantic-Space-Intervened Diffusive Alignment for Visual ClassificationInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Zixuan Li
Lei Meng
Guoqing Chao
Wei Wu
Xiaoshuo Yan
Yimeng Yang
Zhuang Qi
Xiangxu Meng
DiffM
330
0
0
09 May 2025
A Review of 3D Object Detection with Vision-Language Models
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I. Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
341
5
0
25 Apr 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Philip S. Yu
Xuming Hu
Qingsong Wen
696
21
0
23 Mar 2025
Joint Fusion and Encoding: Advancing Multimodal Retrieval from the Ground Up
Joint Fusion and Encoding: Advancing Multimodal Retrieval from the Ground Up
Lang Huang
Qiyu Wu
Zhongtao Miao
T. Yamasaki
935
6
0
27 Feb 2025
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer InterfacesInformation Fusion (Inf. Fusion), 2025
Jing Wang
Weishan Ye
Jialin He
Li Zhang
G. Huang
Zhuliang Yu
Zhen Liang
280
2
0
26 Feb 2025
Large Language Models for Multi-Robot Systems: A Survey
Large Language Models for Multi-Robot Systems: A Survey
Peihan Li
Zijian An
Shams Abrar
Lifeng Zhou
LRMLM&Ro
493
27
0
06 Feb 2025
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Daniel Schwalbe-Koda
B. Selman
Qingsong Wen
LRM
506
26
0
05 Feb 2025
1