From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

26 January 2024
Chaochao Lu
Chao Qian
Guodong Zheng
Hongxing Fan
Hongzhi Gao
Jie Zhang
Jing Shao
Jingyi Deng
Jinlan Fu
Kexin Huang
Kunchang Li
Lijun Li
Limin Wang
Lu Sheng
Meiqiu Chen
Ming-bo Wen
Qibing Ren
SI-YIN Chen
Tao Gui
Wanli Ouyang
Yali Wang
Yan Teng
Yaru Wang
Yi Wang
Yinan He
Yingchun Wang
Yixu Wang
Yongting Zhang
Yu Qiao
Yujiong Shen
Yurong Mou
Yuxi Chen
Zaibin Zhang
Zhelun Shi
Zhen-fei Yin
Zhipin Wang
arXiv: 2401.15071 | HuggingFace: 38 upvotes

Papers citing "From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities"

14 / 14 papers shown
Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
Yuan Xiong
Ziqi Miao
Lijun Li
Chen Qian
Jie Li
Jing Shao
AAML
325
0
0
02 Dec 2025
Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models
Ziqi Miao
Lijun Li
Yuan Xiong
Zhenhua Liu
Pengyu Zhu
Jing Shao
AAML
235
6
0
07 Jul 2025
Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection
Ziqi Miao
Yi Ding
Lijun Li
Jing Shao
AAML
328
18
0
03 Jul 2025
Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning
Libo Wang
LRM
1.2K
5
0
07 Feb 2025
Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval
Libo Wang
1.3K
0
0
24 Jan 2025
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Junxiao Xue
Quan Deng
Fei Yu
Yanhao Wang
Jun Wang
Yongqian Li
VLM
326
14
0
31 Dec 2024
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey
Atsuyuki Miyai
Jingkang Yang
Jingyang Zhang
Yifei Ming
Sisir Dhakal
...
Yixuan Li
Hai "Helen" Li
Ziwei Liu
Toshihiko Yamasaki
Kiyoharu Aizawa
521
39
0
31 Jul 2024
Principled Understanding of Generalization for Generative Transformer Models in Arithmetic Reasoning Tasks
Xingcheng Xu
Zibo Zhao
Haipeng Zhang
Yanqing Yang
LRM
320
0
0
25 Jul 2024
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Neural Information Processing Systems (NeurIPS), 2024
Peng Xia
Ze Chen
Juanxi Tian
Yangrui Gong
Ruibo Hou
...
Jimeng Sun
Zongyuan Ge
Gang Li
James Zou
Huaxiu Yao
MU
VLM
320
74
0
10 Jun 2024
A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence
Ali Kashefi
VGen
386
9
0
24 May 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Yuexiang Zhai
Hao Bai
Zipeng Lin
Jiayi Pan
Shengbang Tong
...
Alane Suhr
Saining Xie
Yann LeCun
Yi-An Ma
Sergey Levine
LLMAG
LRM
435
169
0
16 May 2024
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Meiqi Chen
Yixin Cao
Yan Zhang
Chaochao Lu
602
41
0
27 Mar 2024
Assessment of Multimodal Large Language Models in Alignment with Human Values
Zhelun Shi
Zhipin Wang
Hongxing Fan
Zaibin Zhang
Lijun Li
Yongting Zhang
Zhen-fei Yin
Lu Sheng
Yu Qiao
Jing Shao
280
37
0
26 Mar 2024
Review of Generative AI Methods in Cybersecurity
Yagmur Yigit
William J. Buchanan
Madjid G Tehrani
Leandros A. Maglaras
AAML
518
46
0
13 Mar 2024