ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.02871
  4. Cited By
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

5 February 2025
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Daniel Schwalbe-Koda
B. Selman
Qingsong Wen
    LRM
ArXiv (abs)PDFHTML

Papers citing "Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning"

50 / 168 papers shown
Parameter Importance-Driven Continual Learning for Foundation Models
Parameter Importance-Driven Continual Learning for Foundation Models
LingXiang Wang
Hainan Zhang
Zhiming Zheng
KELMCLL
483
0
0
19 Nov 2025
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo
Utkarsh Tyagi
Advait Gosai
Paula Vergara
Ernesto Gabriel Hernández Montoya
...
Bin Hu
Yunzhong He
Bing Liu
Bing Liu
Rakshith S Srinivasa
VLMLRM
325
3
0
14 Oct 2025
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan
Guangwei Xu
Xin Zou
Shuliang Liu
James Kwok
Xuming Hu
190
5
0
28 Sep 2025
AIssistant: An Agentic Approach for Human--AI Collaborative Scientific Work on Reviews and Perspectives in Machine Learning
AIssistant: An Agentic Approach for Human--AI Collaborative Scientific Work on Reviews and Perspectives in Machine Learning
Sasi Kiran Gaddipati
Farhana Keya
Gollam Rabby
Sören Auer
109
0
0
14 Sep 2025
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li
Yali Fu
Li Fan
Jiahong Liu
Yao Shu
Chengwei Qin
Menglin Yang
Irwin King
Rex Ying
OffRLLRMAI4CE
218
14
0
02 Sep 2025
Robust Diagram Reasoning: A Framework for Enhancing LVLM Performance on Visually Perturbed Scientific Diagrams
Robust Diagram Reasoning: A Framework for Enhancing LVLM Performance on Visually Perturbed Scientific Diagrams
Minghao Zhou
Rafael Souza
Yaqian Hu
Luming Che
102
0
0
23 Aug 2025
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
Jianghangfan Zhang
Yibo Yan
Kening Zheng
Xin Zou
Song Dai
Xuming Hu
LRM
274
4
0
06 Aug 2025
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou
Yin Xu
Yingtao Zhu
Chuxi Xiao
Haiyang Bian
Lei Wei
Xuegong Zhang
LM&MAVLM
212
5
0
30 May 2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Junyan Zhang
Yubo Gao
Yibo Yan
Jia-Chen Gu
Zhaorui Hou
...
Qi Zheng
Song Dai
Yonghua Hei
Junzhuo Li
Xuming Hu
212
3
0
27 May 2025
NeSyGeo: A Neuro-Symbolic Framework for Multimodal Geometric Reasoning Data Generation
NeSyGeo: A Neuro-Symbolic Framework for Multimodal Geometric Reasoning Data Generation
Weiming Wu
Zi-kang Wang
Jin Ye
Zhi Zhou
Yu-Feng Li
Lan-Zhe Guo
LRM
288
0
0
21 May 2025
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
Song Dai
Yibo Yan
Jiamin Su
Dongfang Zihao
Yubo Gao
...
Jia-Chen Gu
Junyan Zhang
Sicheng Tao
Zhuoran Gao
Xuming Hu
LRMAI4CE
286
4
0
21 May 2025
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
Haoming Huang
Yibo Yan
Jiahao Huo
Xin Zou
Xinfeng Li
Kun Wang
Xuming Hu
582
1
0
20 May 2025
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
Jiamin Su
Yibo Yan
Zhuoran Gao
Han Zhang
Xiang Liu
Xuming Hu
327
4
0
20 May 2025
Reimagining Urban Science: Scaling Causal Inference with Large Language Models
Reimagining Urban Science: Scaling Causal Inference with Large Language Models
Yutong Xia
Ao Qu
Yunhan Zheng
Yihong Tang
Dingyi Zhuang
...
Cathy Wu
Roger Zimmermann
Lijun Sun
Roger Zimmermann
Jinhua Zhao
AI4CE
953
2
0
15 Apr 2025
DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos
DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos
Yunming Liang
Zihao Chen
Chaofan Ding
Xinhan Di
DiffMVGen
373
3
0
28 Mar 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Philip S. Yu
Xuming Hu
Qingsong Wen
713
23
0
23 Mar 2025
Rolling Forward: Enhancing LightGCN with Causal Graph Convolution for Credit Bond Recommendation
Rolling Forward: Enhancing LightGCN with Causal Graph Convolution for Credit Bond RecommendationInternational Conference on AI in Finance (ICAF), 2024
Ashraf Ghiye
Baptiste Barreau
Laurent Carlier
Michalis Vazirgiannis
260
9
0
18 Mar 2025
EscapeCraft: A 3D Room Escape Environment for Benchmarking Complex Multimodal Reasoning Ability
EscapeCraft: A 3D Room Escape Environment for Benchmarking Complex Multimodal Reasoning Ability
Xiping Hu
Yurui Dong
Ziyue Wang
Minyuan Ruan
Zhili Cheng
Chong Chen
Ziwei Sun
Yang Liu
LRM
626
1
0
13 Mar 2025
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Jingheng Ye
Shang Qin
Hai-Tao Zheng
Hai-Tao Zheng
Shen Wang
Qingsong Wen
270
1
0
24 Feb 2025
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine UnlearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Junkai Chen
Zhijie Deng
Kening Zheng
Yibo Yan
Qi Zheng
PeiJun Wu
Peijie Jiang
Qingbin Liu
Xuming Hu
MU
538
19
0
18 Feb 2025
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models
Kristian González Barman
S. Caron
Emily Sullivan
H. Regt
R. D. Austri
...
Sydney Otten
Pawel Pawlowski
Pietro Vischia
Erik Weber
Christoph Weniger
AI4CE
212
11
0
10 Jan 2025
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Yunzhuo Hao
Jiawei Gu
Huichen Will Wang
Linjie Li
Zhiyong Yang
Lijuan Wang
Yu Cheng
LRM
350
91
0
10 Jan 2025
Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches
A. Mumuni
F. Mumuni
AI4CELRMELM
237
14
0
06 Jan 2025
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Mingyang Song
Zhaochen Su
Xiaoye Qu
Jiawei Zhou
Yu Cheng
LRM
655
64
0
06 Jan 2025
ChemDFM-X: Towards Large Multimodal Model for Chemistry
ChemDFM-X: Towards Large Multimodal Model for ChemistryScience China Information Sciences (Sci. China Inf. Sci.), 2024
Zihan Zhao
B. Chen
Jingpiao Li
Lu Chen
Liyang Wen
...
Ziping Wan
Yansi Li
Zhongyang Dai
Xin Chen
Kai Yu
AI4CE
496
22
0
03 Jan 2025
Survey of Large Multimodal Model Datasets, Application Categories and
  Taxonomy
Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy
Priyaranjan Pattnayak
Hitesh Laxmichand Patel
Bhargava Kumar
Amit Agarwal
Ishan Banerjee
Srikant Panda
Tejaswini Kumar
VLM
170
14
0
23 Dec 2024
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in
  LLM-Powered Error Detector for Math Word Problem Solutions
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem SolutionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Hang Li
Tianlong Xu
Kaiqi Yang
Yucheng Chu
Yanling Chen
Yichi Song
Qingsong Wen
Hui Liu
279
5
0
22 Dec 2024
Towards Scientific Discovery with Generative AI: Progress,
  Opportunities, and Challenges
Towards Scientific Discovery with Generative AI: Progress, Opportunities, and ChallengesAAAI Conference on Artificial Intelligence (AAAI), 2024
Chandan K. Reddy
Parshin Shojaee
327
27
0
16 Dec 2024
ProcessBench: Identifying Process Errors in Mathematical Reasoning
ProcessBench: Identifying Process Errors in Mathematical ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Chujie Zheng
Zizhuo Zhang
Beichen Zhang
Runji Lin
Keming Lu
Bowen Yu
Dayiheng Liu
Jingren Zhou
Junyang Lin
LRM
664
159
0
09 Dec 2024
Explainable and Interpretable Multimodal Large Language Models: A
  Comprehensive Survey
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
426
51
0
03 Dec 2024
Improving Physics Reasoning in Large Language Models Using Mixture of
  Refinement Agents
Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents
Raj Jaiswal
Dhruv Jain
Harsh Parimal Popat
Avinash Anand
Abhishek Dharmadhikari
Atharva Marathe
R. Shah
LRMAI4CE
289
11
0
01 Dec 2024
Multimodal Alignment and Fusion: A Survey
Multimodal Alignment and Fusion: A Survey
Songtao Li
Hao Tang
OffRL
227
37
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language
  Model
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yujiao Shi
Min Li
362
3
0
23 Nov 2024
Evaluating the Robustness of Analogical Reasoning in Large Language
  Models
Evaluating the Robustness of Analogical Reasoning in Large Language Models
Martha Lewis
Melanie Mitchell
ELM
237
23
0
21 Nov 2024
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Yunkai Dang
Mengxi Gao
Yibo Yan
Xin Zou
Yanggan Gu
...
Jingyu Wang
Peijie Jiang
Aiwei Liu
Jia Liu
Xuming Hu
351
11
0
05 Nov 2024
Improving Scientific Hypothesis Generation with Knowledge Grounded Large
  Language Models
Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models
Guangzhi Xiong
Eric Xie
Amir Hassan Shariatmadari
Sikun Guo
Stefan Bekiranov
Aidong Zhang
LRMHILM
202
20
0
04 Nov 2024
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesInternational Conference on Learning Representations (ICLR), 2024
Zonglin Yang
Wanhao Liu
Ben Gao
Tong Xie
You Li
Wanli Ouyang
Soujanya Poria
Xiaoshi Zhong
Dongzhan Zhou
LRM
565
45
0
09 Oct 2024
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large
  Language Models via Deciphering Attention Causality
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityInternational Conference on Learning Representations (ICLR), 2024
Guanyu Zhou
Yibo Yan
Xin Zou
Kun Wang
Aiwei Liu
Xuming Hu
230
22
0
07 Oct 2024
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Iman Mirzadeh
Keivan Alizadeh
Hooman Shahrokhi
Oncel Tuzel
Samy Bengio
Mehrdad Farajtabar
AIMatLRM
496
415
0
07 Oct 2024
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Xin Zou
Yizhou Wang
Yibo Yan
Yuanhuiyi Lyu
Kening Zheng
...
Junkai Chen
Peijie Jiang
Qingbin Liu
Chang Tang
Xuming Hu
461
28
0
04 Oct 2024
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level
  Mathematical Reasoning
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Di Zhang
Jianbo Wu
Jingdi Lei
Tong Che
Jiatong Li
...
Shufei Zhang
Marco Pavone
Yuqiang Li
Wanli Ouyang
Dongzhan Zhou
LRM
260
89
0
03 Oct 2024
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
Lin Li
Guikun Chen
Hanrong Shi
Jun Xiao
Long Chen
346
23
0
21 Sep 2024
Qwen2.5-Coder Technical Report
Qwen2.5-Coder Technical Report
Binyuan Hui
Jian Yang
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
...
Fei Huang
Xingzhang Ren
Xuancheng Ren
Jingren Zhou
Junyang Lin
OSLM
335
828
0
18 Sep 2024
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via
  Self-Improvement
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
An Yang
Beichen Zhang
Binyuan Hui
Bofei Gao
Bowen Yu
...
Mingfeng Xue
Runji Lin
Tianyu Liu
Xingzhang Ren
Zhenru Zhang
OSLMLRM
463
689
0
18 Sep 2024
AI-Driven Virtual Teacher for Enhanced Educational Efficiency:
  Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and CorrectionAAAI Conference on Artificial Intelligence (AAAI), 2024
Tianlong Xu
Yi-Fan Zhang
Zhendong Chu
Shen Wang
Qingsong Wen
170
15
0
14 Sep 2024
Language agents achieve superhuman synthesis of scientific knowledge
Language agents achieve superhuman synthesis of scientific knowledge
Michael D. Skarlinski
Sam Cox
Jon M. Laurent
James D. Braza
Michaela M. Hinks
M. Hammerling
Manvitha Ponnapati
Samuel G. Rodriques
Andrew D. White
ELMHILMALM
443
89
0
10 Sep 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large
  Language Model
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Zhen Yang
Jinhao Chen
Zhengxiao Du
Wenmeng Yu
Weihan Wang
Wenyi Hong
Zhihuan Jiang
Bin Xu
Yuxiao Dong
Jie Tang
VLMLRM
189
15
0
10 Sep 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual
  Communication
An Investigation of Warning Erroneous Chat Translations in Cross-lingual CommunicationInternational Joint Conference on Natural Language Processing (IJCNLP), 2024
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
259
27
0
28 Aug 2024
GeoReasoner: Reasoning On Geospatially Grounded Context For Natural
  Language Understanding
GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language UnderstandingInternational Conference on Information and Knowledge Management (CIKM), 2024
Yibo Yan
Joey Lee
LRM
263
11
0
21 Aug 2024
BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Yifei Yang
Runhan Shi
Zuchao Li
Shu Jiang
Bao-Liang Lu
Yang Yang
Hai Zhao
250
8
0
19 Aug 2024
1234
Next