ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.02871
  4. Cited By
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

5 February 2025
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Daniel Schwalbe-Koda
B. Selman
Qingsong Wen
    LRM
ArXiv (abs)PDFHTML

Papers citing "Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning"

50 / 168 papers shown
Title
Parameter Importance-Driven Continual Learning for Foundation Models
Parameter Importance-Driven Continual Learning for Foundation Models
LingXiang Wang
Hainan Zhang
Zhiming Zheng
KELMCLL
345
0
0
19 Nov 2025
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo
Utkarsh Tyagi
Advait Gosai
Paula Vergara
Ernesto Gabriel Hernández Montoya
...
Bin Hu
Yunzhong He
Bing Liu
Bing Liu
Rakshith S Srinivasa
VLMLRM
248
1
0
14 Oct 2025
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning
Yibo Yan
Guangwei Xu
Xin Zou
Shuliang Liu
James Kwok
Xuming Hu
132
4
0
28 Sep 2025
AIssistant: An Agentic Approach for Human--AI Collaborative Scientific Work on Reviews and Perspectives in Machine Learning
AIssistant: An Agentic Approach for Human--AI Collaborative Scientific Work on Reviews and Perspectives in Machine Learning
Sasi Kiran Gaddipati
Farhana Keya
Gollam Rabby
Sören Auer
72
0
0
14 Sep 2025
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li
Yali Fu
Li Fan
Jiahong Liu
Yao Shu
Chengwei Qin
Menglin Yang
Irwin King
Rex Ying
OffRLLRMAI4CE
147
10
0
02 Sep 2025
Robust Diagram Reasoning: A Framework for Enhancing LVLM Performance on Visually Perturbed Scientific Diagrams
Robust Diagram Reasoning: A Framework for Enhancing LVLM Performance on Visually Perturbed Scientific Diagrams
Minghao Zhou
Rafael Souza
Yaqian Hu
Luming Che
56
0
0
23 Aug 2025
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning
Jianghangfan Zhang
Yibo Yan
Kening Zheng
Xin Zou
Song Dai
Xuming Hu
LRM
180
3
0
06 Aug 2025
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou
Yin Xu
Yingtao Zhu
Chuxi Xiao
Haiyang Bian
Lei Wei
Xuegong Zhang
LM&MAVLM
179
3
0
30 May 2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Junyan Zhang
Yubo Gao
Yibo Yan
Jia-Chen Gu
Zhaorui Hou
...
Qi Zheng
Song Dai
Yonghua Hei
Junzhuo Li
Xuming Hu
183
1
0
27 May 2025
NeSyGeo: A Neuro-Symbolic Framework for Multimodal Geometric Reasoning Data Generation
NeSyGeo: A Neuro-Symbolic Framework for Multimodal Geometric Reasoning Data Generation
Weiming Wu
Zi-kang Wang
Jin Ye
Zhi Zhou
Yu-Feng Li
Lan-Zhe Guo
LRM
212
0
0
21 May 2025
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
Song Dai
Yibo Yan
Jiamin Su
Dongfang Zihao
Yubo Gao
...
Jia-Chen Gu
Junyan Zhang
Sicheng Tao
Zhuoran Gao
Xuming Hu
LRMAI4CE
246
4
0
21 May 2025
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
Haoming Huang
Yibo Yan
Jiahao Huo
Xin Zou
Xinfeng Li
Kun Wang
Xuming Hu
421
1
0
20 May 2025
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
Jiamin Su
Yibo Yan
Zhuoran Gao
Han Zhang
Xiang Liu
Xuming Hu
254
2
0
20 May 2025
Reimagining Urban Science: Scaling Causal Inference with Large Language Models
Reimagining Urban Science: Scaling Causal Inference with Large Language Models
Yutong Xia
Ao Qu
Yunhan Zheng
Yihong Tang
Dingyi Zhuang
...
Cathy Wu
Roger Zimmermann
Lijun Sun
Roger Zimmermann
Jinhua Zhao
AI4CE
814
2
0
15 Apr 2025
DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos
DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos
Yunming Liang
Zihao Chen
Chaofan Ding
Xinhan Di
DiffMVGen
283
3
0
28 Mar 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Philip S. Yu
Xuming Hu
Qingsong Wen
620
20
0
23 Mar 2025
Rolling Forward: Enhancing LightGCN with Causal Graph Convolution for Credit Bond Recommendation
Rolling Forward: Enhancing LightGCN with Causal Graph Convolution for Credit Bond RecommendationInternational Conference on AI in Finance (ICAF), 2024
Ashraf Ghiye
Baptiste Barreau
Laurent Carlier
Michalis Vazirgiannis
215
9
0
18 Mar 2025
EscapeCraft: A 3D Room Escape Environment for Benchmarking Complex Multimodal Reasoning Ability
EscapeCraft: A 3D Room Escape Environment for Benchmarking Complex Multimodal Reasoning Ability
Xiping Hu
Yurui Dong
Ziyue Wang
Minyuan Ruan
Zhili Cheng
Chong Chen
Ziwei Sun
Yang Liu
LRM
510
1
0
13 Mar 2025
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Jingheng Ye
Shang Qin
Hai-Tao Zheng
Hai-Tao Zheng
Shen Wang
Qingsong Wen
218
1
0
24 Feb 2025
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine UnlearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Junkai Chen
Zhijie Deng
Kening Zheng
Yibo Yan
Qi Zheng
PeiJun Wu
Peijie Jiang
Qingbin Liu
Xuming Hu
MU
398
16
0
18 Feb 2025
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models
Kristian González Barman
S. Caron
Emily Sullivan
H. Regt
R. D. Austri
...
Sydney Otten
Pawel Pawlowski
Pietro Vischia
Erik Weber
Christoph Weniger
AI4CE
138
11
0
10 Jan 2025
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Yunzhuo Hao
Jiawei Gu
Huichen Will Wang
Linjie Li
Zhiyong Yang
Lijuan Wang
Yu Cheng
LRM
286
83
0
10 Jan 2025
Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches
A. Mumuni
F. Mumuni
AI4CELRMELM
201
14
0
06 Jan 2025
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Mingyang Song
Zhaochen Su
Xiaoye Qu
Jiawei Zhou
Yu Cheng
LRM
530
63
0
06 Jan 2025
ChemDFM-X: Towards Large Multimodal Model for Chemistry
ChemDFM-X: Towards Large Multimodal Model for ChemistryScience China Information Sciences (Sci. China Inf. Sci.), 2024
Zihan Zhao
B. Chen
Jingpiao Li
Lu Chen
Liyang Wen
...
Ziping Wan
Yansi Li
Zhongyang Dai
Xin Chen
Kai Yu
AI4CE
422
21
0
03 Jan 2025
Survey of Large Multimodal Model Datasets, Application Categories and
  Taxonomy
Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy
Priyaranjan Pattnayak
Hitesh Laxmichand Patel
Bhargava Kumar
Amit Agarwal
Ishan Banerjee
Srikant Panda
Tejaswini Kumar
VLM
145
13
0
23 Dec 2024
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in
  LLM-Powered Error Detector for Math Word Problem Solutions
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem SolutionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Hang Li
Tianlong Xu
Kaiqi Yang
Yucheng Chu
Yanling Chen
Yichi Song
Qingsong Wen
Hui Liu
237
4
0
22 Dec 2024
Towards Scientific Discovery with Generative AI: Progress,
  Opportunities, and Challenges
Towards Scientific Discovery with Generative AI: Progress, Opportunities, and ChallengesAAAI Conference on Artificial Intelligence (AAAI), 2024
Chandan K. Reddy
Parshin Shojaee
287
24
0
16 Dec 2024
ProcessBench: Identifying Process Errors in Mathematical Reasoning
ProcessBench: Identifying Process Errors in Mathematical ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Chujie Zheng
Zizhuo Zhang
Beichen Zhang
Runji Lin
Keming Lu
Bowen Yu
Dayiheng Liu
Jingren Zhou
Junyang Lin
LRM
532
150
0
09 Dec 2024
Explainable and Interpretable Multimodal Large Language Models: A
  Comprehensive Survey
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
365
47
0
03 Dec 2024
Improving Physics Reasoning in Large Language Models Using Mixture of
  Refinement Agents
Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents
Raj Jaiswal
Dhruv Jain
Harsh Parimal Popat
Avinash Anand
Abhishek Dharmadhikari
Atharva Marathe
R. Shah
LRMAI4CE
243
11
0
01 Dec 2024
Multimodal Alignment and Fusion: A Survey
Multimodal Alignment and Fusion: A Survey
Songtao Li
Hao Tang
OffRL
175
37
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language
  Model
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yujiao Shi
Min Li
282
2
0
23 Nov 2024
Evaluating the Robustness of Analogical Reasoning in Large Language
  Models
Evaluating the Robustness of Analogical Reasoning in Large Language Models
Martha Lewis
Melanie Mitchell
ELM
183
21
0
21 Nov 2024
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Yunkai Dang
Mengxi Gao
Yibo Yan
Xin Zou
Yanggan Gu
...
Jingyu Wang
Peijie Jiang
Aiwei Liu
Jia Liu
Xuming Hu
279
10
0
05 Nov 2024
Improving Scientific Hypothesis Generation with Knowledge Grounded Large
  Language Models
Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models
Guangzhi Xiong
Eric Xie
Amir Hassan Shariatmadari
Sikun Guo
Stefan Bekiranov
Aidong Zhang
LRMHILM
168
21
0
04 Nov 2024
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesInternational Conference on Learning Representations (ICLR), 2024
Zonglin Yang
Wanhao Liu
Ben Gao
Tong Xie
You Li
Wanli Ouyang
Soujanya Poria
Xiaoshi Zhong
Dongzhan Zhou
LRM
458
43
0
09 Oct 2024
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large
  Language Models via Deciphering Attention Causality
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityInternational Conference on Learning Representations (ICLR), 2024
Guanyu Zhou
Yibo Yan
Xin Zou
Kun Wang
Aiwei Liu
Xuming Hu
160
21
0
07 Oct 2024
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Iman Mirzadeh
Keivan Alizadeh
Hooman Shahrokhi
Oncel Tuzel
Samy Bengio
Mehrdad Farajtabar
AIMatLRM
429
385
0
07 Oct 2024
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Xin Zou
Yizhou Wang
Yibo Yan
Yuanhuiyi Lyu
Kening Zheng
...
Junkai Chen
Peijie Jiang
Qingbin Liu
Chang Tang
Xuming Hu
352
23
0
04 Oct 2024
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level
  Mathematical Reasoning
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Di Zhang
Jianbo Wu
Jingdi Lei
Tong Che
Jiatong Li
...
Shufei Zhang
Marco Pavone
Yuqiang Li
Wanli Ouyang
Dongzhan Zhou
LRM
163
88
0
03 Oct 2024
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
A Survey on Multimodal Benchmarks: In the Era of Large AI Models
Lin Li
Guikun Chen
Hanrong Shi
Jun Xiao
Long Chen
299
23
0
21 Sep 2024
Qwen2.5-Coder Technical Report
Qwen2.5-Coder Technical Report
Binyuan Hui
Jian Yang
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
...
Fei Huang
Xingzhang Ren
Xuancheng Ren
Jingren Zhou
Junyang Lin
OSLM
287
752
0
18 Sep 2024
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via
  Self-Improvement
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
An Yang
Beichen Zhang
Binyuan Hui
Bofei Gao
Bowen Yu
...
Mingfeng Xue
Runji Lin
Tianyu Liu
Xingzhang Ren
Zhenru Zhang
OSLMLRM
283
649
0
18 Sep 2024
AI-Driven Virtual Teacher for Enhanced Educational Efficiency:
  Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and CorrectionAAAI Conference on Artificial Intelligence (AAAI), 2024
Tianlong Xu
Yi-Fan Zhang
Zhendong Chu
Shen Wang
Qingsong Wen
149
14
0
14 Sep 2024
Language agents achieve superhuman synthesis of scientific knowledge
Language agents achieve superhuman synthesis of scientific knowledge
Michael D. Skarlinski
Sam Cox
Jon M. Laurent
James D. Braza
Michaela M. Hinks
M. Hammerling
Manvitha Ponnapati
Samuel G. Rodriques
Andrew D. White
ELMHILMALM
344
81
0
10 Sep 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large
  Language Model
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Zhen Yang
Jinhao Chen
Zhengxiao Du
Wenmeng Yu
Weihan Wang
Wenyi Hong
Zhihuan Jiang
Bin Xu
Yuxiao Dong
Jie Tang
VLMLRM
139
14
0
10 Sep 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual
  Communication
An Investigation of Warning Erroneous Chat Translations in Cross-lingual CommunicationInternational Joint Conference on Natural Language Processing (IJCNLP), 2024
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
213
25
0
28 Aug 2024
GeoReasoner: Reasoning On Geospatially Grounded Context For Natural
  Language Understanding
GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language UnderstandingInternational Conference on Information and Knowledge Management (CIKM), 2024
Yibo Yan
Joey Lee
LRM
187
10
0
21 Aug 2024
BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Yifei Yang
Runhan Shi
Zuchao Li
Shu Jiang
Bao-Liang Lu
Yang Yang
Hai Zhao
206
8
0
19 Aug 2024
1234
Next