ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.11155
  4. Cited By
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search

Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search

Annual Meeting of the Association for Computational Linguistics (ACL), 2025
11 June 2025
Linhao Yu
Xinguang Ji
Yahui Liu
Fanheng Kong
Chenxi Sun
Jingyuan Zhang
Hongzhi Zhang
Victoria A. Webster-Wood
Fuzheng Zhang
Deyi Xiong
ArXiv (abs)PDFHTML

Papers citing "Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search"

2 / 2 papers shown
Title
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models
Xiyang Wu
Zongxia Li
Jihui Jin
Guangyao Shi
Gouthaman KV
Vishnu Raj
Nilotpal Sinha
Jingxi Chen
Fan Du
Dinesh Manocha
104
0
0
23 Nov 2025
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLMOffRLVLMLRM
582
8
0
06 Oct 2025
1