Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2506.11155
Cited By
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
11 June 2025
Linhao Yu
Xinguang Ji
Yahui Liu
Fanheng Kong
Chenxi Sun
Jingyuan Zhang
Hongzhi Zhang
Victoria A. Webster-Wood
Fuzheng Zhang
Deyi Xiong
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search"
2 / 2 papers shown
Title
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models
Xiyang Wu
Zongxia Li
Jihui Jin
Guangyao Shi
Gouthaman KV
Vishnu Raj
Nilotpal Sinha
Jingxi Chen
Fan Du
Dinesh Manocha
104
0
0
23 Nov 2025
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLM
OffRL
VLM
LRM
582
8
0
06 Oct 2025
1