Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.15928
Cited By
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation
21 May 2025
Tony Montes
Fernando Lozano
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation"
3 / 3 papers shown
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLM
OffRL
VLM
LRM
734
8
0
06 Oct 2025
Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics
Saurav Jha
Stefan K. Ehrlich
LM&Ro
77
0
0
26 Sep 2025
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
Juhong Min
Shyamal Buch
Arsha Nagrani
Minsu Cho
Cordelia Schmid
LRM
409
62
0
09 Apr 2024
1