2501.06761
Cited By
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2025
12 January 2025
Ji Soo Lee
Jongha Kim
Jeehye Na
Jinyoung Park
H. Kim
VGen
Papers citing "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning"
6 / 6 papers shown
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLM
OffRL
VLM
LRM
744
8
0
06 Oct 2025
Captioning for Text-Video Retrieval via Dual-Group Direct Preference Optimization
Ji Soo Lee
Byungoh Ko
Jaewon Cho
Howoong Lee
Jaewoon Byun
Hyunwoo J. Kim
196
1
0
20 Sep 2025
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi
S. Lee
Byungoh Ko
Eunseo Kim
Jihyung Kil
Hyunwoo J. Kim
190
0
0
01 Aug 2025
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Dohwan Ko
Ji Soo Lee
M. Choi
Zihang Meng
Hyunwoo J. Kim
374
1
0
31 Jul 2025
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO
Jinyoung Park
Jeehye Na
Jinyoung Kim
H. Kim
OffRL
358
22
0
09 Jun 2025
Time Blindness: Why Video-Language Models Can't See What Humans Can?
Ujjwal Upadhyay
Mukul Ranjan
Zhiqiang Shen
Mohamed Elhoseiny
VLM
214
3
0
30 May 2025