VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

AAAI Conference on Artificial Intelligence (AAAI), 2025

12 January 2025

Papers citing "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning"

6 / 6 papers shown

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

...

744

06 Oct 2025

Captioning for Text-Video Retrieval via Dual-Group Direct Preference Optimization

196

20 Sep 2025

Representation Shift: Unifying Token Compression with FlashAttention

190

01 Aug 2025

Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval

374

31 Jul 2025

DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO

358

09 Jun 2025

Time Blindness: Why Video-Language Models Can't See What Humans Can?

214

30 May 2025