ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.06761
  4. Cited By
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

AAAI Conference on Artificial Intelligence (AAAI), 2025
12 January 2025
Ji Soo Lee
Jongha Kim
Jeehye Na
Jinyoung Park
H. Kim
    VGen
ArXiv (abs)PDFHTML

Papers citing "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning"

6 / 6 papers shown
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang
Jing Bi
Pinxin Liu
Zhenyu Pan
Mingqian Feng
...
Zeliang Zhang
Daiki Shimada
Han Liu
Jiebo Luo
Chenliang Xu
MLLMOffRLVLMLRM
744
8
0
06 Oct 2025
Captioning for Text-Video Retrieval via Dual-Group Direct Preference Optimization
Captioning for Text-Video Retrieval via Dual-Group Direct Preference Optimization
Ji Soo Lee
Byungoh Ko
Jaewon Cho
Howoong Lee
Jaewoon Byun
Hyunwoo J. Kim
196
1
0
20 Sep 2025
Representation Shift: Unifying Token Compression with FlashAttention
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi
S. Lee
Byungoh Ko
Eunseo Kim
Jihyung Kil
Hyunwoo J. Kim
190
0
0
01 Aug 2025
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval
Dohwan Ko
Ji Soo Lee
M. Choi
Zihang Meng
Hyunwoo J. Kim
374
1
0
31 Jul 2025
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO
Jinyoung Park
Jeehye Na
Jinyoung Kim
H. Kim
OffRL
358
22
0
09 Jun 2025
Time Blindness: Why Video-Language Models Can't See What Humans Can?
Time Blindness: Why Video-Language Models Can't See What Humans Can?
Ujjwal Upadhyay
Mukul Ranjan
Zhiqiang Shen
Mohamed Elhoseiny
VLM
214
3
0
30 May 2025
1