ResearchTrend.AI

Encoding and Controlling Global Semantics for Long-form Video Question Answering

30 May 2024
Thong Nguyen
Zhiyuan Hu
Xiaobao Wu
Cong-Duy Nguyen
See-Kiong Ng
Anh Tuan Luu
arXiv: 2405.19723

Papers citing "Encoding and Controlling Global Semantics for Long-form Video Question Answering"

2 / 2 papers shown
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
Thong Nguyen
Zhiyuan Hu
Xu Lin
Cong-Duy Nguyen
See-Kiong Ng
Luu Anh Tuan
VLM
134
1
0
19 May 2025
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
Yiwu Zhong
Zhuoming Liu
Yin Li
Liwei Wang
220
13
0
04 Dec 2024