2405.19723
Cited By
Encoding and Controlling Global Semantics for Long-form Video Question Answering
30 May 2024
Thong Nguyen
Zhiyuan Hu
Xiaobao Wu
Cong-Duy Nguyen
See-Kiong Ng
Anh Tuan Luu
Papers citing "Encoding and Controlling Global Semantics for Long-form Video Question Answering"
2 / 2 papers shown
Title
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
Thong Nguyen
Zhiyuan Hu
Xu Lin
Cong-Duy Nguyen
See-Kiong Ng
Luu Anh Tuan
VLM
134
1
0
19 May 2025
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
Yiwu Zhong
Zhuoming Liu
Yin Li
Liwei Wang
220
13
0
04 Dec 2024