Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.16175
Cited By
v1
v2 (latest)
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
22 May 2025
Benjamin Schneider
Dongfu Jiang
Chao Du
Tianyu Pang
Wenhu Chen
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (42 upvotes)
Papers citing
"QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design"
5 / 5 papers shown
StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding
Haolin Yang
Feilong Tang
Linxiao Zhao
Xiang An
Ming Hu
...
Yifan Lu
Xiaofeng Zhang
Abdalla Swikir
Junjun He
Zongyuan Ge
343
4
0
03 Aug 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Jinguo Zhu
Weiyun Wang
Zhe Chen
Ziwei Liu
Shenglong Ye
...
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
Wei Wang
MLLM
VLM
596
790
1
14 Apr 2025
Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More
Zichen Wen
Yifeng Gao
Shaobo Wang
J.N. Zhang
Qintong Zhang
Weijia Li
Conghui He
Linfeng Zhang
VLM
333
9
0
17 Feb 2025
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
Yi Wang
Xinhao Li
Ziang Yan
Yinan He
Jiashuo Yu
...
Kai Chen
Wenhai Wang
Yu Qiao
Yali Wang
Limin Wang
543
120
0
21 Jan 2025
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
Long Xing
Qidong Huang
Xiaoyi Dong
Jiajie Lu
Pan Zhang
...
Yuhang Cao
Bin Wang
Jiaqi Wang
Feng Wu
Dahua Lin
VLM
331
133
0
22 Oct 2024
1