QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

22 May 2025

Benjamin Schneider

Papers citing "QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design"

5 / 5 papers shown

StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding

...

343

4

0

03 Aug 2025

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

...

596

790

1

14 Apr 2025

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

333

9

0

17 Feb 2025

InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling

...

543

120

0

21 Jan 2025

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

...

Yuhang Cao

Jiaqi Wang

331

133

0

22 Oct 2024