Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2502.01549
Cited By

VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

3 February 2025

Shuaiqiang Wang

ArXiv (abs)PDF HTML Github (2813★)

Papers citing "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

26 / 26 papers shown

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

476

6

0

30 Mar 2026

HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents

HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents

172

5

0

25 Nov 2025

Reasoning Text-to-Video Retrieval via Digital Twin Video Representations and Large Language Models

Reasoning Text-to-Video Retrieval via Digital Twin Video Representations and Large Language Models

Mathias Unberath

308

1

0

15 Nov 2025

Seeing Through the MiRAGE: Evaluating Multimodal Retrieval Augmented Generation

Seeing Through the MiRAGE: Evaluating Multimodal Retrieval Augmented Generation

Alexander Martin

Benjamin Van Durme

177

0

0

28 Oct 2025

LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding

LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding

301

0

0

20 Oct 2025

Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

Mohamed Elhoseiny

156

14

0

15 Oct 2025

RAG-Anything: All-in-One RAG Framework

RAG-Anything: All-in-One RAG Framework

188

4

0

14 Oct 2025

CFVBench: A Comprehensive Video Benchmark for Fine-grained Multimodal Retrieval-Augmented Generation

CFVBench: A Comprehensive Video Benchmark for Fine-grained Multimodal Retrieval-Augmented Generation

...

166

0

0

10 Oct 2025

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

185

2

0

09 Oct 2025

VideoPro: Adaptive Program Reasoning for Long Video Understanding

VideoPro: Adaptive Program Reasoning for Long Video Understanding

...

Feng Tao

Jingqi Tong

Yin Zhang

Jiaqi Wang

235

0

0

22 Sep 2025

See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model

See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model

213

4

0

19 Sep 2025

Empowering Multimodal LLMs with External Tools: A Comprehensive Survey

Empowering Multimodal LLMs with External Tools: A Comprehensive Survey

259

1

0

14 Aug 2025

AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning

AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning

Siminfar Samakoush Galougah

Sanjoy Chowdhury

Ramani Duraiswami

262

4

0

10 Aug 2025

ReMoMask: Retrieval-Augmented Masked Motion Generation

ReMoMask: Retrieval-Augmented Masked Motion Generation

417

7

0

04 Aug 2025

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding

Jiangning Zhang

463

9

0

16 Jun 2025

MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks

MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks

Sanjoy Chowdhury

Mohamed Elmoghany

Yohan Abeysinghe

Mohamed Elhoseiny

515

7

0

08 Jun 2025

WikiVideo: Article Generation from Multiple Videos

WikiVideo: Article Generation from Multiple Videos

Alexander Martin

Francis Ferraro

Benjamin Van Durme

533

6

0

01 Apr 2025

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

Dasha Metropolitansky

Robert Osazuwa Ness

Jonathan Larson

808

1,303

0

20 Feb 2025

Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation

Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Mohammad Mahdi Abootorabi

Amirhosein Zobeiri

Mohammadali Mohammadkhani

Bardia Mohammadi

Ehsaneddin Asgari

842

43

0

12 Feb 2025

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

...

757

96

0

20 Nov 2024

Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented GenerationInternational Conference on Learning Representations (ICLR), 2024

Pan Li

852

73

0

28 Oct 2024

ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems

ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems

Ishneet Sukhvinder Singh

Ritvik Aggarwal

Ibrahim Allahverdiyev

1.1K

30

0

25 Oct 2024

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality DocumentsInternational Conference on Learning Representations (ICLR), 2024

...

Zhiyuan Liu

570

172

0

14 Oct 2024

LightRAG: Simple and Fast Retrieval-Augmented Generation

LightRAG: Simple and Fast Retrieval-Augmented Generation

663

233

0

08 Oct 2024

ColPali: Efficient Document Retrieval with Vision Language Models

ColPali: Efficient Document Retrieval with Vision Language Models

C´eline Hudelot

1.0K

124

0

27 Jun 2024

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024

Bernal Jiménez Gutiérrez

Michihiro Yasunaga

444

169

0

23 May 2024

Page 1 of 1