arXiv:2502.01549
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

3 February 2025
Xubin Ren
Lingrui Xu
Long Xia
Shuaiqiang Wang
D. Yin
Chao Huang
    VGen, VLM
ArXiv (abs) | PDF | HTML | Github (2813★)

Papers citing "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

26 / 26 papers shown
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
Woongyeong Yeo
Kangsan Kim
Jaehong Yoon
Sung Ju Hwang
KELM, VLM, LRM
476
6
0
30 Mar 2026
HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents
Anyang Tong
Xiang Niu
ZhiPing Liu
Chang Tian
Yanyan Wei
Zenglin Shi
Meng Wang
172
5
0
25 Nov 2025
Reasoning Text-to-Video Retrieval via Digital Twin Video Representations and Large Language Models
Yiqing Shen
Chenxiao Fan
Chenjia Li
Mathias Unberath
VGen, LRM
308
1
0
15 Nov 2025
Seeing Through the MiRAGE: Evaluating Multimodal Retrieval Augmented Generation
Alexander Martin
William Walden
Reno Kriz
Dengjia Zhang
Kate Sanders
Eugene Yang
Chihsheng Jin
Benjamin Van Durme
177
0
0
28 Oct 2025
LongInsightBench: A Comprehensive Benchmark for Evaluating Omni-Modal Models on Human-Centric Long-Video Understanding
Zhaoyang Han
Qihan Lin
Hao Liang
Bowen Chen
Zhou Liu
Wentao Zhang
VLM
301
0
0
20 Oct 2025
Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding
Xiaoqian Shen
Wenxuan Zhang
Jun-Cheng Chen
Mohamed Elhoseiny
VLM, LRM
156
14
0
15 Oct 2025
RAG-Anything: All-in-One RAG Framework
Zirui Guo
Xubin Ren
Lingrui Xu
Jiahao Zhang
Chao Huang
VLM
188
4
0
14 Oct 2025
CFVBench: A Comprehensive Video Benchmark for Fine-grained Multimodal Retrieval-Augmented Generation
Kaiwen Wei
Xiao-Yang Liu
Jie Zhang
Zijian Wang
Ruida Liu
...
C. Pan
Y. Zhang
Jiang Zhong
Peijin Wang
Yingchao Feng
VGen, VLM
166
0
0
10 Oct 2025
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
Peiran Wu
Zhuorui Yu
Yunze Liu
Chi-Hao Wu
Enmin Zhou
Junxiao Shen
OffRL, VLM
185
2
0
09 Oct 2025
VideoPro: Adaptive Program Reasoning for Long Video Understanding
Chenglin Li
Feng Han
Feng Tao
Ruilin Li
Qianglong Chen
...
Jiaqi Wang
Feng Tao
Jingqi Tong
Yin Zhang
Jiaqi Wang
LRM
235
0
0
22 Sep 2025
See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model
Pengteng Li
Pinhao Song
Wuyang Li
Weiyu Guo
Huizai Yao
Ziyang Chen
Dugang Liu
Hui Xiong
LRM, VLM
213
4
0
19 Sep 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
259
1
0
14 Aug 2025
AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning
Siminfar Samakoush Galougah
Rishie Raj
Sanjoy Chowdhury
Sayan Nag
Ramani Duraiswami
262
4
0
10 Aug 2025
ReMoMask: Retrieval-Augmented Masked Motion Generation
Zhengdao Li
Siheng Wang
Zeyu Zhang
Hao Tang
DiffM, VGen
417
7
0
04 Aug 2025
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
Zhucun Xue
Jiangning Zhang
Xurong Xie
Yuxuan Cai
Yong-Jin Liu
Xiangtai Li
Dacheng Tao
VGen, VLM
463
9
0
16 Jun 2025
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
Sanjoy Chowdhury
Mohamed Elmoghany
Yohan Abeysinghe
Mahmoud Ahmed
Sayan Nag
Salman Khan
Mohamed Elhoseiny
Dinesh Manocha
515
7
0
08 Jun 2025
WikiVideo: Article Generation from Multiple Videos
Alexander Martin
Reno Kriz
William Walden
Kate Sanders
Hannah Recknor
Eugene Yang
Francis Ferraro
Benjamin Van Durme
DiffM, VGen
533
6
0
01 Apr 2025
From Local to Global: A Graph RAG Approach to Query-Focused Summarization
Darren Edge
Ha Trinh
Newman Cheng
Joshua Bradley
Alex Chao
Apurva Mody
Steven Truitt
Dasha Metropolitansky
Robert Osazuwa Ness
Jonathan Larson
RALM
808
1,303
0
20 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
842
43
0
12 Feb 2025
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Yongdong Luo
Xiawu Zheng
Guilin Li
Haojia Lin
...
Jinfa Huang
Jiayi Ji
Jiebo Luo
Rongrong Ji
VLM
757
96
0
20 Nov 2024
Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
International Conference on Learning Representations (ICLR), 2024
Mufei Li
Siqi Miao
Pan Li
RALM
852
73
0
28 Oct 2024
ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems
Ishneet Sukhvinder Singh
Ritvik Aggarwal
Ibrahim Allahverdiyev
Muhammad Taha
Aslihan Akalin
Kevin Zhu
Sean O'Brien
1.1K
30
0
25 Oct 2024
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
International Conference on Learning Representations (ICLR), 2024
S. Yu
C. Tang
Bokai Xu
Junbo Cui
Junhao Ran
...
Zhenghao Liu
Kaiyan Zhang
Xu Han
Zhiyuan Liu
Maosong Sun
VLM
570
172
0
14 Oct 2024
LightRAG: Simple and Fast Retrieval-Augmented Generation
Zirui Guo
Lianghao Xia
Yanhua Yu
Tu Ao
Chao Huang
663
233
0
08 Oct 2024
ColPali: Efficient Document Retrieval with Vision Language Models
Manuel Faysse
Hugues Sibille
Tony Wu
Bilel Omrani
Gautier Viaud
Céline Hudelot
Pierre Colombo
VLM
1.0K
124
0
27 Jun 2024
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Bernal Jiménez Gutiérrez
Yiheng Shu
Yu Gu
Michihiro Yasunaga
Yu-Chuan Su
RALM, CLL
444
169
0
23 May 2024