ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.12589
  4. Cited By
SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

19 May 2025
Bo Liu
Pengfei Qiao
Minhan Ma
Xuange Zhang
Yinan Tang
Peng Xu
Kun Liu
Tongtong Yuan
ArXivPDFHTML

Papers citing "SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models"

12 / 12 papers shown
Title
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Wenxuan Huang
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU
OffRL
LRM
MLLM
ReLM
VLM
98
85
0
09 Mar 2025
Qwen2.5-VL Technical Report
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
178
430
0
20 Feb 2025
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding
Zhenyu Yang
Yihan Hu
Zemin Du
Dizhan Xue
Shengsheng Qian
Jiahong Wu
Fan Yang
W. Dong
Changsheng Xu
76
7
0
15 Feb 2025
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark
Yunzhuo Hao
Jiawei Gu
Huichen Will Wang
Linjie Li
Zhiyong Yang
Lijuan Wang
Yu Cheng
LRM
76
30
0
10 Jan 2025
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Hengyi Wang
Haizhou Shi
Shiwei Tan
Weiyi Qin
Wenyuan Wang
Tunyu Zhang
A. Nambi
T. Ganu
Hao Wang
80
18
0
17 Jun 2024
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video
  Understanding
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Muhammad Maaz
H. Rasheed
Salman Khan
Fahad A Khan
VLM
MLLM
62
57
0
13 Jun 2024
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Chaoyou Fu
Yuhan Dai
Yondong Luo
Lei Li
Shuhuai Ren
...
Xiawu Zheng
Enhong Chen
Caifeng Shan
Xing Sun
Xing Sun
VLM
MLLM
106
357
0
31 May 2024
Advancing Video Anomaly Detection: A Concise Review and a New Dataset
Advancing Video Anomaly Detection: A Concise Review and a New Dataset
Liyun Zhu
Lei Wang
Arjun Raj
Tom Gedeon
Chen Chen
72
19
0
07 Feb 2024
Towards Surveillance Video-and-Language Understanding: New Dataset,
  Baselines, and Challenges
Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges
Tongtong Yuan
Xuange Zhang
Kun Liu
Bo Liu
Chen Chen
Jian Jin
Zhenzhen Jiao
AI4TS
63
18
0
25 Sep 2023
Generalized Video Anomaly Event Detection: Systematic Taxonomy and
  Comparison of Deep Models
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models
Yang Liu
Dingkang Yang
Yan Wang
Jing Liu
Jun Liu
Azzedine Boukerche
Peng Sun
Liang Song
61
91
0
10 Feb 2023
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity
  Detection
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection
Kellie Corona
Katie Osterdahl
Roderic Collins
A. Hoogs
57
64
0
02 Dec 2020
Real-world Anomaly Detection in Surveillance Videos
Real-world Anomaly Detection in Surveillance Videos
Waqas Sultani
Chen Chen
M. Shah
AI4TS
128
1,468
0
12 Jan 2018
1