Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2508.19294
Cited By
v1
v2 (latest)
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
25 August 2025
Ranjan Sapkota
Manoj Karkee
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (4010★)
Papers citing
"Object Detection with Multimodal Large Vision-Language Models: An In-depth Review"
6 / 6 papers shown
Title
MonitorVLM:A Vision Language Framework for Safety Violation Detection in Mining Operations
Jiang Wu
Sichao Wu
Yinsong Ma
Guangyuan Yu
Haoyuan Xu
Lifang Zheng
Jingliang Duan
25
0
0
04 Oct 2025
YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
Ranjan Sapkota
Rahul Harsha Cheppally
Ajay Sharda
Manoj Karkee
ObjD
44
1
0
29 Sep 2025
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
Zheda Mai
A. Chowdhury
Zihe Wang
Sooyoung Jeon
Jingyan Bai
Jiacheng Hou
Jihyung Kil
Wei-Lun Chao
CoGe
115
2
0
10 Jun 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Ranjan Sapkota
Konstantinos I. Roumeliotis
Manoj Karkee
AI4TS
442
64
0
15 May 2025
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges
Ranjan Sapkota
Yang Cao
Konstantinos I. Roumeliotis
Manoj Karkee
LM&Ro
599
27
0
07 May 2025
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I. Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
214
4
0
25 Apr 2025
1