Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian-jun Sun
AIMat
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 5,606 papers shown
Title
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
Crowd Detection Using Very-Fine-Resolution Satellite Imagery
Tong Xiao
Qunming Wang
Ping Lu
Tenghai Huang
Xiaohua Tong
P. M. Atkinson
59
0
0
28 Apr 2025
More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV
Kai Ye
Haidi Tang
Bowen Liu
Pingyang Dai
Liujuan Cao
Rongrong Ji
AI4TS
34
0
0
28 Apr 2025
SynergyAmodal: Deocclude Anything with Text Control
Xinyang Li
Chengjie Yi
Jiawei Lai
Mingbao Lin
Yansong Qu
Shengchuan Zhang
Liujuan Cao
DiffM
73
0
0
28 Apr 2025
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
Loc Phuc Truong Nguyen
Hung Truong Thanh Nguyen
Hung Cao
59
0
0
27 Apr 2025
Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction
Xiaoran Xu
Jiangang Yang
Wenyue Chong
Wenhui Shi
S.
Jing Xing
Jian Liu
ObjD
VLM
79
0
0
27 Apr 2025
Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation
Rayson Laroca
Marcelo dos Santos
David Menotti
ObjD
44
1
0
27 Apr 2025
Swapped Logit Distillation via Bi-level Teacher Alignment
Stephen Ekaputra Limantoro
Jhe-Hao Lin
Chih-Yu Wang
Yi-Lung Tsai
Hong-Han Shuai
Ching-Chun Huang
Wen-Huang Cheng
49
0
0
27 Apr 2025
Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality
Majid Behravan
Maryam Haghani
Denis Gračanin
77
1
0
27 Apr 2025
CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes
Tuan Nguyen
Naseem Khan
Issa Khalil
AAML
59
0
0
27 Apr 2025
R-Sparse R-CNN: SAR Ship Detection Based on Background-Aware Sparse Learnable Proposals
Kamirul Kamirul
Odysseas A. Pappas
A. Achim
53
0
0
26 Apr 2025
VISUALCENT: Visual Human Analysis using Dynamic Centroid Representation
Niaz Ahmad
Youngmoon Lee
Guanghui Wang
3DH
62
0
0
26 Apr 2025
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti
Roberto Basla
Riccardo Pieroni
Matteo Corno
S. Savaresi
Luca Magri
Giacomo Boracchi
3DPC
44
0
0
25 Apr 2025
MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View
Liugang Lu
Dabin He
Congxiang Liu
Zhixiang Deng
47
0
0
25 Apr 2025
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Brian K. S. Isaac-Medina
T. Breckon
OODD
131
0
0
25 Apr 2025
Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition
Yin Tang
Jiankai Li
Hongyu Yang
Xuan Dong
Lifeng Fan
Weixin Li
32
0
0
25 Apr 2025
A Large Vision-Language Model based Environment Perception System for Visually Impaired People
Zezhou Chen
Zhaoxiang Liu
Kai Wang
Kohou Wang
Shiguo Lian
50
0
0
25 Apr 2025
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
74
2
0
25 Apr 2025
S3MOT: Monocular 3D Object Tracking with Selective State Space Model
Zhuohao Yan
Shaoquan Feng
Xingxing Li
Yuxuan Zhou
Chunxi Xia
Shengyu Li
VOT
74
0
0
25 Apr 2025
A Decade of You Only Look Once (YOLO) for Object Detection
Leo Thomas Ramos
Angel D. Sappa
66
0
0
24 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
52
0
0
24 Apr 2025
CIVIL: Causal and Intuitive Visual Imitation Learning
Yinlong Dai
Robert Ramirez Sanchez
Ryan Jeronimus
Shahabedin Sagheb
Cara M. Nunez
Heramb Nemlekar
Dylan P. Losey
68
0
0
24 Apr 2025
AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models
Mohammad Zarei
Melanie A Jutras
Eliana Evans
Mike Tan
Omid Aaramoon
AAML
DiffM
50
0
0
24 Apr 2025
Improving Open-World Object Localization by Discovering Background
Ashish Singh
Michael J. Jones
Kuan-Chuan Peng
A. Cherian
Moitreya Chatterjee
Erik Learned-Miller
ObjD
OCL
VLM
64
0
0
24 Apr 2025
RGB-D Tracking via Hierarchical Modality Aggregation and Distribution Network
Boyue Xu
Y. Xu
Ruichao Hou
Jia Bei
Tongwei Ren
Gangshan Wu
44
1
0
24 Apr 2025
MTSGL: Multi-Task Structure Guided Learning for Robust and Interpretable SAR Aircraft Recognition
Qishan He
Lingjun Zhao
Ru Luo
Siqian Zhang
Lin Lei
Kefeng Ji
Gangyao Kuang
27
0
0
23 Apr 2025
Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection
Jens Petersen
Davide Abati
A. Habibian
Auke Wiggers
ViT
3DPC
48
0
0
23 Apr 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Y. Zhang
Chuan Qin
Bo Li
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
49
0
0
23 Apr 2025
You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection
Jun Dong
Wenli Wu
Jintao Cheng
Xiaoyu Tang
32
0
0
22 Apr 2025
SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems
Manjunath D
Aniruddh Sikdar
Prajwal Gurunath
Sumanth Udupa
Suresh Sundaram
29
0
0
22 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
31
0
0
22 Apr 2025
SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam
Tue Vo
Lakshay Sharma
Tuan Dinh
Khuong Dinh
T. Nguyen
Trung Phan
Minh Do
Duong Vu
35
0
0
21 Apr 2025
An Efficient Aerial Image Detection with Variable Receptive Fields
Liu Wenbin
30
0
0
21 Apr 2025
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
25
0
0
21 Apr 2025
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art
Jing Zhang
Dan Guo
Zhangbin Li
Meng Wang
31
0
0
20 Apr 2025
ISTD-YOLO: A Multi-Scale Lightweight High-Performance Infrared Small Target Detection Algorithm
Shang Zhang
Yujie Cui
Ruoyan Xiong
Huanbin Zhang
22
0
0
19 Apr 2025
Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network
Lu Pan
Yu-Hsuan Huang
Hongxia Xie
Cheng Zhang
H Zhao
Hong-Han Shuai
Wen-Huang Cheng
23
0
0
19 Apr 2025
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
31
0
0
18 Apr 2025
Compile Scene Graphs with Reinforcement Learning
Zuyao Chen
Jinlin Wu
Zhen Lei
Marc Pollefeys
Chang Wen Chen
OffRL
LRM
57
0
0
18 Apr 2025
Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes
Sridevi Polavaram
Xin Zhou
Meenu Ravi
Mohammad Zarei
Anmol Srivastava
19
0
0
18 Apr 2025
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
37
3
0
18 Apr 2025
Collaborative Perception Datasets for Autonomous Driving: A Review
N. Wang
Deyong Shang
Yan Gong
X. S. Hu
Ziying Song
Lei Yang
Y. Huang
Xiaoyu Wang
J. Lu
37
0
0
17 Apr 2025
Accurate Tracking of Arabidopsis Root Cortex Cell Nuclei in 3D Time-Lapse Microscopy Images Based on Genetic Algorithm
Yu Song
Tatsuaki Goh
Yinhao Li
Jiahua Dong
Shunsuke Miyashima
Yutaro Iwamoto
Yohei Kondo
Keiji Nakajima
Yen-Wei Chen
33
0
0
17 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
26
0
0
17 Apr 2025
ChartQA-X: Generating Explanations for Charts
Shamanthak Hegde
Pooyan Fazli
H. Seifi
20
0
0
17 Apr 2025
Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification
Reek Majumder
M. Chowdhury
S. Khan
Zadid Khan
Fahim Ahmad
Frank Ngeni
G. Comert
Judith Mwakalonge
Dimitra Michalaka
AAML
33
0
0
17 Apr 2025
Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions
Yifei Dong
Fengyi Wu
Sanjian Zhang
Guangyu Chen
Yuzhi Hu
...
Jingdong Sun
Siyu Huang
Feng Liu
Qi Dai
Zhi-Qi Cheng
39
0
0
16 Apr 2025
MixSignGraph: A Sign Sequence is Worth Mixed Graphs of Nodes
Shiwei Gan
Yafeng Yin
Zhiwei Jiang
Hongkai Wen
Lei Xie
Sanglu Lu
SLR
39
0
0
16 Apr 2025
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
Shripad Pate
Aiman Farooq
Suvrankar Datta
Musadiq Aadil Sheikh
Atin Kumar
Deepak Mishra
26
0
0
15 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
36
0
0
15 Apr 2025
Previous
1
2
3
4
5
...
111
112
113
Next