1504.08083
Fast R-CNN
30 April 2015
Ross B. Girshick
ObjD
ArXiv (abs)
PDF
HTML
Github (3402★)
Papers citing "Fast R-CNN"
50 / 5,413 papers shown
YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications
Image and Vision Computing New Zealand (IVCNZ), 2025
Yida Lin
Bing Xue
Mengjie Zhang
Sam Schofield
Richard Green
9
3
0
05 Dec 2025
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
Jiazhe Wei
Ken Li
Tianyu Lao
Haofan Wang
Liang Wang
Caifeng Shan
Chenyang Si
147
2
0
03 Dec 2025
YOLOA: Real-Time Affordance Detection via LLM Adapter
Yuqi Ji
Junjie Ke
Lihuo He
J. Liu
Kaifan Zhang
Yu-Kun Lai
Guiguang Ding
Xinbo Gao
180
0
0
03 Dec 2025
From Black Hole to Galaxy: Neural Operator Framework for Accretion and Feedback Dynamics
Nihaal Bhojwani
Chuwei Wang
Hai-Yang Wang
Chang Sun
Elias R. Most
Anima Anandkumar
AI4CE
118
1
0
01 Dec 2025
Analyzing Image Beyond Visual Aspect: Image Emotion Classification via Multiple-Affective Captioning
Zibo Zhou
Zhengjun Zhai
Huimin Chen
Wei Dai
Hansen Yang
100
0
0
28 Nov 2025
Geometry-Consistent 4D Gaussian Splatting for Sparse-Input Dynamic View Synthesis
Yiwei Li
Jiannong Cao
Penghui Ruan
Divya Saxena
Songye Zhu
Yinfeng Cao
3DGS
242
0
0
28 Nov 2025
SemOD: Semantic Enabled Object Detection Network under Various Weather Conditions
Aiyinsi Zuo
Zhaoliang Zheng
101
0
0
27 Nov 2025
Intelligent Image Search Algorithms Fusing Visual Large Models
Kehan Wang
Tingqiong Cui
Y. Zhang
Yu Chen
Shifeng Wu
Z. Li
VLM
205
0
0
25 Nov 2025
Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation
Shuyang Liu
Yuan Jin
Rui Lin
Shizhe Chen
Junyu Dai
Tao Jiang
200
0
0
24 Nov 2025
AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization
C. Koutlis
Symeon Papadopoulos
134
0
0
24 Nov 2025
Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety Framework
Italian National Conference on Sensors (INS), 2025
David Bricher
Andreas Mueller
176
1
0
24 Nov 2025
Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving
J. N. Han
Meng Tian
Jiangtong Zhu
Fan He
Huixin Zhang
...
Siyuan Dong
Lu Hou
Qingqiu Huang
Xiaosong Jia
H. Xu
VLM
161
1
0
24 Nov 2025
A lightweight detector for real-time detection of remote sensing images
Qianyi Wang
Guoqiang Ren
ObjD
232
1
0
21 Nov 2025
PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation
Information Fusion (Inf. Fusion), 2025
Ting Pan
Ye Wang
Peiguang Jing
Rui Ma
Zili Yi
Y. Liu
281
0
0
20 Nov 2025
Late-decoupled 3D Hierarchical Semantic Segmentation with Semantic Prototype Discrimination based Bi-branch Supervision
Shuyu Cao
Chongshou Li
Jie Xu
Tianrui Li
Na Zhao
189
0
0
20 Nov 2025
StreetView-Waste: A Multi-Task Dataset for Urban Waste Management
Diogo J. Paulo
João Martins
Hugo Manuel Proença
Joao Neves
80
0
0
20 Nov 2025
GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI
Naomi Simumba
Nils Lehmann
Paolo Fraccaro
Hamed Alemohammad
Geeth De Mel
...
Nicolas Longépé
Xiao Xiang Zhu
Hannah Kerner
Juan Bernabé-Moreno
Alexander Lacoste
ELM
VLM
252
1
0
19 Nov 2025
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
Xiangyu Li
C. Wang
Yumao Liu
Dengbo He
J. Zhang
Ke Ma
98
0
0
18 Nov 2025
Multi-task GINN-LP for Multi-target Symbolic Regression
Hussein Rajabu
Lijun Qian
Xishuang Dong
131
0
0
17 Nov 2025
MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning
Yoonjae Seo
Ermal Elbasani
Jaehong Lee
MQ
337
0
0
17 Nov 2025
SAE-MCVT: A Real-Time and Scalable Multi-Camera Vehicle Tracking Framework Powered by Edge Computing
Yuqiang Lin
Sam Lockyer
Florian Stanek
Markus Zarbock
Adrian Evans
Wenbin Li
Nic Zhang
205
0
0
17 Nov 2025
MaskAnyNet: Rethinking Masked Image Regions as Valuable Information in Supervised Learning
Jingshan Hong
Haigen Hu
Huihuang Zhang
Q. Zhou
Zhao Li
150
0
0
16 Nov 2025
Multimodal Peer Review Simulation with Actionable To-Do Recommendations for Community-Aware Manuscript Revisions
Journal of Information Systems Engineering & Management (JISEM), 2025
Mengze Hong
Di Jiang
Weiwei Zhao
Yawen Li
Y. Wang
Xinyuan Luo
Yanjie Sun
Chen Zhang
98
3
0
14 Nov 2025
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Jinfu Li
Yuqi Huang
Hong Song
Ting Wang
Jianghan Xia
Yucong Lin
Jingfan Fan
Jian Yang
ObjD
251
0
0
13 Nov 2025
Fast 3D Surrogate Modeling for Data Center Thermal Management
Soumyendu Sarkar
Antonio Guillen-Perez
Zachariah Carmichael
Avisek Naug
Refik Mert Cam
Vineet Gundecha
Ashwin Ramesh Babu
Sahand Ghorbanpour
Ricardo Luna Gutierrez
AI4CE
259
0
0
13 Nov 2025
High-Quality Proposal Encoding and Cascade Denoising for Imaginary Supervised Object Detection
Zhiyuan Chen
Yuelin Guo
Zitong Huang
Haoyu He
Renhao Lu
Weizhe Zhang
90
0
0
11 Nov 2025
Contrast-Guided Cross-Modal Distillation for Thermal Object Detection
SiWoo Kim
JhongHyun An
182
0
0
03 Nov 2025
SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment
Xinyu Mao
Junsi Li
H. Zhang
Yu Liang
Ming Sun
VLM
258
0
0
03 Nov 2025
A Hybrid YOLOv5-SSD IoT-Based Animal Detection System for Durian Plantation Protection
Anis Suttan Shahrir
Zakiah Ayop
Syarulnaziah Anawar
Norulzahrah Mohd Zainudin
87
0
0
02 Nov 2025
Confined Space Underwater Positioning Using Collaborative Robots
Xueliang Cheng
Kanzhong Yao
A. West
O. Marjanovic
Barry Lennox
K. Groves
118
1
0
31 Oct 2025
Gaussian Combined Distance: A Generic Metric for Object Detection
IEEE Geoscience and Remote Sensing Letters (GRSL), 2025
Ziqian Guan
Xieyi Fu
Pengjun Huang
Hengyuan Zhang
Hubin Du
Yongtao Liu
Yinglin Wang
Qang Ma
175
1
0
31 Oct 2025
Improving Classification of Occluded Objects through Scene Context
Courtney M. King
Daniel D. Leeds
Damian Lyons
George Kalaitzis
ObjD
347
0
0
30 Oct 2025
VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes
Simon Yu
Peilin Yu
Hongbo Zheng
Huajie Shao
Han Zhao
L. Sha
152
0
0
29 Oct 2025
AttnCache: Accelerating Self-Attention Inference for LLM Prefill via Attention Cache
IACR Cryptology ePrint Archive (IACR ePrint), 2025
Dinghong Song
Yuan Feng
Y. Wang
S. Chen
Cyril Guyot
F. Blagojevic
Hyeran Jeon
Pengfei Su
Dong Li
258
0
0
29 Oct 2025
GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction
Yang Jin
Guangyu Guo
Binglu Wang
109
0
0
29 Oct 2025
Mask-Robust Face Verification for Online Learning via YOLOv5 and Residual Networks
Zhifeng Wang
Minghui Wang
Chunyan Zeng
Jialong Yao
Yang Yang
Hongmin Xu
121
0
0
29 Oct 2025
FruitProm: Probabilistic Maturity Estimation and Detection of Fruits and Vegetables
Sidharth Rai
Rahul Harsha Cheppally
Benjamin Vail
Keziban Yalçın Dokumacı
Ajay Sharda
146
0
0
28 Oct 2025
MELDAE: A Framework for Micro-Expression Spotting, Detection, and Automatic Evaluation in In-the-Wild Conversational Scenes
Yigui Feng
Qinglin Wang
Yang Liu
Ke Liu
Haotian Mo
Enhao Huang
G. Liu
M. Liu
Jie Liu
93
1
0
26 Oct 2025
GRAP-MOT: Unsupervised Graph-based Position Weighted Person Multi-camera Multi-object Tracking in a Highly Congested Space
Marek Socha
M. Marczyk
Aleksander Kempski
M. Cogiel
P. Foszner
Radosław Zawiski
M. Staniszewski
VOT
250
0
0
24 Oct 2025
Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation
Thaweerath Phisannupawong
J. J. Damanik
Han-Lim Choi
188
0
0
24 Oct 2025
Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism
Junfei Zhou
Penglin Dai
Quanmin Wei
Bingyi Liu
Xiao-Jun Wu
Jianping Wang
353
2
0
22 Oct 2025
A Unified Detection Pipeline for Robust Object Detection in Fisheye-Based Traffic Surveillance
Neema Jakisa Owor
Joshua Kofi Asamoah
Tanner Muturi
Anneliese Jakisa Owor
Blessing Agyei Kyem
Andrews Danyo
Y. Adu-Gyamfi
Armstrong Aboah
182
3
0
22 Oct 2025
VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction
Junhong Lin
Kangli Wang
Shunzhou Wang
S. Fan
Ge Li
Wei-Nan Gao
3DGS
218
1
0
22 Oct 2025
Kinematic Analysis and Integration of Vision Algorithms for a Mobile Manipulator Employed Inside a Self-Driving Laboratory
Shifa Sulaiman
Tobias Busk Jensen
Stefan Hein Bengtson
Simon Bøgh
86
3
0
21 Oct 2025
SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery
Zhenqi He
Yuanpei Liu
Kai Han
208
2
0
21 Oct 2025
Enhancing Rotated Object Detection via Anisotropic Gaussian Bounding Box and Bhattacharyya Distance
Chien Thai
Mai Xuan Trang
Huong Ninh
Hoang Hiep Ly
Anh Son Le
166
1
0
18 Oct 2025
Post-surgical Endometriosis Segmentation in Laparoscopic Videos
International Conference on Content-Based Multimedia Indexing (CBMI), 2021
Andreas Leibetseder
Klaus Schoeffmann
Jörg Keckstein
Simon Keckstein
111
2
0
14 Oct 2025
Detect Anything via Next Point Prediction
Qing Jiang
Junan Huo
Xingyu Chen
Yuda Xiong
Zhaoyang Zeng
Yihao Chen
Tianhe Ren
Junzhi Yu
Lei Zhang
ObjD
226
18
0
14 Oct 2025
Source-Free Object Detection with Detection Transformer
IEEE Transactions on Image Processing (IEEE TIP), 2025
Huizai Yao
Sicheng Zhao
Shuo Lu
Hui Chen
Yangyang Li
Guoping Liu
Tengfei Xing
C. Yan
Jianhua Tao
Guiguang Ding
ViT
108
3
0
13 Oct 2025
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
Jinxuan Li
Chaolei Tan
Haoxuan Chen
Jianxin Ma
Jian-Fang Hu
Wei-Shi Zheng
Jianhuang Lai
VLM
211
1
0
12 Oct 2025