Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat, ObjD

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 13,128 papers shown
Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking
Mitchell Keren Taraday
Shahaf Wagner
Chaim Baskin
VLM
110
1
0
08 Oct 2025
Explaining raw data complexity to improve satellite onboard processing
Adrien Dorise
Marjorie Bellizzi
Adrien Girard
Benjamin Francesconi
Stéphane May
128
0
0
08 Oct 2025
Neuroplastic Modular Framework: Cross-Domain Image Classification of Garbage and Industrial Surfaces
Debojyoti Ghosh
Soumya K Ghosh
Adrijit Goswami
88
0
0
06 Oct 2025
Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition
Ranjan Sapkota
Manoj Karkee
ObjD, MU
266
3
0
06 Oct 2025
Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context
Ngeyen Yinkfu
Sunday Nwovu
Jonathan Kayizzi
Angelique Uwamahoro
64
1
0
06 Oct 2025
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
Jyoti Kini
Rohit Gupta
Mubarak Shah
ObjD, VLM
197
0
0
04 Oct 2025
Referring Expression Comprehension for Small Objects
Kanoko Goto
Takumi Hirose
Mahiro Ukai
Shuhei Kurita
Nakamasa Inoue
ObjD
143
1
0
04 Oct 2025
A Hybrid Co-Finetuning Approach for Visual Bug Detection in Video Games
Faliu Yi
Sherif M. Abdelfattah
Wei Huang
Adrian Brown
117
0
0
04 Oct 2025
Conditional Pseudo-Supervised Contrast for Data-Free Knowledge Distillation
Pattern Recognition (Pattern Recogn.), 2023
Renrong Shao
Wei Zhang
Ning Yang
140
10
0
03 Oct 2025
Bayesian Test-time Adaptation for Object Recognition and Detection with Vision-language Models
Lihua Zhou
Mao Ye
Shuaifeng Li
Nianxin Li
Jinlin Wu
X. Zhu
Lei Deng
Hongbin Liu
Jiebo Luo
Zhen Lei
BDL, VLM, TTA
302
0
0
03 Oct 2025
Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Yongyi Su
H. Zhang
Shijie Li
Nanqing Liu
Jingyi Liao
...
Chen Li
Nancy F. Chen
Shuicheng Yan
Xulei Yang
Xun Xu
MLLM, VLM
174
3
0
02 Oct 2025
MMDEW: Multipurpose Multiclass Density Estimation in the Wild
Villanelle O'Reilly
Jonathan Cox
Georgios Leontidis
Marc Hanheide
Petra Bosilj
James M. Brown
120
0
0
02 Oct 2025
Leveraging Prior Knowledge of Diffusion Model for Person Search
Giyeol Kim
Sooyoung Yang
Jihyong Oh
Myungjoo Kang
Chanho Eom
DiffM
104
0
0
02 Oct 2025
IMAGEdit: Let Any Subject Transform
Fei Shen
Weihao Xu
Rui Yan
Dong Zhang
Xiangbo Shu
Jinhui Tang
VGen
120
1
0
01 Oct 2025
Advances in Medical Image Segmentation: A Comprehensive Survey with a Focus on Lumbar Spine Applications
Computers in Biology and Medicine (Comput. Biol. Med.), 2025
Ahmed Kabil
Ghada Khoriba
Mina Yousef
Essam A. Rashed
132
1
0
01 Oct 2025
Semantic Visual Simultaneous Localization and Mapping: A Survey on State of the Art, Challenges, and Future Directions
Thanh Nguyen Canh
Haolan Zhang
Xiem HoangVan
N. Chong
181
0
0
01 Oct 2025
Self-Supervised Anatomical Consistency Learning for Vision-Grounded Medical Report Generation
Longzhen Yang
Zhangkai Ni
Y. Wen
Yihang Liu
Lianghua He
Heng Tao Shen
116
0
0
30 Sep 2025
Hybrid Dual-Batch and Cyclic Progressive Learning for Efficient Distributed Training
Kuan-Wei Lu
Ding-Yong Hong
Pangfeng Liu
Jan-Jan Wu
123
0
0
30 Sep 2025
A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety
Shucheng Zhang
Yan Shi
Bingzhang Wang
Yuang Zhang
Muhammad Monjurul Karim
Kehua Chen
Chenxi Liu
Mehrdad Nasri
Yinhai Wang
155
0
0
30 Sep 2025
Looking Beyond the Known: Towards a Data Discovery Guided Open-World Object Detection
Anay Majee
Amitesh Gangrade
Rishabh K. Iyer
115
0
0
30 Sep 2025
Multi-View Camera System for Variant-Aware Autonomous Vehicle Inspection and Defect Detection
Yash Kulkarni
Raman Jha
Renu Kachhoria
52
0
0
30 Sep 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM, ObjD, VLM, LRM
213
2
0
30 Sep 2025
FishNet++: Analyzing the capabilities of Multimodal Large Language Models in marine biology
Faizan Farooq Khan
Yousef Radwan
Eslam Abdelrahman
Abdulwahab Felemban
Aymen Mir
Nico K. Michiels
Andrew J. Temple
M. Berumen
Mohamed Elhoseiny
68
0
0
29 Sep 2025
YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
Ranjan Sapkota
Rahul Harsha Cheppally
Ajay Sharda
Manoj Karkee
ObjD
314
2
0
29 Sep 2025
TP-MVCC: Tri-plane Multi-view Fusion Model for Silkie Chicken Counting
Sirui Chen
Yuhong Feng
Yifeng Wang
J. Liao
Qi Zhang
59
0
0
29 Sep 2025
A Multi-Camera Vision-Based Approach for Fine-Grained Assembly Quality Control
Ali Nazeri
Shashank Mishra
A. Wagner
Martin Ruskowski
Didier Stricker
J. Rambach
117
0
0
28 Sep 2025
From Unstable to Playable: Stabilizing Angry Birds Levels via Object Segmentation
Mahdi Farrokhimaleki
Parsa Rahmati
Richard Zhao
53
0
0
28 Sep 2025
Focusing on What Matters: Object-Agent-centric Tokenization for Vision Language Action models
Rokas Bendikas
Daniel Dijkman
Markus Peschl
Sanjay Haresh
Pietro Mazzaglia
161
1
0
28 Sep 2025
Diff-3DCap: Shape Captioning with Diffusion Models
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Zhenyu Shu
Jiawei Wen
Shiyang Li
Shiqing Xin
Ligang Liu
DiffM
123
0
0
28 Sep 2025
Enhanced Fracture Diagnosis Based on Critical Regional and Scale Aware in YOLO
Yuyang Sun
Junchuan Yu
Cuiming Zou
136
0
0
27 Sep 2025
FracDetNet: Advanced Fracture Detection via Dual-Focus Attention and Multi-scale Calibration in Medical X-ray Imaging
Yuyang Sun
Cuiming Zou
OOD, MedIm
84
0
0
27 Sep 2025
Incorporating Scene Context and Semantic Labels for Enhanced Group-level Emotion Recognition
Qing Zhu
Wangdong Guo
Qirong Mao
Xiaohua Huang
Xiuyan Shao
Wenming Zheng
81
0
0
26 Sep 2025
Spatial Reasoning in Foundation Models: Benchmarking Object-Centric Spatial Understanding
Vahid Mirjalili
Ramin Giahi
Sriram Kollipara
Akshay Kekuda
Kehui Yao
...
Kaushiki Nag
Sinduja Subramaniam
Topojoy Biswas
Evren Körpeoglu
Kannan Achan
VLM, LRM
92
0
0
26 Sep 2025
$γ$-Quant: Towards Learnable Quantization for Low-bit Pattern Recognition
Mishal Fatima
Shashank Agnihotri
Marius Bock
Kanchana Vaishnavi Gandikota
Kristof Van Laerhoven
Michael Moeller
Margret Keuper
MQ
130
0
0
26 Sep 2025
Enhancing Vehicle Detection under Adverse Weather Conditions with Contrastive Learning
Boying Li
Chang Liu
Petter Kyösti
Mattias Öhman
Devashish Singha Roy
Sofia Plazzi
Hamam Mokayed
Olle Hagner
114
0
0
26 Sep 2025
Multilingual Vision-Language Models, A Survey
Andrei-Alexandru Manea
Jindřich Libovický
VLM
143
1
0
26 Sep 2025
HierLight-YOLO: A Hierarchical and Lightweight Object Detection Network for UAV Photography
Defan Chen
Yaohua Hu
Luchan Zhang
ObjD
204
0
0
26 Sep 2025
CompressAI-Vision: Open-source software to evaluate compression methods for computer vision tasks
Hyomin Choi
Heeji Han
Chris Rosewarne
Fabien Racapé
169
1
0
25 Sep 2025
AI-Enabled Crater-Based Navigation for Lunar Mapping
Sofia Mcleod
Chee-Kheng Chng
Matthew Rodda
Tat-Jun Chin
53
0
0
25 Sep 2025
FSMODNet: A Closer Look at Few-Shot Detection in Multispectral Data
Manuel Nkegoum
M. Pham
Elisa Fromont
Bruno Avignon
Sébastien Lefèvre
144
1
0
25 Sep 2025
DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning
Tianrun Xu
Haoda Jing
Y. Li
Yuquan Wei
Jun Feng
Guanyu Chen
Haichuan Gao
Tianren Zhang
Feng Chen
OffRL
99
0
0
25 Sep 2025
MS-YOLO: Infrared Object Detection for Edge Deployment via MobileNetV4 and SlideLoss
Jiali Zhang
Thomas S. White
Haoliang Zhang
Wenqing Hu
Donald C. Wunsch II
Jian Liu
126
0
0
25 Sep 2025
BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting
Yixun Zhang
Feng Zhou
Jianqin Yin
AAML
143
0
0
24 Sep 2025
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
Pengxiang Li
Yinan Zheng
Y. Wang
Huimin Wang
Hang Zhao
Jingjing Liu
Xianyuan Zhan
Kun Zhan
Xianpeng Lang
113
7
0
24 Sep 2025
SDE-DET: A Precision Network for Shatian Pomelo Detection in Complex Orchard Environments
Yihao Hu
Pan Wang
Xiaodong Bai
Shijie Cai
Hang Wang
...
Aiping Yang
Xiangxiang Li
Meiping Ding
Hongyan Liu
Jianguo Yao
101
0
0
24 Sep 2025
Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
European Conference on Computer Vision (ECCV), 2025
Zizheng Yang
Hu Yu
Bing Li
Jinghao Zhang
Jie Huang
Feng Zhao
221
9
0
24 Sep 2025
Latent Danger Zone: Distilling Unified Attention for Cross-Architecture Black-box Attacks
Yang Li
C. Wang
Tingrui Wang
Yongwei Wang
Haonan Li
Zhunga Liu
Quan Pan
AAML, DiffM
141
0
0
23 Sep 2025
Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset
Chuni Liu
Hongjie Li
Jiaqi Du
Yangyang Hou
Qian Sun
Lei Jin
Ke Xu
OnRL, AI4CE
232
0
0
23 Sep 2025
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
Buyin Deng
Lingxin Huang
Kai Luo
Fei Teng
Kailun Yang
VOT
263
1
0
22 Sep 2025
An Analysis of Kalman Filter based Object Tracking Methods for Fast-Moving Tiny Objects
Prithvi Raj Singh
Raju N. Gottumukkala
Anthony Maida
148
1
0
22 Sep 2025