ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.01497
  4. Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
v1v2v3 (latest)

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMatObjD
ArXiv (abs)PDFHTML

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 13,020 papers shown
Title
Learning Informative Attention Weights for Person Re-Identification
Learning Informative Attention Weights for Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
327
0
0
24 Dec 2025
ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection
ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection
Omid Reza Heidari
Yang Wang
Xinxin Zuo
152
0
0
02 Dec 2025
FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection
Ashish Vashist
Qiranul Saadiyean
Suresh Sundaram
Chandra Sekhar Seelamantula
16
0
0
01 Dec 2025
Artemis: Structured Visual Reasoning for Perception Policy Learning
Artemis: Structured Visual Reasoning for Perception Policy Learning
Wei Tang
Yanpeng Sun
Shan Zhang
Xiaofan Li
Piotr Koniusz
Wei Li
Na Zhao
Z. Li
LRMVLM
72
0
0
01 Dec 2025
BlinkBud: Detecting Hazards from Behind via Sampled Monocular 3D Detection on a Single EarbudProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025
Yunzhe Li
Jiajun Yan
Yuzhou Wei
Kechen Liu
Yize Zhao
Chong Zhang
Hongzi Zhu
Li Lu
Shan Chang
Minyi Guo
100
0
0
01 Dec 2025
Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery
Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery
Zhicheng Zhao
Y. Huang
Lingma Sun
Chenglong Li
Jin Tang
20
0
0
01 Dec 2025
Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback
Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback
Aiden Yiliu Li
Bizhi Yu
Daoan Lei
Tianhe Ren
Shilong Liu
LRMAI4CE
64
0
0
01 Dec 2025
DAONet-YOLOv8: An Occlusion-Aware Dual-Attention Network for Tea Leaf Pest and Disease Detection
DAONet-YOLOv8: An Occlusion-Aware Dual-Attention Network for Tea Leaf Pest and Disease Detection
Yefeng Wu
Shan Wan
Ling Wu
Yecheng Zhao
76
0
0
28 Nov 2025
Analysis of Incursive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation
Analysis of Incursive Breast Cancer in Mammograms Using YOLO, Explainability, and Domain Adaptation
Jayan Adhikari
Prativa Joshi
Susish Baral
8
0
0
28 Nov 2025
LC4-DViT: Land-cover Creation for Land-cover Classification with Deformable Vision Transformer
LC4-DViT: Land-cover Creation for Land-cover Classification with Deformable Vision Transformer
Kai Wang
S. Chen
Weicong Pang
Chenchen Zhang
Renjun Gao
Z. Chen
Cheng Li
Dasa Gu
Rui Huang
Alexis Kai Hon Lau
ViT
28
0
0
27 Nov 2025
UMind-VL: A Generalist Ultrasound Vision-Language Model for Unified Grounded Perception and Comprehensive Interpretation
UMind-VL: A Generalist Ultrasound Vision-Language Model for Unified Grounded Perception and Comprehensive Interpretation
Dengbo Chen
Ziwei Zhao
Kexin Zhang
Shishuang Zhao
J. Hou
...
AnLan Sun
Fei Gao
Jia Ding
Y. Liu
Dong Wang
VLM
60
0
0
27 Nov 2025
PAGen: Phase-guided Amplitude Generation for Domain-adaptive Object Detection
PAGen: Phase-guided Amplitude Generation for Domain-adaptive Object Detection
Shuchen Du
Shuo Lei
Feiran Li
Jiacheng Li
Daisuke Iso
48
0
0
27 Nov 2025
CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation
CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation
Shizhe Sun
Wataru Ohyama
173
0
0
26 Nov 2025
Co-Training Vision Language Models for Remote Sensing Multi-task Learning
Co-Training Vision Language Models for Remote Sensing Multi-task Learning
Qingyun Li
Shuran Ma
Junwei Luo
Yi Yu
Yue Zhou
...
Xiaoxing Wang
Xin He
Yushi Chen
Xue Yang
Junchi Yan
152
0
0
26 Nov 2025
HybriDLA: Hybrid Generation for Document Layout Analysis
HybriDLA: Hybrid Generation for Document Layout Analysis
Yufan Chen
Omar Moured
R. Liu
Junwei Zheng
Kunyu Peng
Jiaming Zhang
Rainer Stiefelhagen
53
0
0
25 Nov 2025
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
Tooba Tehreem Sheikh
Jean Lahoud
Rao Muhammad Anwer
Fahad Shahbaz Khan
Salman Khan
Hisham Cholakkal
ObjDMedImVLM
267
0
0
25 Nov 2025
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation
Daniel Kienzle
K. Ludwig
Julian Lorenz
ShinÍchi Satoh
Rainer Lienhart
64
0
0
25 Nov 2025
Exploring State-of-the-art models for Early Detection of Forest Fires
Exploring State-of-the-art models for Early Detection of Forest Fires
Sharjeel Ahmed
Daim Armaghan
Fatima Naweed
Umair Yousaf
Ahmad Zubair
Murtaza Taj
64
0
0
25 Nov 2025
Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection?
Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection?
Kun Guo
Yun Shen
Xijun Wang
Chaoqun You
Yun Rui
Tony Q.S. Quek
32
0
0
25 Nov 2025
Intelligent Image Search Algorithms Fusing Visual Large Models
Intelligent Image Search Algorithms Fusing Visual Large Models
Kehan Wang
Tingqiong Cui
Y. Zhang
Yu Chen
Shifeng Wu
Z. Li
VLM
126
0
0
25 Nov 2025
ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis
ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis
Advik Sinha
Saurabh Atreya
Aashutosh A V
Sk Aziz Ali
Abhijit Das
CLIP
120
0
0
25 Nov 2025
Multimodal Real-Time Anomaly Detection and Industrial Applications
Multimodal Real-Time Anomaly Detection and Industrial Applications
Aman Verma
Keshav Samdani
Mohd. Samiuddin Shafi
130
0
0
24 Nov 2025
Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety Framework
Analysis of Deep-Learning Methods in an ISO/TS 15066-Compliant Human-Robot Safety FrameworkItalian National Conference on Sensors (INS), 2025
David Bricher
Andreas Mueller
132
0
0
24 Nov 2025
Robust Physical Adversarial Patches Using Dynamically Optimized Clusters
Robust Physical Adversarial Patches Using Dynamically Optimized Clusters
Harrison Bagley
Will Meakin
Simon Lucey
Yee Wei Law
Tat-Jun Chin
AAML
100
0
0
23 Nov 2025
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Chuang Peng
Renshuai Tao
Zhongwei Ren
Xianglong Liu
Yunchao Wei
104
0
0
23 Nov 2025
SciPostLayoutTree: A Dataset for Structural Analysis of Scientific Posters
SciPostLayoutTree: A Dataset for Structural Analysis of Scientific Posters
Shohei Tanaka
Atsushi Hashimoto
Yoshitaka Ushiku
84
0
0
23 Nov 2025
State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
Jiaying Zhou
Qingchao Chen
80
0
0
22 Nov 2025
Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI
Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI
A. Gomaa
Annette Schwarz
Ludwig Singer
Arnd Dörfler
M. May
...
Andrea Wittig
R. Fietkau
Christoph Bert
Stefanie Corradini
F. Putz
60
0
0
22 Nov 2025
VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection
VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object Detection
Jianhang Yao
Yongbin Zheng
Siqi Lu
Wanying Xu
Peng Sun
ObjDVLM
215
0
0
22 Nov 2025
Person Recognition in Aerial Surveillance: A Decade Survey
Person Recognition in Aerial Surveillance: A Decade SurveyIEEE Transactions on Biometrics Behavior and Identity Science (TBBIS), 2025
Kien Nguyen
Feng Liu
Clinton Fookes
Sridha Sridharan
Xiaoming Liu
Arun Ross
80
0
0
21 Nov 2025
REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion
REXO: Indoor Multi-View Radar Object Detection via 3D Bounding Box Diffusion
Ryoma Yataka
Pu Perry Wang
P. Boufounos
R. Takahashi
93
0
0
21 Nov 2025
Controllable Layer Decomposition for Reversible Multi-Layer Image Generation
Controllable Layer Decomposition for Reversible Multi-Layer Image Generation
Zihao Liu
Zunnan Xu
Shi Shu
Jun Zhou
Ruicheng Zhang
Zhenchao Tang
Xiu Li
186
0
0
20 Nov 2025
StreetView-Waste: A Multi-Task Dataset for Urban Waste Management
Diogo J. Paulo
João Martins
Hugo Manuel Proença
Joao Neves
36
0
0
20 Nov 2025
Deep Learning for Accurate Vision-based Catch Composition in Tropical Tuna Purse Seiners
Deep Learning for Accurate Vision-based Catch Composition in Tropical Tuna Purse Seiners
Xabier Lekunberri
Ahmad Kamal
Izaro Goienetxea
Jon Ruiz
Iñaki Quincoces
Jaime Valls Miro
Ignacio Arganda-Carreras
Jose A. Fernandes-Salvador
104
0
0
19 Nov 2025
Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection
Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection
Spyridon Loukovitis
Vasileios Karampinis
Athanasios Voulodimos
60
0
0
19 Nov 2025
What Your Features Reveal: Data-Efficient Black-Box Feature Inversion Attack for Split DNNs
What Your Features Reveal: Data-Efficient Black-Box Feature Inversion Attack for Split DNNs
Zhihan Ren
Lijun He
Jiaxi Liang
Xinzhu Fu
Haixia Bi
Fan Li
AAML
184
1
0
19 Nov 2025
When CNNs Outperform Transformers and Mambas: Revisiting Deep Architectures for Dental Caries Segmentation
When CNNs Outperform Transformers and Mambas: Revisiting Deep Architectures for Dental Caries Segmentation
Aashish Ghimire
Jun Zeng
Roshan Paudel
Nikhil Kumar Tomar
Deepak Ranjan Nayak
Harshith Reddy Nalla
Vivek Jha
Glenda Reynolds
Debesh Jha
Mamba
250
0
0
18 Nov 2025
Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification
Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-Identification
Rifen Lin
Alex Jinpeng Wang
Jiawei Mo
Min Li
135
0
0
17 Nov 2025
MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning
MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning
Yoonjae Seo
Ermal Elbasani
Jaehong Lee
MQ
108
0
0
17 Nov 2025
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation
Henry Herzog
Favyen Bastani
Yawen Zhang
Gabriel Tseng
Joseph Redmon
...
Hannah Kerner
Evan Shelhamer
Ali Farhadi
Ranjay Krishna
Patrick Beukema
VGen
144
0
0
17 Nov 2025
T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving
T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving
Chen Ma
Ningfei Wang
Junhao Zheng
Qing Guo
Qian Wang
Qi Alfred Chen
Chao Shen
DiffM
173
0
0
17 Nov 2025
Uni-Hema: Unified Model for Digital Hematopathology
Uni-Hema: Unified Model for Digital Hematopathology
Abdul Rehman
Iqra Rasool
Ayisha Imran
Mohsen Ali
Waqas Sultani
VLM
116
0
0
17 Nov 2025
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Ankita Raj
Chetan Arora
ObjDAAMLVLM
173
0
0
16 Nov 2025
Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection
Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection
Xi Xiao
Zhuxuanzi Wang
Mingqiao Mo
Chen Liu
Chenrui Ma
Yanshu Li
Smita Krishnaswamy
Xiao Wang
Tianyang Wang
112
0
0
16 Nov 2025
Uncover and Unlearn Nuisances: Agnostic Fully Test-Time Adaptation
Uncover and Unlearn Nuisances: Agnostic Fully Test-Time AdaptationMachine-mediated learning (ML), 2025
Ponhvoan Srey
Yaxin Shi
Hangwei Qian
Jing Li
Ivor Tsang
TTA
168
0
0
16 Nov 2025
MixAR: Mixture Autoregressive Image Generation
MixAR: Mixture Autoregressive Image Generation
Jinyuan Hu
Jiayou Zhang
Shaobo Cui
Kun Zhang
Guangyi Chen
DiffM
128
0
0
15 Nov 2025
Multimodal Peer Review Simulation with Actionable To-Do Recommendations for Community-Aware Manuscript Revisions
Multimodal Peer Review Simulation with Actionable To-Do Recommendations for Community-Aware Manuscript RevisionsJournal of Information Systems Engineering & Management (JISEM), 2025
Mengze Hong
Di Jiang
Weiwei Zhao
Yawen Li
Y. Wang
Xinyuan Luo
Yanjie Sun
Chen Zhang
57
0
0
14 Nov 2025
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
Jinfu Li
Yuqi Huang
Hong Song
Ting Wang
Jianghan Xia
Yucong Lin
Jingfan Fan
Jian Yang
ObjD
182
0
0
13 Nov 2025
LLM-YOLOMS: Large Language Model-based Semantic Interpretation and Fault Diagnosis for Wind Turbine Components
LLM-YOLOMS: Large Language Model-based Semantic Interpretation and Fault Diagnosis for Wind Turbine Components
Y. Li
Y. Wang
Meng Li
Xinming Li
Jianbo Feng
56
0
0
13 Nov 2025
How Can We Effectively Use LLMs for Phishing Detection?: Evaluating the Effectiveness of Large Language Model-based Phishing Detection Models
How Can We Effectively Use LLMs for Phishing Detection?: Evaluating the Effectiveness of Large Language Model-based Phishing Detection Models
Fujiao Ji
Doowon Kim
173
0
0
12 Nov 2025
1234...259260261
Next