Fast R-CNN
30 April 2015
Ross B. Girshick
    ObjD
ArXiv (abs) | PDF | HTML | Github (3402★)

Papers citing "Fast R-CNN"

50 / 5,404 papers shown
Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection
Ahmet Oğuz Saltık
Alicia Allmendinger
Anthony Stein
299
14
0
18 Dec 2024
Differential Alignment for Domain Adaptive Object Detection
Xinyu He
Xinhui Li
Xiaojie Guo
348
2
0
17 Dec 2024
Open-World Panoptic Segmentation
Matteo Sodano
Federico Magistri
Jens Behley
Cyrill Stachniss
VLM
343
1
0
17 Dec 2024
Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset
Madiyar Alimov
Temirlan Meiramkhanov
ViT
225
2
0
16 Dec 2024
Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges
IEEE Journal of Oceanic Engineering (IEEE J. Ocean. Eng.), 2024
Martin Aubard
Ana Madureira
Luis F. Teixeira
José Pinto
AAML
318
24
0
16 Dec 2024
Neural Collapse Inspired Knowledge Distillation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shuoxi Zhang
Zijian Song
Kun He
431
1
0
16 Dec 2024
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
303
5
0
16 Dec 2024
Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection
Asian Conference on Computer Vision (ACCV), 2024
Mohammadreza Salehi
Nikolaos Apostolikas
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
ObjD
366
0
0
15 Dec 2024
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
Silin Cheng
Yuanpei Liu
Kai Han
EDL
372
0
0
12 Dec 2024
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
Computer Vision and Pattern Recognition (CVPR), 2024
Xinyu Wang
Yu Jin
Wentao Wu
Wei Zhang
Lin Zhu
Bo Jiang
Yonghong Tian
277
18
0
09 Dec 2024
From classical techniques to convolution-based models: A review of object detection algorithms
International Conference on Image Processing, Applications and Systems (ICIPAS), 2024
Fnu Neha
Deepshikha Bhati
Deepak Kumar Shukla
Md. Amiruzzaman
ObjD
VLM
172
11
0
06 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
289
0
0
06 Dec 2024
Explaining Object Detectors via Collective Contribution of Pixels
Toshinori Yamauchi
Hiroshi Kera
K. Kawamoto
ObjD
FAtt
575
3
0
01 Dec 2024
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
Neural Information Processing Systems (NeurIPS), 2024
Yeonguk Yu
Minhwan Ko
Sungho Shin
Kangmin Kim
K. Lee
NoLa
407
5
0
29 Nov 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM
VLM
3DV
670
1
0
29 Nov 2024
Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification
Junbo Jacob Lian
Haoran Chen
Kaichen Ouyang
Yujun Zhang
Rui Zhong
Huiling Chen
174
0
0
29 Nov 2024
Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024
Tsun-hin Cheung
Ka-Chun Fung
Songjiang Lai
Kwan-Ho Lin
Vincent To-Yee NG
K. Lam
248
0
0
28 Nov 2024
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects
Zizhao Li
Zhengkang Xiang
Joseph West
Kourosh Khoshelham
ObjD
VLM
442
3
0
27 Nov 2024
MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Juefei He
Yuexing Peng
Wei Li
Junchuan Yu
Daqing Ge
Wei Xiang
221
2
0
26 Nov 2024
On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance
Neural Information Processing Systems (NeurIPS), 2024
Jingjing Jiang
Yilong Chen
Tianfei Zhou
Tao Xiang
304
0
0
26 Nov 2024
The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Georgina Nuthall
Richard Bowden
Oscar Mendez
VGen
229
0
0
25 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjD
VLM
530
12
0
25 Nov 2024
Corner2Net: Detecting Objects as Cascade Corners
European Conference on Artificial Intelligence (ECAI), 2024
Chenglong Liu
Jintao Liu
Haorao Wei
Jinze Yang
Liangyu Xu
Yuchen Guo
Lu Fang
209
0
0
24 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Jingyu Sun
Elliot J. Crowley
VLM
322
5
0
22 Nov 2024
Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Ming Zhao
Xin Zhang
Andre Kaup
365
26
0
21 Nov 2024
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
Neural Information Processing Systems (NeurIPS), 2024
Shantanu Jaiswal
Debaditya Roy
Basura Fernando
Cheston Tan
ReLM
LRM
363
6
0
20 Nov 2024
Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity
Wassim El Ahmar
Dhanvin Kolhatkar
F. Nowruzi
R. Laganière
293
4
0
20 Nov 2024
SL-YOLO: A Stronger and Lighter Drone Target Detection Model
Defan Chen
Luchan Zhang
ObjD
692
16
0
18 Nov 2024
DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization
C. Koutlis
Symeon Papadopoulos
424
7
0
15 Nov 2024
LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection
Chanyeong Park
Heegwang Kim
Joonki Paik
93
0
0
14 Nov 2024
Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery
Valentin Frank Ingmar Guenter
Athanasios Sideris
CVBM
298
1
0
14 Nov 2024
Drone Detection using Deep Neural Networks Trained on Pure Synthetic Data
Mariusz Wisniewski
Zeeshan A. Rana
Ivan Petrunin
Alan Holt
Stephen Harman
187
2
0
13 Nov 2024
Dockformer: A transformer-based molecular docking paradigm for large-scale virtual screening
Zhangfan Yang
Junkai Ji
Shan He
Jianqiang Li
Ruibin Bai
Zexuan Zhu
Yew-Soon Ong
339
1
0
11 Nov 2024
MEANT: Multimodal Encoder for Antecedent Information
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Benjamin Iyoya Irving
Annika Marie Schoene
AIFin
180
0
1
10 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
International Conference on 3D Vision (3DV), 2024
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
325
3
0
09 Nov 2024
Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory
Journal of Visual Communication and Image Representation (JVCIR), 2023
Ali AlShami
Terrance Boult
Jugal Kalita
315
15
0
07 Nov 2024
Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data
Irum Mehboob
Li Sun
Alireza Astegarpanah
Rustam Stolkin
UQCV
236
0
0
05 Nov 2024
SIRA: Scalable Inter-frame Relation and Association for Radar Perception
Computer Vision and Pattern Recognition (CVPR), 2024
Ryoma Yataka
Peng Wang
P. Boufounos
R. Takahashi
276
10
0
04 Nov 2024
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Yan Li
Weiwei Guo
Songyuan Li
Ning Liao
Shaofeng Zhang
Yi Yu
Wenxian Yu
Junchi Yan
ObjD
261
2
0
04 Nov 2024
Goal-Oriented Semantic Communication for Wireless Visual Question Answering
Sige Liu
Nan Li
Yansha Deng
Tony Q. S. Quek
286
3
0
03 Nov 2024
Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach
Jinhao Liang
Chaopeng Tan
Longhao Yan
Jingyuan Zhou
Guodong Yin
Kaidi Yang
191
20
0
03 Nov 2024
HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices
Xiang Li
Cheng Chen
Yuan-Yao Lou
Mustafa Abdallah
Kwang Taik Kim
S. Bagchi
VOT
834
1
0
01 Nov 2024
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding
IEEE transactions on multimedia (IEEE TMM), 2024
Minghong Xie
Ming Wang
Huafeng Li
Yafei Zhang
Dapeng Tao
Z. Yu
ObjD
184
6
0
31 Oct 2024
SFA-UNet: More Attention to Multi-Scale Contrast and Contextual Information in Infrared Small Object Segmentation
Imad Ali Shah
Fahad Mumtaz Malik
Muhammad Waqas Ashraf
230
1
0
30 Oct 2024
NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery
Xuesong Li
Zeeshan Hayder
Ali Zia
Connor Cassidy
Shiming Liu
W. Stiller
Eric A. Stone
Warren C. Conaty
Lars Petersson
V. Rolland
194
4
0
30 Oct 2024
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
150
0
0
30 Oct 2024
Symbolic Graph Inference for Compound Scene Understanding
FNU Aryan
Simon Stepputtis
Sarthak Bhagat
Joseph Campbell
Kwonjoon Lee
Hossein Nourkhiz Mahjoub
Katia Sycara
OCL
125
0
0
30 Oct 2024
FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Tejaswini Medi
Steffen Jung
Margret Keuper
AAML
424
5
0
30 Oct 2024
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection
Jia Wei
Yun Li
Xiaomao Fan
Wenjun Ma
Meiyu Qiu
Hongyu Chen
Wenbin Lei
160
0
0
29 Oct 2024
Improving Detection of Person Class Using Dense Pooling
Nouman Ahmad
ObjD
178
1
0
28 Oct 2024