Fast R-CNN

30 April 2015

Ross B. Girshick

ObjD

Papers citing "Fast R-CNN"

50 / 5,404 papers shown

Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection

Ahmet Oğuz Saltık

Alicia Allmendinger

Anthony Stein

299

18 Dec 2024

Differential Alignment for Domain Adaptive Object Detection

Xinyu He

Xinhui Li

Xiaojie Guo

348

17 Dec 2024

Open-World Panoptic Segmentation

343

17 Dec 2024

Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset

Madiyar Alimov

Temirlan Meiramkhanov

ViT

225

16 Dec 2024

Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges

IEEE Journal of Oceanic Engineering (IEEE J. Ocean. Eng.), 2024

318

16 Dec 2024

Neural Collapse Inspired Knowledge Distillation

AAAI Conference on Artificial Intelligence (AAAI), 2024

Shuoxi Zhang

Zijian Song

Kun He

431

16 Dec 2024

V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

303

16 Dec 2024

Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection

Asian Conference on Computer Vision (ACCV), 2024

366

15 Dec 2024

UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework

372

12 Dec 2024

Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset

Computer Vision and Pattern Recognition (CVPR), 2024

277

09 Dec 2024

From classical techniques to convolution-based models: A review of object detection algorithms

International Conference on Image Processing, Applications and Systems (ICIPAS), 2024

172

06 Dec 2024

Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

K. Hashmi

Talha Uddin Sheikh

Didier Stricker

Muhammad Zeshan Afzal

289

06 Dec 2024

Explaining Object Detectors via Collective Contribution of Pixels

575

01 Dec 2024

Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise

Neural Information Processing Systems (NeurIPS), 2024

407

29 Nov 2024

ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model

670

29 Nov 2024

Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification

Junbo Jacob Lian

Haoran Chen

Kaichen Ouyang

Yujun Zhang

Rui Zhong

Huiling Chen

174

29 Nov 2024

Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024

248

28 Nov 2024

From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects

442

27 Nov 2024

MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection

IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024

221

26 Nov 2024

On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance

Neural Information Processing Systems (NeurIPS), 2024

304

26 Nov 2024

The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation

IEEE International Conference on Robotics and Automation (ICRA), 2024

229

25 Nov 2024

Open Vocabulary Monocular 3D Object Detection

530

25 Nov 2024

Corner2Net: Detecting Objects as Cascade Corners

European Conference on Artificial Intelligence (ECAI), 2024

209

24 Nov 2024

There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks

322

22 Nov 2024

Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation

IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024

Ming Zhao

Xin Zhang

Andre Kaup

365

21 Nov 2024

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios

Neural Information Processing Systems (NeurIPS), 2024

363

20 Nov 2024

Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity

293

20 Nov 2024

SL-YOLO: A Stronger and Lighter Drone Target Detection Model

Defan Chen

Luchan Zhang

ObjD

692

18 Nov 2024

DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization

C. Koutlis

Symeon Papadopoulos

424

15 Nov 2024

LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection

Chanyeong Park

Heegwang Kim

Joonki Paik

14 Nov 2024

Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery

Valentin Frank Ingmar Guenter

Athanasios Sideris

CVBM

298

14 Nov 2024

Drone Detection using Deep Neural Networks Trained on Pure Synthetic Data

187

13 Nov 2024

Dockformer: A transformer-based molecular docking paradigm for large-scale virtual screening

339

11 Nov 2024

MEANT: Multimodal Encoder for Antecedent Information

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Benjamin Iyoya Irving

Annika Marie Schoene

AIFin

180

10 Nov 2024

LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation

International Conference on 3D Vision (3DV), 2024

325

09 Nov 2024

Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory

Journal of Visual Communication and Image Representation (JVCIR), 2023

Ali AlShami

Terrance Boult

Jugal Kalita

315

07 Nov 2024

Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data

236

05 Nov 2024

SIRA: Scalable Inter-frame Relation and Association for Radar Perception

Computer Vision and Pattern Recognition (CVPR), 2024

276

04 Nov 2024

Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

261

04 Nov 2024

Goal-Oriented Semantic Communication for Wireless Visual Question Answering

286

03 Nov 2024

Interaction-Aware Trajectory Prediction for Safe Motion Planning in Autonomous Driving: A Transformer-Transfer Learning Approach

191

03 Nov 2024

HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices

834

01 Nov 2024

Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding

IEEE Transactions on Multimedia (IEEE TMM), 2024

Huafeng Li

184

31 Oct 2024

SFA-UNet: More Attention to Multi-Scale Contrast and Contextual Information in Infrared Small Object Segmentation

Imad Ali Shah

Fahad Mumtaz Malik

Muhammad Waqas Ashraf

230

30 Oct 2024

NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery

Lars Petersson

194

30 Oct 2024

Unbiased Regression Loss for DETRs

150

30 Oct 2024

Symbolic Graph Inference for Compound Scene Understanding

Hossein Nourkhiz Mahjoub

Katia Sycara

OCL

125

30 Oct 2024

FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

424

30 Oct 2024

SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection

Yun Li

160

29 Oct 2024

Improving Detection of Person Class Using Dense Pooling

Nouman Ahmad

ObjD

178

28 Oct 2024