Fast R-CNN

30 April 2015

Ross B. Girshick

ObjD

Papers citing "Fast R-CNN"

50 / 5,404 papers shown

YOLOE: Real-Time Seeing Anything

549

10 Mar 2025

FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection

Takeru Inoue

Ryusuke Miyamoto

221

10 Mar 2025

VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation

Computer Vision and Pattern Recognition (CVPR), 2025

448

10 Mar 2025

IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction

ACM Multimedia (MM), 2024

416

05 Mar 2025

Catheter Detection and Segmentation in X-ray Images via Multi-task Learning

International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025

240

04 Mar 2025

MonoLite3D: Lightweight 3D Object Properties Estimation

International Conference on Computing: Theory and Applications (ICCTA), 2023

Ahmed El-Dawy

Amr El-Zawawi

Mohamed El-Habrouk

190

04 Mar 2025

Boltzmann Attention Sampling for Image Analysis with Small Objects

Computer Vision and Pattern Recognition (CVPR), 2025

447

04 Mar 2025

Identity documents recognition and detection using semantic segmentation with convolutional neural network

208

03 Mar 2025

Can Optical Denoising Clean Sonar Images? A Benchmark and Fusion Approach

275

03 Mar 2025

Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection

Computer Vision and Pattern Recognition (CVPR), 2025

541

03 Mar 2025

MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism

Computer Vision and Pattern Recognition (CVPR), 2025

361

03 Mar 2025

Insights into dendritic growth mechanisms in batteries: A combined machine learning and computational study

Battery Energy (BE), 2025

113

02 Mar 2025

Learning-Based Leader Localization for Underwater Vehicles With Optical-Acoustic-Pressure Sensor Fusion

Mingyang Yang

Zeyu Sha

Feitian Zhang

217

28 Feb 2025

RTGen: Real-Time Generative Detection Transformer

419

28 Feb 2025

Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation

Conference on Machine Translation (WMT), 2025

890

27 Feb 2025

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

...

Juan Carlos León Alcázar

297

27 Feb 2025

WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation

Yibo Wang

...

559

27 Feb 2025

An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving

Tianchen Ji

Neeloy Chakraborty

Andre Schreiber

Katherine Rose Driggs-Campbell

1.1K

23 Feb 2025

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

470

145

21 Feb 2025

EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments

IEEE International Conference on Robotics and Automation (ICRA), 2024

439

20 Feb 2025

Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection

IEEE International Conference on Robotics and Automation (ICRA), 2025

...

275

17 Feb 2025

An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet

328

10 Feb 2025

Large Memory Network for Recommendation

The Web Conference (WWW), 2025

289

08 Feb 2025

RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology

International Conference on Knowledge and Systems Engineering (KSE), 2024

350

06 Feb 2025

RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images

Applied Sciences (AS), 2022

421

05 Feb 2025

ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies

Applied Sciences (AS), 2025

299

03 Feb 2025

A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches

570

31 Jan 2025

Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images

International Conference on Artificial Intelligence Circuits and Systems (AICAS), 2025

339

30 Jan 2025

Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition

AAAI Conference on Artificial Intelligence (AAAI), 2024

450

28 Jan 2025

RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

276

23 Jan 2025

GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation

411

22 Jan 2025

mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework

475

21 Jan 2025

Self-supervised Transformation Learning for Equivariant Representations

Neural Information Processing Systems (NeurIPS), 2025

283

15 Jan 2025

A novel multi-agent dynamic portfolio optimization learning system based on hierarchical deep reinforcement learning

160

12 Jan 2025

Zero-shot Shark Tracking and Biometrics from Aerial Imagery

Methods in Ecology and Evolution (MEE), 2025

133

10 Jan 2025

UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping

295

10 Jan 2025

UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles

Design, Automation and Test in Europe (DATE), 2025

Abhishek Balasubramaniam

Febin P. Sunny

S. Pasricha

3DPC

246

08 Jan 2025

Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work

170

08 Jan 2025

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

524

08 Jan 2025

Generalization-Enhanced Few-Shot Object Detection in Remote Sensing

398

05 Jan 2025

First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria

Stefan Schoder

414

31 Dec 2024

Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering

306

31 Dec 2024

Towards Visual Grounding: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

986

28 Dec 2024

Towards Unsupervised Model Selection for Domain Adaptive Object Detection

Neural Information Processing Systems (NeurIPS), 2024

275

23 Dec 2024

Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement

AAAI Conference on Artificial Intelligence (AAAI), 2024

H. Kim

Jaejun Yoo

492

23 Dec 2024

V"Mean"ba: Visual State Space Models only need 1 hidden dimension

257

21 Dec 2024

Texture- and Shape-based Adversarial Attacks for Overhead Image Vehicle Detection

International Conference on Information Photonics (ICIP), 2024

Mikael Yeghiazaryan

Sai Abhishek Siddhartha Namburu

414

20 Dec 2024

Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV)

International Conference on Machine Learning and Applications (ICMLA), 2024

Aneesha Guna

Parth Ganeriwala

S. Bhattacharyya

150

19 Dec 2024

TopView: Vectorising road users in a bird's eye view from uncalibrated street-level imagery with deep learning

Mohamed R Ibrahim

365

18 Dec 2024

Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation

IEEE Transactions on Medical Imaging (IEEE TMI), 2024

Sotirios A. Tsaftaris

354

18 Dec 2024