Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1504.08083
Cited By
v1
v2 (latest)
Fast R-CNN
30 April 2015
Ross B. Girshick
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3402★)
Papers citing
"Fast R-CNN"
50 / 5,404 papers shown
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLM
ObjD
549
35
0
10 Mar 2025
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue
Ryusuke Miyamoto
221
0
0
10 Mar 2025
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Computer Vision and Pattern Recognition (CVPR), 2025
Hanzhi Chen
Boyang Sun
Anran Zhang
Marc Pollefeys
Stefan Leutenegger
LM&Ro
448
31
0
10 Mar 2025
IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction
ACM Multimedia (MM), 2024
Jiangtong Zhu
Zhao Yang
Yinan Shi
Jianwu Fang
Jianru Xue
ISeg
416
1
0
05 Mar 2025
Catheter Detection and Segmentation in X-ray Images via Multi-task Learning
International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025
Lin Xi
Yingliang Ma
Ethan Koland
Sandra Howell
Aldo Rinaldi
Kawal S. Rhode
240
1
0
04 Mar 2025
MonoLite3D: Lightweight 3D Object Properties Estimation
International Conference on Computing: Theory and Applications (ICCTA), 2023
Ahmed El-Dawy
Amr El-Zawawi
Mohamed El-Habrouk
190
0
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Computer Vision and Pattern Recognition (CVPR), 2025
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
447
2
0
04 Mar 2025
Identity documents recognition and detection using semantic segmentation with convolutional neural network
Mykola Kozlenko
Volodymyr Sendetskyi
Oleksiy Simkiv
Nazar Savchenko
Andy Bosyi
208
5
0
03 Mar 2025
Can Optical Denoising Clean Sonar Images? A Benchmark and Fusion Approach
Ziyu Wang
Tao Xue
Jingyuan Li
Haibin Zhang
Zhiqiang Xu
Zhiqiang Xu
Gaofei Xu
Yanbin Wang
Zhiquan Liu
275
0
0
03 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
541
5
0
03 Mar 2025
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Computer Vision and Pattern Recognition (CVPR), 2025
Jingjing Jiang
Xianghong Li
Jifeng Dai
Tao Xiang
361
7
0
03 Mar 2025
Insights into dendritic growth mechanisms in batteries: A combined machine learning and computational study
Battery Energy (BE), 2025
Zirui Zhao
Junchao Xia
Si Wu
Xiaoke Wang
Guanping Xu
Yinghao Zhu
Jing Sun
Hai-Feng Li
113
6
0
02 Mar 2025
Learning-Based Leader Localization for Underwater Vehicles With Optical-Acoustic-Pressure Sensor Fusion
Mingyang Yang
Zeyu Sha
Feitian Zhang
217
1
0
28 Feb 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
Jiying Zhao
Wenhu Chen
ObjD
VLM
419
0
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Conference on Machine Translation (WMT), 2025
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
890
8
0
27 Feb 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
297
6
0
27 Feb 2025
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation
Mingjie Wu
Chenggui Yang
Huihua Wang
Chen Xue
Yibo Wang
...
Yuqi Han
R. Li
Lijun Yun
Zaiqing Chen
Siyang Song
559
0
0
27 Feb 2025
An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving
Tianchen Ji
Neeloy Chakraborty
Andre Schreiber
Katherine Rose Driggs-Campbell
1.1K
2
0
23 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Mingg-Ming Cheng
Ming-Ming Cheng
ObjD
470
145
0
21 Feb 2025
EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments
IEEE International Conference on Robotics and Automation (ICRA), 2024
Linus Nwankwo
Bjoern Ellensohn
Vedant Dave
Peter Hofer
Jan Forstner
Marlene Villneuve
Robert Galler
Elmar Rueckert
439
6
0
20 Feb 2025
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
IEEE International Conference on Robotics and Automation (ICRA), 2025
Xuan Tong
Yang Chang
Qing Zhao
Xuan Tong
Boyang Wang
...
Xinji Mai
Haoran Wang
Zeng Tao
Yan Wang
Wenqiang Zhang
275
1
0
17 Feb 2025
An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet
Hongyu Liu
Guowu Yuan
Lei Yang
Kunxiao Liu
Hao Zhou
328
30
0
10 Feb 2025
Large Memory Network for Recommendation
The Web Conference (WWW), 2025
Hui Lu
Zheng Chai
Y. Zheng
Zhe Chen
Deping Xie
Peng Xu
Xun Zhou
289
3
0
08 Feb 2025
RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology
International Conference on Knowledge and Systems Engineering (KSE), 2024
Nhat-Tan Do
Nhi Ngoc-Yen Nguyen
Dieu-Phuong Nguyen
Trong-Hop Do
VOT
350
2
0
06 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Applied Sciences (AS), 2022
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
421
39
0
05 Feb 2025
ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies
Applied Sciences (AS), 2025
C. Ciușdel
Alex Serban
Tiziano Passerini
CoGe
299
1
0
03 Feb 2025
A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches
Luca Ciampi
Ali Azmoudeh
Elif Ecem Akbaba
Erdi Sarıtaş
Ziya Ata Yazıcı
H. K. Ekenel
Giuseppe Amato
Fabrizio Falchi
570
2
0
31 Jan 2025
Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images
International Conference on Artificial Intelligence Circuits and Systems (AICAS), 2025
Wei-Lun Chen
Chia-Yeh Hsieh
Yu-Hsiang Kao
Kai-Chun Liu
Sheng-Yu Peng
Yu Tsao
339
0
0
30 Jan 2025
Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jielong Tang
Zhenxing Wang
Ziyang Gong
Jianxing Yu
Shuang Wang
Jian Yin
450
3
0
28 Jan 2025
RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Yang Bai
Christan Earl Grant
Daisy Zhe Wang
RALM
276
5
0
23 Jan 2025
GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation
Ruicheng Zhang
Haowei Guo
Zeyu Zhang
Puxin Yan
Shen Zhao
411
9
0
22 Jan 2025
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Bingyi Liu
Jian Teng
Hongfei Xue
Enshu Wang
Chuanhui Zhu
Pu Wang
Libing Wu
475
6
0
21 Jan 2025
Self-supervised Transformation Learning for Equivariant Representations
Neural Information Processing Systems (NeurIPS), 2025
Jaemyung Yu
Jaehyun Choi
Dong-Jae Lee
H. Hong
Junmo Kim
283
0
0
15 Jan 2025
A novel multi-agent dynamic portfolio optimization learning system based on hierarchical deep reinforcement learning
Tian Ding
Yue Xi
Angelos Stefanidis
Zhengyong Jiang
Jionglong Su
160
6
0
12 Jan 2025
Zero-shot Shark Tracking and Biometrics from Aerial Imagery
Methods in Ecology and Evolution (MEE), 2025
Chinmay K Lalgudi
Mark E Leone
Jaden V Clark
Sergio Madrigal-Mora
Mario Espinoza
133
4
0
10 Jan 2025
UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping
Yanjie Li
Wenxuan Zhang
K. Liang
AAML
295
6
0
10 Jan 2025
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles
Design, Automation and Test in Europe (DATE), 2025
Abhishek Balasubramaniam
Febin P. Sunny
S. Pasricha
3DPC
246
0
0
08 Jan 2025
Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work
Takumi Kitsukawa
Kazuma Miura
Shigeki Yumoto
Sarthak Pathak
Alessandro Moro
K. Umeda
3DH
170
1
0
08 Jan 2025
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yan Lu
Cheng Wang
Lei Yang
Tianzhu Zhang
Yating Liu
Qi Chu
Tong He
Yonghui Li
W. Ouyang
524
16
0
08 Jan 2025
Generalization-Enhanced Few-Shot Object Detection in Remote Sensing
Hui Lin
Nan Li
Pengjuan Yao
Kexin Dong
Yuhan Guo
Danfeng Hong
Yanzhe Zhang
Congcong Wen
398
17
0
05 Jan 2025
First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria
Stefan Schoder
414
0
0
31 Dec 2024
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Junxiao Xue
Quan Deng
Fei Yu
Yanhao Wang
Jun Wang
Yongqian Li
VLM
306
11
0
31 Dec 2024
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
986
31
0
28 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
275
4
0
23 Dec 2024
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement
AAAI Conference on Artificial Intelligence (AAAI), 2024
H. Kim
Jaejun Yoo
492
2
0
23 Dec 2024
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
Tien-Yu Chi
Hung-Yueh Chiang
Chi-Chih Chang
N. Huang
Kai-Chiang Wu
257
1
0
21 Dec 2024
Texture- and Shape-based Adversarial Attacks for Overhead Image Vehicle Detection
International Conference on Information Photonics (ICIP), 2024
Mikael Yeghiazaryan
Sai Abhishek Siddhartha Namburu
Emily Kim
Stanislav Panev
Celso de Melo
Brent Lance
Fernando de la Torre
AAML
414
0
0
20 Dec 2024
Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV)
International Conference on Machine Learning and Applications (ICMLA), 2024
Aneesha Guna
Parth Ganeriwala
S. Bhattacharyya
150
0
0
19 Dec 2024
TopView: Vectorising road users in a bird's eye view from uncalibrated street-level imagery with deep learning
Mohamed R Ibrahim
365
1
0
18 Dec 2024
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation
IEEE Transactions on Medical Imaging (IEEE TMI), 2024
Jinghan Sun
Dong-mei Wei
Zhe Xu
Donghuan Lu
Hong Liu
Hong Wang
Sotirios A. Tsaftaris
Jingyu Sun
Yefeng Zheng
Liansheng Wang
MedIm
354
0
0
18 Dec 2024
Previous
1
2
3
...
5
6
7
...
107
108
109
Next
Page 6 of 109
Page
of 109
Go