Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.01497
Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian-jun Sun
AIMat
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 5,179 papers shown
Title
A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery
Pengyu Chen
Sicheng Wang
Cuizhen Wang
Senrong Wang
Beiao Huang
Lu Huang
Zhe Zang
32
0
0
29 Mar 2025
VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving
Haibo Hu
Jiacheng Zuo
Yang Lou
Yufei Cui
Jianping Wang
Nan Guan
Jin Wang
Yung-Hui Li
Chun Jason Xue
VLM
55
1
0
29 Mar 2025
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection
Marc-Antoine Lavoie
Anas Mahmoud
Steven Waslander
37
0
0
29 Mar 2025
RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations
Bin Zhang
Jinggang Chen
Xiaoyang Qu
Guokuan Li
Kai Lu
Jiguang Wan
Jing Xiao
Jianzong Wang
ObjD
41
0
0
28 Mar 2025
SIGHT: Single-Image Conditioned Generation of Hand Trajectories for Hand-Object Interaction
Alexey Gavryushin
Florian Redhardt
Gaia Di Lorenzo
Luc Van Gool
Marc Pollefeys
Kaichun Mo
Xi Wang
37
0
0
28 Mar 2025
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou
X. Zuo
Rui Ma
Li Cheng
ViT
35
0
0
27 Mar 2025
Embedding Compression Distortion in Video Coding for Machines
Y. Sun
Yao-Min Zhao
Meiqin Liu
Chao Yao
Weisi Lin
52
0
0
27 Mar 2025
Multimodal surface defect detection from wooden logs for sawing optimization
Bořek Reich
Matej Kunda
Fedor Zolotarev
Tuomas Eerola
Pavel Zemčík
Tomi Kauppi
41
0
0
27 Mar 2025
AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification
Earl Ranario
Lars Lundqvist
Heesup Yun
Brian N Bailey
J. M. Earles
VLM
38
0
0
27 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang
Xinzhu Ma
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGS
AI4TS
67
0
0
26 Mar 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
62
0
0
26 Mar 2025
Bandwidth Allocation for Cloud-Augmented Autonomous Driving
Peter Schafhalter
Alexander Krentsel
Joseph E. Gonzalez
Sylvia Ratnasamy
S. Shenker
Ion Stoica
74
0
0
26 Mar 2025
RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task
Chunshan Li
Rong Wang
Xiaofei Yang
Dianhui Chu
72
0
0
26 Mar 2025
TerraTorch: The Geospatial Foundation Models Toolkit
Carlos Gomes
Benedikt Blumenstiel
Joao Lucas de Sousa Almeida
Pedro Henrique de Oliveira
P. Fraccaro
Francesc Marti Escofet
Daniela Szwarcman
Naomi Simumba
Romeo Kienzler
Bianca Zadrozny
69
1
0
26 Mar 2025
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
Jan Kohút
Martin Dočekal
Michal Hradiš
Marek Vaško
37
0
0
25 Mar 2025
Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines
Junle Liu
Yun Zhang
Zixi Guo
39
0
0
25 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Yao Hu
Yongchao Xu
ObjD
47
0
0
24 Mar 2025
Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics
Md. Barkat Ullah Tusher
Shartaz Khan Akash
Amirul Islam Showmik
41
0
0
24 Mar 2025
Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining
Guanglu Dong
Tianheng Zheng
Yuanzhouhan Cao
L. Qing
Chao Ren
DiffM
56
0
0
24 Mar 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
54
0
0
24 Mar 2025
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Sara Al-Emadi
Yin Yang
Ferda Ofli
34
0
0
24 Mar 2025
Distilling Stereo Networks for Performant and Efficient Leaner Networks
Rafia Rahim
Samuel Woerz
A. Zell
77
0
0
24 Mar 2025
Frequency Dynamic Convolution for Dense Image Prediction
Linwei Chen
Lin Gu
Liang Li
C. Yan
Ying Fu
42
0
0
24 Mar 2025
Vision-Guided Loco-Manipulation with a Snake Robot
Adarsh Salagame
Sasank Potluri
Keshav Bharadwaj Vaidyanathan
Kruthika Gangaraju
Eric N. Sihite
Milad Ramezani
Alireza Ramezani
60
0
0
24 Mar 2025
SFDLA: Source-Free Document Layout Analysis
Sebastian Tewes
Yufan Chen
Omar Moured
Jiaming Zhang
Rainer Stiefelhagen
48
0
0
24 Mar 2025
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning
Yufei Zhan
Yousong Zhu
Shurong Zheng
Hongyin Zhao
Fan Yang
Ming Tang
J. T. Wang
VLM
67
3
0
23 Mar 2025
Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms
Nachuan Ma
Zhengfei Song
Qiang Hu
Chuang-Wei Liu
Yu Han
Yanting Zhang
Rui Fan
Lihua Xie
51
0
0
23 Mar 2025
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
Yu Wang
Junxian Mu
Hongzhi Huang
Qilong Wang
Pengfei Zhu
Q. Hu
55
0
0
22 Mar 2025
A Causal Adjustment Module for Debiasing Scene Graph Generation
Li Liu
Shuzhou Sun
Shuaifeng Zhi
Fan Shi
Zhen Liu
J. Heikkilä
Yongxiang Liu
CML
52
2
0
22 Mar 2025
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability
P. Hill
Zhiming Liu
Nantheera Anantrasirichai
Mamba
48
0
0
22 Mar 2025
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
Ting Sun
Cheng Cui
Yuning Du
Yi Liu
42
1
0
21 Mar 2025
EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision
Xiaofeng Mao
YueFeng Chen
Rong Zhang
Hui Xue
Zhao Li
Hang Su
AAML
VLM
41
0
0
21 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
53
0
0
20 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
39
2
0
20 Mar 2025
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation
Martin Kostelník
Karel Beneš
Michal Hradiš
37
0
0
20 Mar 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
47
0
0
19 Mar 2025
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark
Ying Liu
Yijing Hua
Haojiang Chai
Yanbo Wang
TengQi Ye
ObjD
54
0
0
19 Mar 2025
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion
Saad Lahlali
Sandra Kara
Hejer Ammar
Florian Chabot
Nicolas Granger
Hervé Le Borgne
Q. C. Pham
3DPC
57
0
0
19 Mar 2025
DCA: Dividing and Conquering Amnesia in Incremental Object Detection
Aoting Zhang
Dongbao Yang
Chang-Shu Liu
Xiaopeng Hong
Miao Shang
Yu Zhou
CLL
60
0
0
19 Mar 2025
A Language Vision Model Approach for Automated Tumor Contouring in Radiation Oncology
Yi Luo
H. Hooshangnejad
Xue Feng
Gaofeng Huang
X. Chen
Rui Zhang
Quan Chen
Wil Ngwa
Kai Ding
57
0
0
19 Mar 2025
Test-Time Backdoor Detection for Object Detection Models
Hangtao Zhang
Yichen Wang
Shihui Yan
Chenyu Zhu
Ziqi Zhou
Linshan Hou
Shengshan Hu
Minghui Li
Yanjun Zhang
L. Zhang
AAML
51
0
0
19 Mar 2025
Universal Scene Graph Generation
Shengqiong Wu
Hao Fei
Tat-Seng Chua
36
0
0
19 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
59
0
0
18 Mar 2025
Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic
Monika Shah
Somdeb Sarkhel
Deepak Venugopal
MLLM
BDL
VLM
83
0
0
18 Mar 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
J. Liu
LM&Ro
74
0
0
18 Mar 2025
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
Sayak Nag
Udita Ghosh
Sarosij Bose
Calvin-Khang Ta
Jiachen Li
A. Roy-Chowdhury
61
0
0
18 Mar 2025
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
Xinqing Li
Ruiqi Song
Qingyu Xie
Ye Wu
Nanxin Zeng
Yunfeng Ai
VGen
SyDa
70
0
0
18 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
51
0
0
17 Mar 2025
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA
Michal Danilowicz
T. Kryjak
VOT
51
0
0
17 Mar 2025
Finite Samples for Shallow Neural Networks
Yu Xia
Zhiqiang Xu
43
0
0
17 Mar 2025
Previous
1
2
3
4
5
...
102
103
104
Next