1504.08083
Cited By
Fast R-CNN
30 April 2015
Ross B. Girshick
ObjD
Papers citing "Fast R-CNN"
50 / 2,357 papers shown
Title
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
59
0
0
18 Mar 2025
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
60
0
0
17 Mar 2025
Gun Detection Using Combined Human Pose and Weapon Appearance
Amulya Reddy Maligireddy
Manohar Reddy Uppula
Nidhi Rastogi
Yaswanth Reddy Parla
53
0
0
15 Mar 2025
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
Jiangning Wei
Lixiong Qin
Bo Yu
Tianjian Zou
Chuhan Yan
Dandan Xiao
Yang Yu
Lan Yang
Ke Li
Jun Liu
41
0
0
14 Mar 2025
Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume
Reef Alturki
A. Hilton
Jean-Yves Guillemaut
44
0
0
14 Mar 2025
TAU: Modeling Temporal Consistency Through Temporal Attentive U-Net for PPG Peak Detection
Chunsheng Zuo
Yu Zhao
Juntao Ye
39
0
0
13 Mar 2025
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue
Ryusuke Miyamoto
44
0
0
10 Mar 2025
LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs
Hanyu Zhou
Gim Hee Lee
42
0
0
10 Mar 2025
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Hanzhi Chen
Boyang Sun
Anran Zhang
Marc Pollefeys
Stefan Leutenegger
LM&Ro
65
0
0
10 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
VLM
ObjD
72
1
0
10 Mar 2025
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements
Haiyang Xie
Xi Shen
Shihua Huang
Qirui Wang
Zheng Wang
39
0
0
10 Mar 2025
IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction
Jiangtong Zhu
Zhao Yang
Yinan Shi
Jianwu Fang
Jianru Xue
ISeg
42
0
0
05 Mar 2025
Catheter Detection and Segmentation in X-ray Images via Multi-task Learning
Lin Xi
Yingliang Ma
Ethan Koland
Sandra Howell
Aldo Rinaldi
Kawal S. Rhode
62
0
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
71
0
0
04 Mar 2025
MonoLite3D: Lightweight 3D Object Properties Estimation
Ahmed El-Dawy
Amr El-Zawawi
Mohamed El-Habrouk
38
0
0
04 Mar 2025
Enhancing Object Detection Accuracy in Underwater Sonar Images through Deep Learning-based Denoising
Ziyu Wang
Tao Xue
Yanbin Wang
J. Li
Haibin Zhang
Zhiqiang Xu
Gaofei Xu
64
0
0
03 Mar 2025
Identity documents recognition and detection using semantic segmentation with convolutional neural network
Mykola Kozlenko
Volodymyr Sendetskyi
Oleksiy Simkiv
Nazar Savchenko
Andy Bosyi
58
3
0
03 Mar 2025
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Zhixiong Nan
Xianghong Li
Jifeng Dai
Tao Xiang
46
0
0
03 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
58
0
0
03 Mar 2025
Insights into dendritic growth mechanisms in batteries: A combined machine learning and computational study
Zirui Zhao
Junchao Xia
Si Wu
Xiaoke Wang
Guanping Xu
Yinghao Zhu
Jing Sun
Hai-Feng Li
31
1
0
02 Mar 2025
Learning-Based Leader Localization for Underwater Vehicles With Optical-Acoustic-Pressure Sensor Fusion
Mingyang Yang
Zeyu Sha
Feitian Zhang
24
0
0
28 Feb 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
57
3
0
27 Feb 2025
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation
Mingjie Wu
Chenggui Yang
Huihua Wang
Chen Xue
Yibo Wang
...
Yuqi Han
R. Li
Lijun Yun
Zaiqing Chen
S.
59
0
0
27 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
101
0
0
27 Feb 2025
An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving
Tianchen Ji
Neeloy Chakraborty
Andre Schreiber
Katherine Rose Driggs-Campbell
99
1
0
23 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Ming-Ming Cheng
ObjD
146
51
0
21 Feb 2025
EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments
Linus Nwankwo
Bjoern Ellensohn
Vedant Dave
Peter Hofer
Jan Forstner
Marlene Villneuve
Robert Galler
Elmar Rueckert
59
3
0
20 Feb 2025
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
Xuan Tong
Yang Chang
Qing Zhao
Jiawen Yu
Boyang Wang
...
Xinji Mai
Haoran Wang
Zeng Tao
Yan Wang
Wenqiang Zhang
66
1
0
17 Feb 2025
An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet
Hongyu Liu
Guowu Yuan
Lei Yang
Kunxiao Liu
Hao Zhou
48
22
0
10 Feb 2025
Large Memory Network for Recommendation
Hui Lu
Zheng Chai
Y. Zheng
Zhe Chen
Deping Xie
Peng Xu
Xun Zhou
51
0
0
08 Feb 2025
RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images
Lei Yang
Guowu Yuan
Hao Zhou
Hongyu Liu
Jian Chen
Hao Wu
106
30
0
05 Feb 2025
ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies
C. Ciușdel
Alex Serban
Tiziano Passerini
CoGe
69
1
0
03 Feb 2025
A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches
Luca Ciampi
Ali Azmoudeh
Elif Ecem Akbaba
Erdi Sarıtaş
Ziya Ata Yazıcı
H. K. Ekenel
Giuseppe Amato
Fabrizio Falchi
97
0
0
31 Jan 2025
Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images
Wei-Lun Chen
Chia-Yeh Hsieh
Yu-Hsiang Kao
Kai-Chun Liu
Sheng-Yu Peng
Yu Tsao
85
0
0
30 Jan 2025
Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition
Jielong Tang
Zhenxing Wang
Ziyang Gong
Jianxing Yu
Shuang Wang
Jian Yin
38
0
0
28 Jan 2025
GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation
Ruicheng Zhang
Haowei Guo
Zeyu Zhang
Puxin Yan
Shen Zhao
76
5
0
22 Jan 2025
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Bingyi Liu
Jian Teng
Hongfei Xue
Enshu Wang
Chuanhui Zhu
Pu Wang
Libing Wu
78
0
0
21 Jan 2025
Self-supervised Transformation Learning for Equivariant Representations
Jaemyung Yu
Jaehyun Choi
Dong-Jae Lee
H. Hong
Junmo Kim
40
0
0
15 Jan 2025
UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping
Yanjie Li
Wenxuan Zhang
K. Liang
Bin Xiao
AAML
61
1
0
10 Jan 2025
Zero-shot Shark Tracking and Biometrics from Aerial Imagery
Chinmay K Lalgudi
Mark E Leone
Jaden V Clark
Sergio Madrigal-Mora
Mario Espinoza
42
0
0
10 Jan 2025
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Yan Lu
Xinzhu Ma
Lei Yang
Tianzhu Zhang
Yating Liu
Qi Chu
Tong He
Yonghui Li
W. Ouyang
61
3
0
08 Jan 2025
Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work
Takumi Kitsukawa
Kazuma Miura
Shigeki Yumoto
Sarthak Pathak
Alessandro Moro
K. Umeda
3DH
25
0
0
08 Jan 2025
Generalization-Enhanced Few-Shot Object Detection in Remote Sensing
Hui Lin
Nan Li
Pengjuan Yao
Kexin Dong
Yuhan Guo
Danfeng Hong
Y. Zhang
Congcong Wen
84
4
0
05 Jan 2025
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Junxiao Xue
Quan Deng
Fei Yu
Yanhao Wang
Jun Wang
Y. Li
VLM
41
3
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
46
3
0
31 Dec 2024
Real-time Speech Enhancement on Raw Signals with Deep State-space Modeling
Yan Ru Pei
Ritik Shrivastava
Fnu Sidharth
38
1
0
31 Dec 2024
First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria
Stefan Schoder
44
0
0
31 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
35
0
0
23 Dec 2024
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement
H. Kim
Jaejun Yoo
47
0
0
23 Dec 2024
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
Tien-Yu Chi
Hung-Yueh Chiang
Chi-Chih Chang
N. Huang
Kai-Chiang Wu
83
0
0
21 Dec 2024