ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.01497
  4. Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
v1v2v3 (latest)

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMatObjD
ArXiv (abs)PDFHTML

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 13,130 papers shown
Pix2seq: A Language Modeling Framework for Object Detection
Pix2seq: A Language Modeling Framework for Object DetectionInternational Conference on Learning Representations (ICLR), 2021
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLMViTVLM
640
407
0
22 Sep 2021
Natural Language Video Localization with Learnable Moment Proposals
Natural Language Video Localization with Learnable Moment ProposalsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Shaoning Xiao
Long Chen
Jian Shao
Yueting Zhuang
Jun Xiao
170
45
0
22 Sep 2021
A deep neural network for multi-species fish detection using multiple
  acoustic cameras
A deep neural network for multi-species fish detection using multiple acoustic cameras
Garcia Fernandez Guglielmo
François Martignac
M. Nevoux
L. Beaulaton
Thomas Corpetti
122
1
0
22 Sep 2021
COVR: A test-bed for Visually Grounded Compositional Generalization with
  real images
COVR: A test-bed for Visually Grounded Compositional Generalization with real imagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ben Bogin
Shivanshu Gupta
Matt Gardner
Jonathan Berant
CoGe
174
30
0
22 Sep 2021
MVM3Det: A Novel Method for Multi-view Monocular 3D Detection
MVM3Det: A Novel Method for Multi-view Monocular 3D Detection
Haoran Li
Zicheng Duan
Mingjun Ma
Yaran Chen
Jiaqi Li
Dong Zhao
3DPC
188
6
0
22 Sep 2021
Robust Visual Teach and Repeat for UGVs Using 3D Semantic Maps
Robust Visual Teach and Repeat for UGVs Using 3D Semantic Maps
Mohammad Mahdavian
KangKang Yin
Mo Chen
126
4
0
21 Sep 2021
Towards a Real-Time Facial Analysis System
Towards a Real-Time Facial Analysis System
Bishwo Adhikari
Xingyang Ni
Esa Rahtu
H. Huttunen
CVBM
75
2
0
21 Sep 2021
Oriented Object Detection in Aerial Images Based on Area Ratio of
  Parallelogram
Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram
Xinyi Yu
M.-W. Lin
Jiangping Lu
L. Ou
281
10
0
21 Sep 2021
KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
Xingyu Liu
Shun Iwase
Kris Kitani
3DPC
166
18
0
21 Sep 2021
StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose
  Estimation
StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation
Xingyu Liu
Shun Iwase
Kris Kitani
3DV
223
55
0
21 Sep 2021
Bayesian Confidence Calibration for Epistemic Uncertainty Modelling
Bayesian Confidence Calibration for Epistemic Uncertainty Modelling
Fabian Küppers
Jan Kronenberger
Jonas Schneider
Anselm Haselhoff
UQCVBDL
159
11
0
21 Sep 2021
Survey: Transformer based Video-Language Pre-training
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLMViT
205
49
0
21 Sep 2021
Object Detection in Thermal Spectrum for Advanced Driver-Assistance
  Systems (ADAS)
Object Detection in Thermal Spectrum for Advanced Driver-Assistance Systems (ADAS)
Muhammad Ali Farooq
Peter Corcoran
C. Rotariu
Waseem Shariff
135
44
0
20 Sep 2021
Background-Foreground Segmentation for Interior Sensing in Automotive
  Industry
Background-Foreground Segmentation for Interior Sensing in Automotive Industry
Claudia Drygala
Matthias Rottmann
Hanno Gottschalk
Klaus Friedrichs
Thomas Kurbiel
151
2
0
20 Sep 2021
Learning Natural Language Generation from Scratch
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
147
4
0
20 Sep 2021
Learning Versatile Convolution Filters for Efficient Visual Recognition
Learning Versatile Convolution Filters for Efficient Visual Recognition
Kai Han
Yunhe Wang
Chang Xu
Chunjing Xu
Enhua Wu
Dacheng Tao
133
8
0
20 Sep 2021
Capsule networks with non-iterative cluster routing
Capsule networks with non-iterative cluster routing
Zhihao Zhao
Samuel Cheng
107
11
0
19 Sep 2021
A Study of the Generalizability of Self-Supervised Representations
A Study of the Generalizability of Self-Supervised Representations
Atharva Tendle
Mohammad Rashedul Hasan
263
32
0
19 Sep 2021
HPTQ: Hardware-Friendly Post Training Quantization
HPTQ: Hardware-Friendly Post Training Quantization
H. Habi
Reuven Peretz
Elad Cohen
Lior Dikstein
Oranit Dror
I. Diamant
Roy H. Jennings
Arnon Netzer
MQ
205
11
0
19 Sep 2021
SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image
  Prediction
SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction
Zekun Li
Yufan Liu
Bing Li
Weiming Hu
Kebin Wu
Chengwei Peng
ViT
131
24
0
18 Sep 2021
Computational Imaging and Artificial Intelligence: The Next Revolution
  of Mobile Vision
Computational Imaging and Artificial Intelligence: The Next Revolution of Mobile Vision
J. Suo
Weihang Zhang
Jin Gong
Xin Yuan
D. Brady
Qionghai Dai
226
40
0
18 Sep 2021
Fast query-by-example speech search using separable model
Fast query-by-example speech search using separable model
Yuguang Yang
Yu Pan
Xin Dong
Minqiang Xu
96
0
0
18 Sep 2021
Towards High-Quality Temporal Action Detection with Sparse Proposals
Towards High-Quality Temporal Action Detection with Sparse Proposals
Jiannan Wu
Pei Sun
Shoufa Chen
Jiewen Yang
Zihao Qi
Lan Ma
Ping Luo
ViT
148
11
0
18 Sep 2021
Screen Parsing: Towards Reverse Engineering of UI Models from
  Screenshots
Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots
Jason Wu
Xiaoyi Zhang
Jeffrey Nichols
Jeffrey P. Bigham
3DV
332
84
0
17 Sep 2021
Multimodal Incremental Transformer with Visual Grounding for Visual
  Dialogue Generation
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Feilong Chen
Fandong Meng
Xiuyi Chen
Peng Li
Jie Zhou
180
25
0
17 Sep 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
Feilong Chen
Xiuyi Chen
Fandong Meng
Peng Li
Jie Zhou
264
37
0
17 Sep 2021
PP-LCNet: A Lightweight CPU Convolutional Neural Network
PP-LCNet: A Lightweight CPU Convolutional Neural Network
Cheng Cui
Tingquan Gao
Shengyun Wei
Yuning Du
Ruoyu Guo
...
X. Lv
Qiwen Liu
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
ObjD
190
163
0
17 Sep 2021
Cross Modification Attention Based Deliberation Model for Image
  Captioning
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
107
8
0
17 Sep 2021
A Multimodal Sentiment Dataset for Video Recommendation
A Multimodal Sentiment Dataset for Video Recommendation
Hongxuan Tang
Hao Liu
Xinyan Xiao
Hua Wu
VGen
82
2
0
17 Sep 2021
Fast-Slow Transformer for Visually Grounding Speech
Fast-Slow Transformer for Visually Grounding Speech
Puyuan Peng
David Harwath
266
34
0
16 Sep 2021
An End-to-End Transformer Model for 3D Object Detection
An End-to-End Transformer Model for 3D Object Detection
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPCViT
419
570
0
16 Sep 2021
Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across
  Objects and Views
Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views
Robert McCraith
Eldar Insafutdinov
Lukás Neumann
Andrea Vedaldi
3DPC
185
10
0
16 Sep 2021
Label Assignment Distillation for Object Detection
Hailun Zhang
80
2
0
16 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for
  Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
191
48
0
16 Sep 2021
Dense Semantic Contrast for Self-Supervised Visual Representation
  Learning
Dense Semantic Contrast for Self-Supervised Visual Representation Learning
Xiaoni Li
Can Ma
Yifei Zhang
Aoting Zhang
Wei Wang
Ning Jiang
Haiying Wu
Weiping Wang
SSL
223
42
0
16 Sep 2021
Few-Shot Object Detection by Attending to Per-Sample-Prototype
Few-Shot Object Detection by Attending to Per-Sample-Prototype
Hojun Lee
Myunggi Lee
Nojun Kwak
ObjD
194
40
0
16 Sep 2021
Exploiting Activation based Gradient Output Sparsity to Accelerate
  Backpropagation in CNNs
Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs
Anup Sarma
Sonali Singh
Huaipan Jiang
Ashutosh Pattnaik
Asit K. Mishra
N. Vijaykrishnan
M. Kandemir
Chita R. Das
151
5
0
16 Sep 2021
Partner-Assisted Learning for Few-Shot Image Classification
Partner-Assisted Learning for Few-Shot Image Classification
Jiawei Ma
Hanchen Xie
G. Han
Shih-Fu Chang
Aram Galstyan
Wael AbdAlmageed
VLM
176
76
0
15 Sep 2021
Deep Bregman Divergence for Contrastive Learning of Visual
  Representations
Deep Bregman Divergence for Contrastive Learning of Visual Representations
Mina Rezaei
Farzin Soleymani
J. Herbinger
Shekoofeh Azizi
SSL
181
18
0
15 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based
  Visual Question Answering
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
298
69
0
15 Sep 2021
What Vision-Language Models `See' when they See Scenes
What Vision-Language Models `See' when they See Scenes
Michele Cafagna
Kees van Deemter
Albert Gatt
VLM
259
13
0
15 Sep 2021
Progressive Hard-case Mining across Pyramid Levels for Object Detection
Progressive Hard-case Mining across Pyramid Levels for Object Detection
Binghong Wu
Yehui Yang
Dalu Yang
Junde Wu
Xiaorong Wang
Haifeng Huang
Lei Wang
Yanwu Xu
ObjD
131
0
0
15 Sep 2021
FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view
  Physical Adversarial Attack
FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack
Donghua Wang
Tingsong Jiang
Jialiang Sun
Weien Zhou
Xiaoya Zhang
Zhiqiang Gong
Wen Yao
Xiaoqian Chen
AAML
302
135
0
15 Sep 2021
ROW-SLAM: Under-Canopy Cornfield Semantic SLAM
ROW-SLAM: Under-Canopy Cornfield Semantic SLAM
Jiacheng Yuan
Jungseok Hong
Junaed Sattar
Volkan Isler
156
9
0
15 Sep 2021
Anchor DETR: Query Design for Transformer-Based Object Detection
Anchor DETR: Query Design for Transformer-Based Object Detection
Yingming Wang
Xinming Zhang
Tong Yang
Jian Sun
ViT
202
67
0
15 Sep 2021
PnP-DETR: Towards Efficient Visual Analysis with Transformers
PnP-DETR: Towards Efficient Visual Analysis with Transformers
Tao Wang
Li Yuan
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
ViT
189
117
0
15 Sep 2021
A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images
A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images
A. Borundiya
Arshak Navruzyan
Dennis Igoschev
F. C. Oughali
H. Pasupuleti
Mike Fuller
Vinay Kanigicherla
T. Kashyap
Rishabh Chaurasia
Sonali Vinod Jain
MedIm
64
0
0
14 Sep 2021
Multi-Scale Aligned Distillation for Low-Resolution Detection
Multi-Scale Aligned Distillation for Low-Resolution Detection
Lu Qi
Jason Kuen
Jiuxiang Gu
Zhe Lin
Yi Wang
Yukang Chen
Yanwei Li
Jiaya Jia
188
65
0
14 Sep 2021
AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance
AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance
Xiangcheng Liu
Jian Cao
Hongyi Yao
Wenyu Sun
Yuan Zhang
166
3
0
14 Sep 2021
DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection
DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection
Steven Lang
Fabrizio G. Ventola
Kristian Kersting
439
20
0
13 Sep 2021
Previous
123...118119120...261262263
Next