Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat, ObjD

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 12,974 papers shown
Title
RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection
W. Qian
Xue Yang
Silong Peng
Junchi Yan
Xiujuan Zhang
144
54
0
24 Sep 2021
Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
Pau Riba
S. Dey
Ali Furkan Biten
Josep Llados
164
3
0
24 Sep 2021
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
Yuan Yao
Ao Zhang
Zhengyan Zhang
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
MLLM, VPVLM, VLM
532
243
0
24 Sep 2021
Dense Contrastive Visual-Linguistic Pretraining
ACM Multimedia (ACM MM), 2021
Lei Shi
Kai Shuang
Shijie Geng
Shiyang Feng
Zuohui Fu
Gerard de Melo
Yunpeng Chen
Sen Su
VLM, SSL
214
11
0
24 Sep 2021
LGD: Label-guided Self-distillation for Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2021
Peizhen Zhang
Zijian Kang
Tong Yang
Xinming Zhang
N. Zheng
Jian Sun
ObjD
333
37
0
23 Sep 2021
Scene Graph Generation for Better Image Captioning?
Maximilian Mozes
Martin Schmitt
Vladimir Golkov
Hinrich Schütze
Zorah Lähner
GNN
172
5
0
23 Sep 2021
Towards Generalized and Incremental Few-Shot Object Detection
Yiting Li
H. Zhu
Jun Ma
C. Teo
Chen Xiang
P. Vadakkepat
T. Lee
CLL, ObjD
141
10
0
23 Sep 2021
Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it?
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021
Tobias Norlund
Lovisa Hagström
Richard Johansson
237
25
0
23 Sep 2021
Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling
IEEE International Conference on Robotics and Automation (ICRA), 2021
S. Back
Joosoon Lee
Taewon Kim
Sangjun Noh
Raeyoung Kang
Seongho Bak
Kyoobin Lee
195
81
0
23 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
International Conference on Learning Representations (ICLR), 2021
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM, ViT, VLM
570
406
0
22 Sep 2021
Natural Language Video Localization with Learnable Moment Proposals
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Shaoning Xiao
Long Chen
Jian Shao
Yueting Zhuang
Jun Xiao
157
45
0
22 Sep 2021
A deep neural network for multi-species fish detection using multiple acoustic cameras
Garcia Fernandez Guglielmo
François Martignac
M. Nevoux
L. Beaulaton
Thomas Corpetti
98
1
0
22 Sep 2021
COVR: A test-bed for Visually Grounded Compositional Generalization with real images
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ben Bogin
Shivanshu Gupta
Matt Gardner
Jonathan Berant
CoGe
161
30
0
22 Sep 2021
MVM3Det: A Novel Method for Multi-view Monocular 3D Detection
Haoran Li
Zicheng Duan
Mingjun Ma
Yaran Chen
Jiaqi Li
Dong Zhao
3DPC
148
6
0
22 Sep 2021
Robust Visual Teach and Repeat for UGVs Using 3D Semantic Maps
Mohammad Mahdavian
KangKang Yin
Mo Chen
114
4
0
21 Sep 2021
Towards a Real-Time Facial Analysis System
Bishwo Adhikari
Xingyang Ni
Esa Rahtu
H. Huttunen
CVBM
75
2
0
21 Sep 2021
Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram
Xinyi Yu
M.-W. Lin
Jiangping Lu
L. Ou
271
10
0
21 Sep 2021
KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
Xingyu Liu
Shun Iwase
Kris Kitani
3DPC
161
17
0
21 Sep 2021
StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation
Xingyu Liu
Shun Iwase
Kris Kitani
3DV
215
53
0
21 Sep 2021
Bayesian Confidence Calibration for Epistemic Uncertainty Modelling
Fabian Küppers
Jan Kronenberger
Jonas Schneider
Anselm Haselhoff
UQCV, BDL
121
11
0
21 Sep 2021
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM, ViT
181
49
0
21 Sep 2021
Object Detection in Thermal Spectrum for Advanced Driver-Assistance Systems (ADAS)
Muhammad Ali Farooq
Peter Corcoran
C. Rotariu
Waseem Shariff
123
43
0
20 Sep 2021
Background-Foreground Segmentation for Interior Sensing in Automotive Industry
Claudia Drygala
Matthias Rottmann
Hanno Gottschalk
Klaus Friedrichs
Thomas Kurbiel
127
2
0
20 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
127
4
0
20 Sep 2021
Learning Versatile Convolution Filters for Efficient Visual Recognition
Kai Han
Yunhe Wang
Chang Xu
Chunjing Xu
Enhua Wu
Dacheng Tao
112
8
0
20 Sep 2021
Capsule networks with non-iterative cluster routing
Zhihao Zhao
Samuel Cheng
98
11
0
19 Sep 2021
A Study of the Generalizability of Self-Supervised Representations
Atharva Tendle
Mohammad Rashedul Hasan
228
32
0
19 Sep 2021
HPTQ: Hardware-Friendly Post Training Quantization
H. Habi
Reuven Peretz
Elad Cohen
Lior Dikstein
Oranit Dror
I. Diamant
Roy H. Jennings
Arnon Netzer
MQ
193
11
0
19 Sep 2021
SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction
Zekun Li
Yufan Liu
Bing Li
Weiming Hu
Kebin Wu
Chengwei Peng
ViT
112
24
0
18 Sep 2021
Computational Imaging and Artificial Intelligence: The Next Revolution of Mobile Vision
J. Suo
Weihang Zhang
Jin Gong
Xin Yuan
D. Brady
Qionghai Dai
195
35
0
18 Sep 2021
Fast query-by-example speech search using separable model
Yuguang Yang
Yu Pan
Xin Dong
Minqiang Xu
84
0
0
18 Sep 2021
Towards High-Quality Temporal Action Detection with Sparse Proposals
Jiannan Wu
Pei Sun
Shoufa Chen
Jiewen Yang
Zihao Qi
Lan Ma
Ping Luo
ViT
125
11
0
18 Sep 2021
Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots
Jason Wu
Xiaoyi Zhang
Jeffrey Nichols
Jeffrey P. Bigham
3DV
315
84
0
17 Sep 2021
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Feilong Chen
Fandong Meng
Xiuyi Chen
Peng Li
Jie Zhou
151
24
0
17 Sep 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
Feilong Chen
Xiuyi Chen
Fandong Meng
Peng Li
Jie Zhou
245
36
0
17 Sep 2021
PP-LCNet: A Lightweight CPU Convolutional Neural Network
Cheng Cui
Tingquan Gao
Shengyun Wei
Yuning Du
Ruoyu Guo
...
X. Lv
Qiwen Liu
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
ObjD
167
161
0
17 Sep 2021
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
102
7
0
17 Sep 2021
A Multimodal Sentiment Dataset for Video Recommendation
Hongxuan Tang
Hao Liu
Xinyan Xiao
Hua Wu
VGen
66
2
0
17 Sep 2021
Fast-Slow Transformer for Visually Grounding Speech
Puyuan Peng
David Harwath
210
34
0
16 Sep 2021
An End-to-End Transformer Model for 3D Object Detection
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC, ViT
343
565
0
16 Sep 2021
Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views
Robert McCraith
Eldar Insafutdinov
Lukás Neumann
Andrea Vedaldi
3DPC
155
10
0
16 Sep 2021
Label Assignment Distillation for Object Detection
Hailun Zhang
80
2
0
16 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
166
46
0
16 Sep 2021
Dense Semantic Contrast for Self-Supervised Visual Representation Learning
Xiaoni Li
Yu Zhou
Yifei Zhang
Aoting Zhang
Wei Wang
Ning Jiang
Haiying Wu
Weiping Wang
SSL
198
42
0
16 Sep 2021
Few-Shot Object Detection by Attending to Per-Sample-Prototype
Hojun Lee
Myunggi Lee
Nojun Kwak
ObjD
161
39
0
16 Sep 2021
Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs
Anup Sarma
Sonali Singh
Huaipan Jiang
Ashutosh Pattnaik
Asit K. Mishra
N. Vijaykrishnan
M. Kandemir
Chita R. Das
146
5
0
16 Sep 2021
Partner-Assisted Learning for Few-Shot Image Classification
Jiawei Ma
Hanchen Xie
G. Han
Shih-Fu Chang
Aram Galstyan
Wael AbdAlmageed
VLM
164
76
0
15 Sep 2021
Deep Bregman Divergence for Contrastive Learning of Visual Representations
Mina Rezaei
Farzin Soleymani
J. Herbinger
Shekoofeh Azizi
SSL
173
18
0
15 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
260
67
0
15 Sep 2021
What Vision-Language Models `See' when they See Scenes
Michele Cafagna
Kees van Deemter
Albert Gatt
VLM
229
13
0
15 Sep 2021