
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015

4 June 2015

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 13,130 papers shown

Pix2seq: A Language Modeling Framework for Object Detection

International Conference on Learning Representations (ICLR), 2021

David J. Fleet

640

407

22 Sep 2021

Natural Language Video Localization with Learnable Moment Proposals

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

170

22 Sep 2021

A deep neural network for multi-species fish detection using multiple acoustic cameras

Garcia Fernandez Guglielmo

François Martignac

M. Nevoux

L. Beaulaton

Thomas Corpetti

122

22 Sep 2021

COVR: A test-bed for Visually Grounded Compositional Generalization with real images

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

174

22 Sep 2021

MVM3Det: A Novel Method for Multi-view Monocular 3D Detection

188

22 Sep 2021

Robust Visual Teach and Repeat for UGVs Using 3D Semantic Maps

Mohammad Mahdavian

KangKang Yin

Mo Chen

126

21 Sep 2021

Towards a Real-Time Facial Analysis System

Esa Rahtu

21 Sep 2021

Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram

281

21 Sep 2021

KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation

166

21 Sep 2021

StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation

223

21 Sep 2021

Bayesian Confidence Calibration for Epistemic Uncertainty Modelling

159

21 Sep 2021

Survey: Transformer based Video-Language Pre-training

Ludan Ruan

Qin Jin

VLM ViT

205

21 Sep 2021

Object Detection in Thermal Spectrum for Advanced Driver-Assistance Systems (ADAS)

135

20 Sep 2021

Background-Foreground Segmentation for Interior Sensing in Automotive Industry

151

20 Sep 2021

Learning Natural Language Generation from Scratch

Olivier Pietquin

147

20 Sep 2021

Learning Versatile Convolution Filters for Efficient Visual Recognition

133

20 Sep 2021

Capsule networks with non-iterative cluster routing

Zhihao Zhao

Samuel Cheng

107

19 Sep 2021

A Study of the Generalizability of Self-Supervised Representations

Atharva Tendle

Mohammad Rashedul Hasan

263

19 Sep 2021

HPTQ: Hardware-Friendly Post Training Quantization

205

19 Sep 2021

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction

Bing Li

131

18 Sep 2021

Computational Imaging and Artificial Intelligence: The Next Revolution of Mobile Vision

226

18 Sep 2021

Fast query-by-example speech search using separable model

Yuguang Yang

Yu Pan

Xin Dong

Minqiang Xu

18 Sep 2021

Towards High-Quality Temporal Action Detection with Sparse Proposals

148

18 Sep 2021

Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots

332

17 Sep 2021

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Feilong Chen

Fandong Meng

Xiuyi Chen

Peng Li

Jie Zhou

180

17 Sep 2021

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog

Feilong Chen

Xiuyi Chen

Fandong Meng

Peng Li

Jie Zhou

264

17 Sep 2021

PP-LCNet: A Lightweight CPU Convolutional Neural Network

...

Dianhai Yu

190

163

17 Sep 2021

Cross Modification Attention Based Deliberation Model for Image Captioning

107

17 Sep 2021

A Multimodal Sentiment Dataset for Video Recommendation

17 Sep 2021

Fast-Slow Transformer for Visually Grounding Speech

Puyuan Peng

David Harwath

266

16 Sep 2021

An End-to-End Transformer Model for 3D Object Detection

419

570

16 Sep 2021

Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

Andrea Vedaldi

185

16 Sep 2021

Label Assignment Distillation for Object Detection

Hailun Zhang

16 Sep 2021

Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning

191

16 Sep 2021

Dense Semantic Contrast for Self-Supervised Visual Representation Learning

223

16 Sep 2021

Few-Shot Object Detection by Attending to Per-Sample-Prototype

194

16 Sep 2021

Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs

151

16 Sep 2021

Partner-Assisted Learning for Few-Shot Image Classification

Hanchen Xie

176

15 Sep 2021

Deep Bregman Divergence for Contrastive Learning of Visual Representations

181

15 Sep 2021

Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering

Ander Salaberria

Gorka Azkune

Oier López de Lacalle

Aitor Soroa Etxabe

Eneko Agirre

298

15 Sep 2021

What Vision-Language Models 'See' when they See Scenes

259

15 Sep 2021

Progressive Hard-case Mining across Pyramid Levels for Object Detection

Yanwu Xu

131

15 Sep 2021

FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack

302

135

15 Sep 2021

ROW-SLAM: Under-Canopy Cornfield Semantic SLAM

156

15 Sep 2021

Anchor DETR: Query Design for Transformer-Based Object Detection

202

15 Sep 2021

PnP-DETR: Towards Efficient Visual Analysis with Transformers

189

117

15 Sep 2021

A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images

14 Sep 2021

Multi-Scale Aligned Distillation for Low-Resolution Detection

Jiuxiang Gu

Yi Wang

188

14 Sep 2021

AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance

Yuan Zhang

166

14 Sep 2021

DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection

Steven Lang

Fabrizio G. Ventola

Kristian Kersting

439

13 Sep 2021