
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015

4 June 2015

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 13,130 papers shown

Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering

Jihyung Kil

Cheng Zhang

D. Xuan

Wei-Lun Chao

264

13 Sep 2021

Weakly Supervised Person Search with Region Siamese Networks

Yi Yang

165

13 Sep 2021

xGQA: Cross-Lingual Visual Question Answering

357

13 Sep 2021

Learning to Ground Visual Objects for Visual Dialog

189

13 Sep 2021

Mutual Supervision for Dense Object Detection

Ziteng Gao

Limin Wang

Gangshan Wu

224

13 Sep 2021

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation

Zhengkun Zhang

Xiaojun Meng

Yasheng Wang

Xin Jiang

Qun Liu

Zhenglu Yang

173

13 Sep 2021

Adversarially Trained Object Detector for Unsupervised Domain Adaptation

167

13 Sep 2021

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Zechen Bai

Yuta Nakashima

Noa Garcia

228

13 Sep 2021

Domain Adaptation by Maximizing Population Correlation with Neural Architecture Search

Zhixiong Yue

Pengxin Guo

Yu Zhang

178

12 Sep 2021

DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Negin Ghamsarian

M. Taschwer

Klaus Schoeffmann

162

11 Sep 2021

BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

Naina Dhingra

Florian Ritter

A. Kunz

213

11 Sep 2021

COSMic: A Coherence-Aware Generation Metric for Image Descriptions

154

11 Sep 2021

MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets

Dimitar Dimitrov

215

166

11 Sep 2021

Partially-Supervised Novel Object Captioning Leveraging Context from Paired Data

Shashank Bujimalla

Mahesh Subedar

Omesh Tickoo

193

10 Sep 2021

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

AAAI Conference on Artificial Intelligence (AAAI), 2021

Gangshan Wu

330

153

10 Sep 2021

Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Stella Frank

Emanuele Bugliarello

Desmond Elliott

181

09 Sep 2021

TxT: Crossmodal End-to-End Learning with Transformers

German Conference on Pattern Recognition (DAGM), 2021

125

09 Sep 2021

M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining

Computer Vision and Pattern Recognition (CVPR), 2021

Michael C. Kampffmeyer

Xiaoyong Wei

Minlong Lu

Yaowei Wang

Xiaodan Liang

568

09 Sep 2021

ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection

IEEE Transactions on Image Processing (TIP), 2021

In So Kweon

208

09 Sep 2021

Generation, augmentation, and alignment: A pseudo-source domain based method for source-free domain adaptation

Machine-mediated learning (ML), 2021

217

09 Sep 2021

Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models

AAAI Conference on Artificial Intelligence (AAAI), 2021

223

08 Sep 2021

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Computer Vision and Pattern Recognition (CVPR), 2021

Ping Luo

362

171

08 Sep 2021

Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation

Computer Vision and Pattern Recognition (CVPR), 2021

341

08 Sep 2021

RefineCap: Concept-Aware Refinement for Image Captioning

119

08 Sep 2021

Temporal RoI Align for Video Object Recognition

AAAI Conference on Artificial Intelligence (AAAI), 2021

Qi Chu

150

08 Sep 2021

VideoModerator: A Risk-aware Framework for Multimodal Video Moderation in E-Commerce

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021

Lingyun Yu

183

08 Sep 2021

YouRefIt: Embodied Reference Understanding with Language and Gesture

IEEE International Conference on Computer Vision (ICCV), 2021

229

08 Sep 2021

Tom: Leveraging trend of the observed gradients for faster convergence

07 Sep 2021

Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

256

07 Sep 2021

Learning to Combine the Modalities of Language and Video for Temporal Moment Localization

Computer Vision and Image Understanding (CVIU), 2021

Jungkyoo Shin

Jinyoung Moon

149

07 Sep 2021

Adversarial Parameter Defense by Multi-Step Risk Minimization

Neural Networks (NN), 2021

Xuancheng Ren

159

07 Sep 2021

Journalistic Guidelines Aware News Image Captioning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

242

07 Sep 2021

Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2021

M. Ponti

Fernando Pereira dos Santos

Leo Sampaio Ferraz Ribeiro

G. B. Cavallari

162

06 Sep 2021

Active Perception with Neural Networks

Elijah S. Lee

AI4CE

150

06 Sep 2021

Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection

IEEE International Conference on Computer Vision (ICCV), 2021

Jiageng Mao

Minzhe Niu

Haoyue Bai

Xiaodan Liang

Hang Xu

Chunjing Xu

3DPC

178

164

06 Sep 2021

Automatic Segmentation of the Optic Nerve Head Region in Optical Coherence Tomography: A Methodological Review

06 Sep 2021

Reasoning Graph Networks for Kinship Verification: from Star-shaped to Hierarchical

Wanhua Li

Jiwen Lu

Abudukelimu Wuerkaixi

Jianjiang Feng

Jie Zhou

142

06 Sep 2021

Parsing Table Structures in the Wild

219

06 Sep 2021

Robust Attentive Deep Neural Network for Exposing GAN-generated Faces

281

05 Sep 2021

Identification of Driver Phone Usage Violations via State-of-the-Art Object Detection with Tracking

S. Carrell

Amir Atapour-Abarghouei

133

05 Sep 2021

Hierarchical Object-to-Zone Graph for Object Navigation

259

05 Sep 2021

Training Meta-Surrogate Model for Transferable Adversarial Attack

281

05 Sep 2021

LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation

Mohammad Abuzar Shaikh

153

04 Sep 2021

Weakly Supervised Relative Spatial Reasoning for Visual Question Answering

Yezhou Yang

162

04 Sep 2021

ISyNet: Convolutional Neural Networks design for AI accelerator

Alexey Letunovskiy

Vladimir Korviakov

V. Polovnikov

Anastasiia Kargapoltseva

I. Mazurenko

Yepan Xiong

219

04 Sep 2021

Stimuli-Aware Visual Emotion Analysis

Jingyuan Yang

Jie Li

Xiumei Wang

Yuxuan Ding

Xinbo Gao

121

04 Sep 2021

A Comprehensive Approach for UAV Small Object Detection with Simulation-based Transfer Learning and Adaptive Fusion

154

04 Sep 2021

Semantics-Guided Contrastive Network for Zero-Shot Object detection

Caixia Yan

260

04 Sep 2021

Christophe De Vleeschouwer

Marc Van Droogenbroeck

129

03 Sep 2021

DeepTracks: Geopositioning Maritime Vehicles in Video Acquired from a Moving Platform

Jianli Wei

Guanyu Xu

Alper Yilmaz

145

02 Sep 2021