1506.01497
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 12,969 papers shown
Title
A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images
A. Borundiya
Arshak Navruzyan
Dennis Igoschev
F. C. Oughali
H. Pasupuleti
Mike Fuller
Vinay Kanigicherla
T. Kashyap
Rishabh Chaurasia
Sonali Vinod Jain
MedIm
64
0
0
14 Sep 2021
Multi-Scale Aligned Distillation for Low-Resolution Detection
Lu Qi
Jason Kuen
Jiuxiang Gu
Zhe Lin
Yi Wang
Yukang Chen
Yanwei Li
Jiaya Jia
160
65
0
14 Sep 2021
AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance
Xiangcheng Liu
Jian Cao
Hongyi Yao
Wenyu Sun
Yuan Zhang
146
3
0
14 Sep 2021
DAFNe: A One-Stage Anchor-Free Approach for Oriented Object Detection
Steven Lang
Fabrizio G. Ventola
Kristian Kersting
379
19
0
13 Sep 2021
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil
Cheng Zhang
D. Xuan
Wei-Lun Chao
204
23
0
13 Sep 2021
Weakly Supervised Person Search with Region Siamese Networks
Chuchu Han
Kai Su
Dongdong Yu
Zehuan Yuan
Changxin Gao
Nong Sang
Yi Yang
Changhu Wang
144
23
0
13 Sep 2021
xGQA: Cross-Lingual Visual Question Answering
Jonas Pfeiffer
Gregor Geigle
Aishwarya Kamath
Jan-Martin O. Steitz
Stefan Roth
Ivan Vulić
Iryna Gurevych
329
76
0
13 Sep 2021
Learning to Ground Visual Objects for Visual Dialog
Feilong Chen
Xiuyi Chen
Can Xu
Daxin Jiang
OOD
146
18
0
13 Sep 2021
Mutual Supervision for Dense Object Detection
Ziteng Gao
Limin Wang
Gangshan Wu
180
37
0
13 Sep 2021
UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation
Zhengkun Zhang
Xiaojun Meng
Yasheng Wang
Xin Jiang
Qun Liu
Zhenglu Yang
160
54
0
13 Sep 2021
Adversarially Trained Object Detector for Unsupervised Domain Adaptation
Kazuma Fujii
Hiroshi Kera
K. Kawamoto
ObjD
AAML
151
5
0
13 Sep 2021
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation
Zechen Bai
Yuta Nakashima
Noa Garcia
200
48
0
13 Sep 2021
Domain Adaptation by Maximizing Population Correlation with Neural Architecture Search
Zhixiong Yue
Pengxin Guo
Yu Zhang
163
1
0
12 Sep 2021
DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
Negin Ghamsarian
M. Taschwer
Klaus Schoeffmann
129
14
0
11 Sep 2021
BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation
Naina Dhingra
Florian Ritter
A. Kunz
185
43
0
11 Sep 2021
COSMic: A Coherence-Aware Generation Metric for Image Descriptions
Mert Inan
P. Sharma
Baber Khalid
Radu Soricut
Matthew Stone
Malihe Alikhani
EGVM
122
14
0
11 Sep 2021
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets
Shraman Pramanick
Shivam Sharma
Dimitar Dimitrov
Md. Shad Akhtar
Preslav Nakov
Tanmoy Chakraborty
126
161
0
11 Sep 2021
Partially-Supervised Novel Object Captioning Leveraging Context from Paired Data
Shashank Bujimalla
Mahesh Subedar
Omesh Tickoo
182
1
0
10 Sep 2021
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zhenzhi Wang
Limin Wang
Tao Wu
Tianhao Li
Gangshan Wu
AI4TS
258
151
0
10 Sep 2021
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Stella Frank
Emanuele Bugliarello
Desmond Elliott
149
90
0
09 Sep 2021
TxT: Crossmodal End-to-End Learning with Transformers
German Conference on Pattern Recognition (DAGM), 2021
Jan-Martin O. Steitz
Jonas Pfeiffer
Iryna Gurevych
Stefan Roth
LRM
105
2
0
09 Sep 2021
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining
Computer Vision and Pattern Recognition (CVPR), 2021
Xiao Dong
Xunlin Zhan
Yangxin Wu
Yunchao Wei
Michael C. Kampffmeyer
Xiaoyong Wei
Minlong Lu
Yaowei Wang
Xiaodan Liang
515
46
0
09 Sep 2021
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection
IEEE Transactions on Image Processing (TIP), 2021
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
176
25
0
09 Sep 2021
Generation, augmentation, and alignment: A pseudo-source domain based method for source-free domain adaptation
Machine-mediated learning (ML), 2021
Yuntao Du
Haiyang Yang
Mingcai Chen
Juan Jiang
Hongtao Luo
Chongjun Wang
TTA
182
38
0
09 Sep 2021
Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models
AAAI Conference on Artificial Intelligence (AAAI), 2021
Steven Y. Feng
Kevin Lu
Zhuofu Tao
Malihe Alikhani
Teruko Mitamura
Eduard H. Hovy
Varun Gangal
LRM
165
13
0
08 Sep 2021
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zhiqi Li
Wenhai Wang
Enze Xie
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
Tong Lu
ViT
269
169
0
08 Sep 2021
Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2021
Nan Xue
Tianfu Wu
Gui-Song Xia
Guang Dai
3DH
317
38
0
08 Sep 2021
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
97
1
0
08 Sep 2021
Temporal RoI Align for Video Object Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tao Gong
Kai-xiang Chen
Xinjiang Wang
Qi Chu
Feng Zhu
Dahua Lin
Nenghai Yu
Huamin Feng
136
95
0
08 Sep 2021
VideoModerator: A Risk-aware Framework for Multimodal Video Moderation in E-Commerce
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021
Tan Tang
Yanhong Wu
Lingyun Yu
Yuhong Li
Yingcai Wu
170
29
0
08 Sep 2021
YouRefIt: Embodied Reference Understanding with Language and Gesture
IEEE International Conference on Computer Vision (ICCV), 2021
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
199
48
0
08 Sep 2021
Tom: Leveraging trend of the observed gradients for faster convergence
Anirudh Maiya
Inumella Sricharan
Anshuman Pandey
Srinivas K. S.
ODL
90
0
0
07 Sep 2021
Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Chuanguang Yang
Zhulin An
Linhang Cai
Yongjun Xu
228
23
0
07 Sep 2021
Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Computer Vision and Image Understanding (CVIU), 2021
Jungkyoo Shin
Jinyoung Moon
125
8
0
07 Sep 2021
Adversarial Parameter Defense by Multi-Step Risk Minimization
Neural Networks (NN), 2021
Zhiyuan Zhang
Ruixuan Luo
Xuancheng Ren
Qi Su
Liangyou Li
Xu Sun
AAML
114
7
0
07 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
202
32
0
07 Sep 2021
Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond
SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2021
M. Ponti
Fernando Pereira dos Santos
Leo Sampaio Ferraz Ribeiro
G. B. Cavallari
160
18
0
06 Sep 2021
Active Perception with Neural Networks
Elijah S. Lee
AI4CE
137
1
0
06 Sep 2021
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection
IEEE International Conference on Computer Vision (ICCV), 2021
Jiageng Mao
Minzhe Niu
Haoyue Bai
Xiaodan Liang
Hang Xu
Chunjing Xu
3DPC
166
164
0
06 Sep 2021
Automatic Segmentation of the Optic Nerve Head Region in Optical Coherence Tomography: A Methodological Review
Rita Marques
D. Jesus
J. Barbosa-Breda
Jan Van Eijgen
Ingeborg Stalmans
T. van Walsum
S. Klein
Pedro G. Vaz
Luisa Sánchez Brea
62
10
0
06 Sep 2021
Reasoning Graph Networks for Kinship Verification: from Star-shaped to Hierarchical
Wanhua Li
Jiwen Lu
Abudukelimu Wuerkaixi
Jianjiang Feng
Jie Zhou
130
23
0
06 Sep 2021
Parsing Table Structures in the Wild
Rujiao Long
Wen Wang
Nan Xue
Feiyu Gao
Zhibo Yang
Yongpan Wang
Gui-Song Xia
LMTD
200
67
0
06 Sep 2021
Robust Attentive Deep Neural Network for Exposing GAN-generated Faces
Hui Guo
Shu Hu
Xin Wang
Ming-Ching Chang
Siwei Lyu
CVBM
226
48
0
05 Sep 2021
Identification of Driver Phone Usage Violations via State-of-the-Art Object Detection with Tracking
S. Carrell
Amir Atapour-Abarghouei
109
5
0
05 Sep 2021
Hierarchical Object-to-Zone Graph for Object Navigation
Sixian Zhang
Xinhang Song
Yubing Bai
Weijie Li
Yakui Chu
Shuqiang Jiang
235
81
0
05 Sep 2021
Training Meta-Surrogate Model for Transferable Adversarial Attack
Yunxiao Qin
Yuanhao Xiong
Jinfeng Yi
Cho-Jui Hsieh
AAML
221
25
0
05 Sep 2021
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation
Mohammad Abuzar Shaikh
Zhanghexuan Ji
Dana Moukheiber
Yan Shen
S. Srihari
Mingchen Gao
VLM
140
1
0
04 Sep 2021
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
129
19
0
04 Sep 2021
ISyNet: Convolutional Neural Networks design for AI accelerator
Alexey Letunovskiy
Vladimir Korviakov
V. Polovnikov
Anastasiia Kargapoltseva
I. Mazurenko
Yepan Xiong
200
1
0
04 Sep 2021
Stimuli-Aware Visual Emotion Analysis
Jingyuan Yang
Jie Li
Xiumei Wang
Yuxuan Ding
Xinbo Gao
105
74
0
04 Sep 2021