Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1506.01497
Cited By
v1
v2
v3 (latest)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"
50 / 13,130 papers shown
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil
Cheng Zhang
D. Xuan
Wei-Lun Chao
264
23
0
13 Sep 2021
Weakly Supervised Person Search with Region Siamese Networks
Chuchu Han
Kai Su
Dongdong Yu
Zehuan Yuan
Changxin Gao
Nong Sang
Yi Yang
Changhu Wang
165
25
0
13 Sep 2021
xGQA: Cross-Lingual Visual Question Answering
Jonas Pfeiffer
Gregor Geigle
Aishwarya Kamath
Jan-Martin O. Steitz
Stefan Roth
Ivan Vulić
Iryna Gurevych
357
78
0
13 Sep 2021
Learning to Ground Visual Objects for Visual Dialog
Feilong Chen
Xiuyi Chen
Can Xu
Daxin Jiang
OOD
189
18
0
13 Sep 2021
Mutual Supervision for Dense Object Detection
Ziteng Gao
Limin Wang
Gangshan Wu
224
38
0
13 Sep 2021
UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation
Zhengkun Zhang
Xiaojun Meng
Yasheng Wang
Xin Jiang
Qun Liu
Zhenglu Yang
173
57
0
13 Sep 2021
Adversarially Trained Object Detector for Unsupervised Domain Adaptation
Kazuma Fujii
Hiroshi Kera
K. Kawamoto
ObjD
AAML
167
5
0
13 Sep 2021
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation
Zechen Bai
Yuta Nakashima
Noa Garcia
228
48
0
13 Sep 2021
Domain Adaptation by Maximizing Population Correlation with Neural Architecture Search
Zhixiong Yue
Pengxin Guo
Yu Zhang
178
1
0
12 Sep 2021
DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos
Negin Ghamsarian
M. Taschwer
Klaus Schoeffmann
162
14
0
11 Sep 2021
BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation
Naina Dhingra
Florian Ritter
A. Kunz
213
43
0
11 Sep 2021
COSMic: A Coherence-Aware Generation Metric for Image Descriptions
Mert Inan
P. Sharma
Baber Khalid
Radu Soricut
Matthew Stone
Malihe Alikhani
EGVM
154
14
0
11 Sep 2021
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets
Shraman Pramanick
Shivam Sharma
Dimitar Dimitrov
Md. Shad Akhtar
Preslav Nakov
Tanmoy Chakraborty
215
166
0
11 Sep 2021
Partially-Supervised Novel Object Captioning Leveraging Context from Paired Data
Shashank Bujimalla
Mahesh Subedar
Omesh Tickoo
193
1
0
10 Sep 2021
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zhenzhi Wang
Limin Wang
Tao Wu
Tianhao Li
Gangshan Wu
AI4TS
330
153
0
10 Sep 2021
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Stella Frank
Emanuele Bugliarello
Desmond Elliott
181
93
0
09 Sep 2021
TxT: Crossmodal End-to-End Learning with Transformers
German Conference on Pattern Recognition (DAGM), 2021
Jan-Martin O. Steitz
Jonas Pfeiffer
Iryna Gurevych
Stefan Roth
LRM
125
2
0
09 Sep 2021
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining
Computer Vision and Pattern Recognition (CVPR), 2021
Xiao Dong
Xunlin Zhan
Yangxin Wu
Yunchao Wei
Michael C. Kampffmeyer
Xiaoyong Wei
Minlong Lu
Yaowei Wang
Xiaodan Liang
568
46
0
09 Sep 2021
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection
IEEE Transactions on Image Processing (TIP), 2021
Dong-Jin Kim
Xiao Sun
Jinsoo Choi
Stephen Lin
In So Kweon
208
25
0
09 Sep 2021
Generation, augmentation, and alignment: A pseudo-source domain based method for source-free domain adaptation
Machine-mediated learning (ML), 2021
Yuntao Du
Haiyang Yang
Mingcai Chen
Juan Jiang
Hongtao Luo
Chongjun Wang
TTA
217
39
0
09 Sep 2021
Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models
AAAI Conference on Artificial Intelligence (AAAI), 2021
Steven Y. Feng
Kevin Lu
Zhuofu Tao
Malihe Alikhani
Teruko Mitamura
Eduard H. Hovy
Varun Gangal
LRM
223
14
0
08 Sep 2021
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
Computer Vision and Pattern Recognition (CVPR), 2021
Zhiqi Li
Wenhai Wang
Enze Xie
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
Tong Lu
ViT
362
171
0
08 Sep 2021
Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2021
Nan Xue
Tianfu Wu
Gui-Song Xia
Guang Dai
3DH
341
39
0
08 Sep 2021
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
119
1
0
08 Sep 2021
Temporal RoI Align for Video Object Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tao Gong
Kai-xiang Chen
Xinjiang Wang
Qi Chu
Feng Zhu
Dahua Lin
Nenghai Yu
Huamin Feng
150
95
0
08 Sep 2021
VideoModerator: A Risk-aware Framework for Multimodal Video Moderation in E-Commerce
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021
Tan Tang
Yanhong Wu
Lingyun Yu
Yuhong Li
Yingcai Wu
183
30
0
08 Sep 2021
YouRefIt: Embodied Reference Understanding with Language and Gesture
IEEE International Conference on Computer Vision (ICCV), 2021
Yixin Chen
Qing Li
Deqian Kong
Yik Lun Kei
Song-Chun Zhu
Tao Gao
Yixin Zhu
Siyuan Huang
LM&Ro
229
48
0
08 Sep 2021
Tom: Leveraging trend of the observed gradients for faster convergence
Anirudh Maiya
Inumella Sricharan
Anshuman Pandey
S. SrinivasK.
ODL
90
0
0
07 Sep 2021
Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Chuanguang Yang
Zhulin An
Linhang Cai
Yongjun Xu
256
23
0
07 Sep 2021
Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Computer Vision and Image Understanding (CVIU), 2021
Jungkyoo Shin
Jinyoung Moon
149
8
0
07 Sep 2021
Adversarial Parameter Defense by Multi-Step Risk Minimization
Neural Networks (NN), 2021
Zhiyuan Zhang
Ruixuan Luo
Xuancheng Ren
Qi Su
Liangyou Li
Xu Sun
AAML
159
7
0
07 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
242
32
0
07 Sep 2021
Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond
SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2021
M. Ponti
Fernando Pereira dos Santos
Leo Sampaio Ferraz Ribeiro
G. B. Cavallari
162
18
0
06 Sep 2021
Active Perception with Neural Networks
Elijah S. Lee
AI4CE
150
1
0
06 Sep 2021
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection
IEEE International Conference on Computer Vision (ICCV), 2021
Jiageng Mao
Minzhe Niu
Haoyue Bai
Xiaodan Liang
Hang Xu
Chunjing Xu
3DPC
178
164
0
06 Sep 2021
Automatic Segmentation of the Optic Nerve Head Region in Optical Coherence Tomography: A Methodological Review
Rita Marques
D. Jesus
J. Barbosa-Breda
Jan Van Eijgen
Ingeborg Stalmans
T. van Walsum
S. Klein
Pedro G. Vaz
Luisa Sánchez Brea
71
10
0
06 Sep 2021
Reasoning Graph Networks for Kinship Verification: from Star-shaped to Hierarchical
Wanhua Li
Jiwen Lu
Abudukelimu Wuerkaixi
Jianjiang Feng
Jie Zhou
142
23
0
06 Sep 2021
Parsing Table Structures in the Wild
Rujiao Long
Wen Wang
Nan Xue
Feiyu Gao
Zhibo Yang
Yongpan Wang
Gui-Song Xia
LMTD
219
67
0
06 Sep 2021
Robust Attentive Deep Neural Network for Exposing GAN-generated Faces
Hui Guo
Shu Hu
Xin Wang
Ming-Ching Chang
Siwei Lyu
CVBM
281
49
0
05 Sep 2021
Identification of Driver Phone Usage Violations via State-of-the-Art Object Detection with Tracking
S. Carrell
Amir Atapour-Abarghouei
133
5
0
05 Sep 2021
Hierarchical Object-to-Zone Graph for Object Navigation
Sixian Zhang
Xinhang Song
Yubing Bai
Weijie Li
Yakui Chu
Shuqiang Jiang
259
83
0
05 Sep 2021
Training Meta-Surrogate Model for Transferable Adversarial Attack
Yunxiao Qin
Yuanhao Xiong
Jinfeng Yi
Cho-Jui Hsieh
AAML
281
26
0
05 Sep 2021
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation
Mohammad Abuzar Shaikh
Zhanghexuan Ji
Dana Moukheiber
Yan Shen
S. Srihari
Mingchen Gao
VLM
153
1
0
04 Sep 2021
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
162
19
0
04 Sep 2021
ISyNet: Convolutional Neural Networks design for AI accelerator
Alexey Letunovskiy
Vladimir Korviakov
V. Polovnikov
Anastasiia Kargapoltseva
I. Mazurenko
Yepan Xiong
219
1
0
04 Sep 2021
Stimuli-Aware Visual Emotion Analysis
Jingyuan Yang
Jie Li
Xiumei Wang
Yuxuan Ding
Xinbo Gao
121
75
0
04 Sep 2021
A Comprehensive Approach for UAV Small Object Detection with Simulation-based Transfer Learning and Adaptive Fusion
Rui Chen
Youwei Guo
Huafei Zheng
Hongyu Jiang
154
18
0
04 Sep 2021
Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan
Xiao Chang
Minnan Luo
Huan Liu
Xiaoqin Zhang
Qinghua Zheng
ObjD
VLM
260
95
0
04 Sep 2021
Ordinal Pooling
Adrien Deliège
M. Istasse
Ashwani Kumar
Christophe De Vleeschouwer
Marc Van Droogenbroeck
129
12
0
03 Sep 2021
DeepTracks: Geopositioning Maritime Vehicles in Video Acquired from a Moving Platform
Jianli Wei
Guanyu Xu
Alper Yilmaz
145
1
0
02 Sep 2021
Previous
1
2
3
...
119
120
121
...
261
262
263
Next