Title
Grafit: Learning fine-grained image representations with coarse labelsIEEE International Conference on Computer Vision (ICCV), 2020 Hugo Touvron Alexandre Sablayrolles Matthijs Douze Matthieu Cord Edouard Grave SSL 143 73 0 25 Nov 2020
Insights From A Large-Scale Database of Material Depictions In Paintings Hubert Lin Mitchell J. P. van Zuijlen M. Wijntjes S. Pont Kavita Bala 186 8 0 24 Nov 2020
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color HistogramsComputer Vision and Pattern Recognition (CVPR), 2020 Mahmoud Afifi Marcus A. Brubaker M. S. Brown GAN 240 126 0 23 Nov 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection TasksIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020 Kemal Oksuz Baris Can Cam Sinan Kalkan Emre Akbas 204 39 0 21 Nov 2020
Open-Vocabulary Object Detection Using CaptionsComputer Vision and Pattern Recognition (CVPR), 2020 Alireza Zareian Kevin Dela Rosa Derek Hao Hu Shih-Fu Chang VLM ObjD 384 531 0 20 Nov 2020
Towards Abstract Relational Learning in Human Robot Interaction Mohamadreza Faridghasemnia Daniele Nardi A. Saffiotti 96 2 0 20 Nov 2020
Efficient Conditional Pre-training for Transfer Learning Shuvam Chakraborty Burak Uzkent Kumar Ayush Kumar Tanmay Evan Sheehan Stefano Ermon VLM 490 20 0 20 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain Human-like BiasesConference on Fairness, Accountability and Transparency (FAccT), 2020 Ryan Steed Aylin Caliskan SSL 290 172 0 28 Oct 2020
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions Liunian Harold Li Haoxuan You Zhecan Wang Alireza Zareian Shih-Fu Chang Kai-Wei Chang SSL VLM 183 12 0 24 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph Jingkang Yang Weirong Chen Xue Jiang Xiaopeng Yan Huabin Zheng Wayne Zhang NoLa 117 13 0 12 Oct 2020
A Unified Framework for Generic, Query-Focused, Privacy Preserving and Update Summarization using Submodular Information Measures Vishal Kaushal Suraj Kothawade Ganesh Ramakrishnan J. Bilmes Himanshu Asnani Rishabh K. Iyer 88 6 0 12 Oct 2020
Homography Estimation with Convolutional Neural Networks Under Conditions of Variance David Niblick A. Kak 187 4 0 02 Oct 2020
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns L. Ferreira Douglas De Rizzo Meneghetti P. Santos 71 2 0 02 Oct 2020
Asymmetric Loss For Multi-Label Classification Emanuel Ben-Baruch T. Ridnik Nadav Zamir Asaf Noy Itamar Friedman M. Protter Lihi Zelnik-Manor 360 683 0 29 Sep 2020
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning Xiaowei Hu Xi Yin Kevin Qinghong Lin Lijuan Wang Guang Dai Jianfeng Gao Zicheng Liu VLM 203 58 0 28 Sep 2020
MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object DetectionEuropean Conference on Computer Vision (ECCV), 2020 Xin Lu Quanquan Li Buyu Li Junjie Yan ObjD 155 62 0 24 Sep 2020
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks Zhiqiang Shen Marios Savvides 253 68 0 17 Sep 2020
Knowledge Guided Learning: Towards Open Domain Egocentric Action Recognition with Zero Supervision Sathyanarayanan N. Aakur Sanjoy Kundu Nikhil Gunti EgoV 127 1 0 16 Sep 2020
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal GenerationAAAI Conference on Artificial Intelligence (AAAI), 2020 Haisheng Su Weihao Gan Wei Wu Yu Qiao Junjie Yan 208 136 0 15 Sep 2020
Adaptive Label Smoothing Ujwal Krothapalli A. Lynn Abbott 187 11 0 14 Sep 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection ModelsInternational Conference on Computational Linguistics (COLING), 2020 Khyathi Chandu Piyush Sharma Soravit Changpinyo Ashish V. Thapliyal Radu Soricut DiffM VLM 201 3 0 10 Sep 2020
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask Jingru Tan Qiang Chen Hanming Deng Changbao Wang Lewei Lu Quanquan Li Jifeng Dai 144 19 0 03 Sep 2020
A Cost-Effective Person-Following System for Assistive Unmanned Vehicles with Deep Learning at the Edge A. Boschi Francesco Salvetti Vittorio Mazzia Marcello Chiaberge 180 14 0 31 Aug 2020
Soliciting Human-in-the-Loop User Feedback for Interactive Machine Learning Reduces User Trust and Impressions of Model AccuracyAAAI Conference on Human Computation & Crowdsourcing (HCOMP), 2020 Donald R. Honeycutt Mahsan Nourani Eric D. Ragan HAI 225 73 0 28 Aug 2020
Domain Adaptation Through Task DistillationEuropean Conference on Computer Vision (ECCV), 2020 Brady Zhou Nimit Kalra Philipp Krahenbuhl OOD 144 16 0 27 Aug 2020
DeepSOCIAL: Social Distancing Monitoring and Infection Risk Assessment in COVID-19 PandemicmedRxiv (medRxiv), 2020 Mahdi Rezaei Mohsen Azarmi 255 159 0 26 Aug 2020
Object Detection with a Unified Label Space from Multiple Datasets Xiangyu Zhao S. Schulter Gaurav Sharma Yi-Hsuan Tsai Manmohan Chandraker Ying Nian Wu ObjD 164 80 0 15 Aug 2020
What leads to generalization of object proposals? Rui Wang D. Mahajan Vignesh Ramanathan ObjD 183 12 0 13 Aug 2020
Guided Collaborative Training for Pixel-wise Semi-Supervised LearningEuropean Conference on Computer Vision (ECCV), 2020 Zhanghan Ke Di Qiu Kaican Li Qiong Yan Rynson W. H. Lau 237 282 0 12 Aug 2020
BREEDS: Benchmarks for Subpopulation ShiftInternational Conference on Learning Representations (ICLR), 2020 Shibani Santurkar Dimitris Tsipras Aleksander Madry OOD 187 189 0 11 Aug 2020
Polysemy Deciphering Network for Robust Human-Object Interaction DetectionInternational Journal of Computer Vision (IJCV), 2020 Xubin Zhong Changxing Ding X. Qu Dacheng Tao 309 62 0 07 Aug 2020
Multiple instance learning on deep features for weakly supervised object detection with extreme domain shifts Nicolas Gonthier Saïd Ladjal Y. Gousseau WSOD 387 31 0 03 Aug 2020
Spatially Aware Multimodal Transformers for TextVQAEuropean Conference on Computer Vision (ECCV), 2020 Yash Kant Dhruv Batra Peter Anderson Alex Schwing Devi Parikh Jiasen Lu Harsh Agrawal 179 93 0 23 Jul 2020
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020 Haisheng Su Jinyuan Feng Hao Shao Zhenyu Jiang Manyuan Zhang Wei Wu Yu Liu Jiaming Song Junjie Yan 136 0 0 20 Jul 2020
Boosting Weakly Supervised Object Detection with Progressive Knowledge TransferEuropean Conference on Computer Vision (ECCV), 2020 Yuanyi Zhong Jianfeng Wang Jian-wei Peng Lei Zhang 148 57 0 15 Jul 2020
COBE: Contextualized Object Embeddings from Narrated Instructional VideoNeural Information Processing Systems (NeurIPS), 2020 Gedas Bertasius Lorenzo Torresani 185 27 0 14 Jul 2020
Deep learning for scene recognition from visual data: a survey Alina Matei A. Glavan Estefanía Talavera 160 19 0 03 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image Classification Rohan Taori Achal Dave Vaishaal Shankar Nicholas Carlini Benjamin Recht Ludwig Schmidt OOD 466 623 0 01 Jul 2020
Recurrent Relational Memory Network for Unsupervised Image CaptioningInternational Joint Conference on Artificial Intelligence (IJCAI), 2020 Dan Guo Yang Wang Peipei Song Meng Wang GAN 168 42 0 24 Jun 2020
Large image datasets: A pyrrhic win for computer vision?IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020 Vinay Uday Prabhu Abeba Birhane 300 402 0 24 Jun 2020
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning AttacksInternational Conference on Machine Learning (ICML), 2020 Avi Schwarzschild Micah Goldblum Arjun Gupta John P. Dickerson Tom Goldstein AAML TDI 272 189 0 22 Jun 2020
UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation Siddhesh Khandelwal Raghav Goyal Leonid Sigal VLM 329 2 0 12 Jun 2020
Rethinking Pre-training and Self-trainingNeural Information Processing Systems (NeurIPS), 2020 Barret Zoph Golnaz Ghiasi Nayeon Lee Huayu Chen Hanxiao Liu E. D. Cubuk Quoc V. Le SSeg 273 702 0 11 Jun 2020
Privacy-Aware Activity Classification from First Person Office Videos Partho Ghosh Md. Abrar Istiak Nayeeb Rashid Ahsan Habib Akash Ridwan Abrar Ankan Ghosh Dastider Asif Sushmit Taufiq Hasan PICV 134 2 0 11 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 Alessandro Suglia Ioannis Konstas Andrea Vanzo E. Bastianelli Desmond Elliott Stella Frank Oliver Lemon 131 17 0 03 Jun 2020
Multimodal grid features and cell pointers for Scene Text Visual Question AnsweringPattern Recognition Letters (Pattern Recognit. Lett.), 2020 Lluís Gómez Ali Furkan Biten Rubèn Pérez Tito Andrés Mafla Marçal Rusiñol Ernest Valveny Dimosthenis Karatzas 189 22 0 01 Jun 2020
Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels Junran Peng Xingyuan Bu Ming Sun Zhaoxiang Zhang Tieniu Tan Junjie Yan VLM ObjD 170 66 0 18 May 2020
Cross-media Structured Common Space for Multimedia Event ExtractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 Pengfei Yu Alireza Zareian Qi Zeng Spencer Whitehead Di Lu Heng Ji Shih-Fu Chang 151 116 0 05 May 2020
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques Narinder Singh Punn S. K. Sonbhadra Sonali Agarwal Gaurav Rai 251 250 0 04 May 2020
Clue: Cross-modal Coherence Modeling for Caption GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 Malihe Alikhani Piyush Sharma Shengjie Li Radu Soricut Matthew Stone 181 59 0 02 May 2020