v1v2 (latest)

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Computer Vision and Pattern Recognition (CVPR), 2022

14 November 2022

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (2496★)

Papers citing "EVA: Exploring the Limits of Masked Visual Representation Learning at Scale"

29 / 579 papers shown

Hierarchical Video-Moment Retrieval and Step-CaptioningComputer Vision and Pattern Recognition (CVPR), 2023

273

29 Mar 2023

EVA-CLIP: Improved Training Techniques for CLIP at Scale

829

722

27 Mar 2023

Exploring the Benefits of Visual Prompting in Differential PrivacyIEEE International Conference on Computer Vision (ICCV), 2023

253

22 Mar 2023

EVA-02: A Visual Representation for Neon GenesisImage and Vision Computing (IVC), 2023

399

409

20 Mar 2023

ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions

236

123

12 Mar 2023

A Categorical Framework of General Intelligence

Yang Yuan

248

08 Mar 2023

DejaVu: Conditional Regenerative Learning to Enhance Dense PredictionComputer Vision and Pattern Recognition (CVPR), 2023

Shubhankar Borse

Fatih Porikli

318

02 Mar 2023

A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and RethinkingInternational Journal of Computer Vision (IJCV), 2023

Yu Xie

Yinpeng Dong

Wenzhao Xiang

Hang Su

Shibao Zheng

324

117

28 Feb 2023

Offsite-Tuning: Transfer Learning without Full Model

Guangxuan Xiao

Ji Lin

Song Han

205

09 Feb 2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and VideoInternational Conference on Machine Learning (ICML), 2023

Jiabo Ye

...

Ji Zhang

Jingren Zhou

264

218

01 Feb 2023

What Makes Good Examples for Visual In-Context Learning?Neural Information Processing Systems (NeurIPS), 2023

Yuanhan Zhang

Kaiyang Zhou

Ziwei Liu

MLLM VPVLM VLM LRM

260

162

31 Jan 2023

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language ModelsInternational Conference on Machine Learning (ICML), 2023

Silvio Savarese

1.3K

6,661

30 Jan 2023

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2023

Liya Wang

A. Tien

414

28 Jan 2023

Masked Autoencoding Does Not Help Natural Language Supervision at ScaleComputer Vision and Pattern Recognition (CVPR), 2023

Floris Weers

Vaishaal Shankar

Angelos Katharopoulos

Yinfei Yang

Tom Gunter

CLIP

347

19 Jan 2023

TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real WorldACM Multimedia (ACM MM), 2023

...

Qin Jin

202

14 Jan 2023

A Survey on Self-supervised Learning: Algorithms, Applications, and Future TrendsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

565

354

13 Jan 2023

CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder

Xiangjian He

220

11 Jan 2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Yue Han

Jiangning Zhang

Zhucun Xue

Chao Xu

Xintian Shen

Yabiao Wang

Chengjie Wang

Yong Liu

Xiangtai Li

350

03 Jan 2023

Reproducible scaling laws for contrastive language-image learningComputer Vision and Pattern Recognition (CVPR), 2022

493

1,147

14 Dec 2022

CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet

Jianmin Bao

Lu Yuan

166

12 Dec 2022

Spurious Features Everywhere -- Large-Scale Detection of Harmful Spurious Features in ImageNetIEEE International Conference on Computer Vision (ICCV), 2022

Matthias Hein

299

09 Dec 2022

Direct-Effect Risk Minimization for Domain Generalization

327

26 Nov 2022

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token MigrationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yaowei Wang

196

23 Nov 2022

I Can't Believe There's No Images! Learning Visual Tasks Using only Language SupervisionIEEE International Conference on Computer Vision (ICCV), 2022

331

17 Nov 2022

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2022

...

Yu Qiao

553

958

10 Nov 2022

Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label ThresholdingIEEE Access (IEEE Access), 2022

197

07 Nov 2022

Neural Eigenfunctions Are Structured Representation Learners

Peng Cui

Jun Zhu

220

23 Oct 2022

Pathway to Future Symbiotic Creativity

Wei Xue

...

255

18 Aug 2022

Group DETR: Fast DETR Training with Group-Wise One-to-Many AssignmentIEEE International Conference on Computer Vision (ICCV), 2022

Errui Ding

Jingdong Wang

309

195

26 Jul 2022