Exploring the Limits of Weakly Supervised Pretraining

2 May 2018

Laurens van der Maaten

VLM

Papers citing "Exploring the Limits of Weakly Supervised Pretraining"

50 / 847 papers shown

ClusterMine: Robust Label-Free Visual Out-Of-Distribution Detection via Concept Mining from Text Corpora

152

10 Nov 2025

Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase

188

30 Oct 2025

Why Prototypes Collapse: Diagnosing and Preventing Partial Collapse in Prototypical Self-Supervised Learning

Michael C. Kampffmeyer

Adín Ramirez Rivera

125

23 Oct 2025

Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning

10 Oct 2025

Unsupervised Transformer Pre-Training for Images: Self-Distillation, Mean Teachers, and Random Crops

Mattia Scardecchia

ViT

157

04 Oct 2025

MultiModal Action Conditioned Video Generation

Yichen Li

Antonio Torralba

VGen

184

02 Oct 2025

GroupCoOp: Group-robust Fine-tuning via Group Prompt Learning

131

28 Sep 2025

MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper

...

228

31 Aug 2025

Data Leakage in Visual Datasets

222

24 Aug 2025

Perch 2.0: The Bittern Lesson for Bioacoustics

177

06 Aug 2025

Learning in Focus: Detecting Behavioral and Collaborative Engagement Using Vision Transformers

Noorbakhsh Amiri Golilarz

Shahram Rahimi

Andy D. Perkins

ViT

139

05 Aug 2025

Learning Partially-Decorrelated Common Spaces for Ad-hoc Video Search

Fan Hu

Zijie Xin

Xirong Li

126

04 Aug 2025

Scaling can lead to compositional generalization

203

09 Jul 2025

Fast-DataShapley: Neural Modeling for Training Data Valuation

423

05 Jun 2025

RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers

207

03 Jun 2025

The iNaturalist Sounds Dataset

Neural Information Processing Systems (NeurIPS), 2025

253

31 May 2025

Hierarchical Material Recognition from Local Appearance

Matthew Beveridge

Shree K. Nayar

344

28 May 2025

Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations

157

27 May 2025

Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification

International Joint Conference on Artificial Intelligence (IJCAI), 2025

305

13 May 2025

SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation

IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025

304

28 Apr 2025

ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams

Computer Vision and Pattern Recognition (CVPR), 2025

311

21 Apr 2025

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing

Computer Vision and Pattern Recognition (CVPR), 2025

877

04 Apr 2025

Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification

Computer Vision and Pattern Recognition (CVPR), 2025

Dongseob Kim

Hyunjung Shim

VLM

327

21 Mar 2025

LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking

Journal of Artificial Intelligence Research (JAIR), 2025

Yan Yan

Junyuan Liu

Bo Zhang

170

11 Mar 2025

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Xavier Thomas

Deepti Ghadiyaram

DiffM

499

09 Mar 2025

LapSum - One Method to Differentiate Them All: Ranking, Sorting and Top-k Selection

221

08 Mar 2025

CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation

1.1K

24 Feb 2025

Privacy-Preserving Dataset Combination

Keren Fuentes

Mimee Xu

Irene Chen

343

09 Feb 2025

Training-Free Restoration of Pruned Neural Networks

Keonho Lee

Minsoo Kim

Dong-Wan Choi

319

06 Feb 2025

Contrastive Forward-Forward: A Training Algorithm of Vision Transformer

Neural Networks (NN), 2025

Hossein Aghagolzadeh

Mehdi Ezoji

ViT

442

01 Feb 2025

Making Reliable and Flexible Decisions in Long-tailed Classification

Bolian Li

Ruqi Zhang

919

23 Jan 2025

How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?

315

20 Jan 2025

TipSegNet: Fingertip Segmentation in Contactless Fingerprint Imaging

Italian National Conference on Sensors (INS), 2025

L. Ruzicka

Bernhard Kohn

Clemens Heitzinger

332

10 Jan 2025

Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

237

08 Dec 2024

DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

284

04 Dec 2024

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Neural Information Processing Systems (NeurIPS), 2024

201

14 Nov 2024

Deploying Multi-task Online Server with Large Language Model

International Conference on Computational Linguistics (COLING), 2024

225

06 Nov 2024

Visual Fourier Prompt Tuning

Neural Information Processing Systems (NeurIPS), 2024

412

02 Nov 2024

Bayesian-guided Label Mapping for Visual Reprogramming

Neural Information Processing Systems (NeurIPS), 2024

411

31 Oct 2024

Dataset Awareness is not Enough: Implementing Sample-level Tail Encouragement in Long-tailed Self-supervised Learning

390

30 Oct 2024

Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition

Neural Information Processing Systems (NeurIPS), 2024

Mengke Li

Yong Liu

Yang Lu

Yiqun Zhang

Yiu-ming Cheung

Hui Huang

VLM

157

28 Oct 2024

TIPS: Text-Image Pretraining with Spatial awareness

International Conference on Learning Representations (ICLR), 2024

Kevis-Kokitsi Maninis

...

Mojtaba Seyedhosseini

Howard Zhou

Andre Araujo

VLM

436

21 Oct 2024

Process Reward Model with Q-Value Rankings

International Conference on Learning Representations (ICLR), 2024

W. Li

Yixuan Li

LRM

591

15 Oct 2024

Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future

Long Chen

Huchuan Lu

253

08 Oct 2024

Recent Advances of Multimodal Continual Learning: A Comprehensive Survey

Dianzhi Yu

Xinni Zhang

Yankai Chen

Aiwei Liu

Yifei Zhang

Philip S. Yu

Irwin King

VLM CLL

350

07 Oct 2024

Feature Extractor or Decision Maker: Rethinking the Role of Visual Encoders in Visuomotor Policies

IEEE International Conference on Robotics and Automation (ICRA), 2024

366

30 Sep 2024

How Effective is Pre-training of Large Masked Autoencoders for Downstream Earth Observation Tasks?

238

27 Sep 2024

Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations

Samyak Rawlekar

Shubhang Bhatnagar

Narendra Ahuja

VLM

260

12 Sep 2024

Data Collection-free Masked Video Modeling

European Conference on Computer Vision (ECCV), 2024

Yuchi Ishikawa

Masayoshi Kondo

Yoshimitsu Aoki

ViT

202

10 Sep 2024

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Neural Information Processing Systems (NeurIPS), 2024

520

05 Sep 2024