1805.00932
Cited By
Exploring the Limits of Weakly Supervised Pretraining
2 May 2018
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
Papers citing
"Exploring the Limits of Weakly Supervised Pretraining"
50 / 847 papers shown
ClusterMine: Robust Label-Free Visual Out-Of-Distribution Detection via Concept Mining from Text Corpora
Nikolas Adaloglou
Diana Petrusheva
Mohamed Asker
Félix D. P. Michels
M. Kollmann
OODD
152
0
0
10 Nov 2025
Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase
Mihir Mahajan
Alfred Nguyen
Franz Srambical
Stefan Bauer
188
0
0
30 Oct 2025
Why Prototypes Collapse: Diagnosing and Preventing Partial Collapse in Prototypical Self-Supervised Learning
Gabriel Y. Arteaga
Marius Aasan
Rwiddhi Chakraborty
Martine Hjelkrem-Tan
T. Silva
Michael C. Kampffmeyer
Adín Ramirez Rivera
125
0
0
23 Oct 2025
Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning
Yufei Wang
Adriana Kovashka
Loretta Fernández
Marc N. Coutanche
Seth Wiener
89
0
0
10 Oct 2025
Unsupervised Transformer Pre-Training for Images: Self-Distillation, Mean Teachers, and Random Crops
Mattia Scardecchia
ViT
157
0
0
04 Oct 2025
MultiModal Action Conditioned Video Generation
Yichen Li
Antonio Torralba
VGen
184
3
0
02 Oct 2025
GroupCoOp: Group-robust Fine-tuning via Group Prompt Learning
Nayeong Kim
Seong Joon Oh
Suha Kwak
VLM
131
0
0
28 Sep 2025
MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper
Runjia Zeng
Guangyan Sun
Qifan Wang
Tong Geng
S. Dianat
...
Raghuveer M. Rao
Xueling Zhang
Cheng Han
Lifu Huang
Dongfang Liu
MoE
228
3
0
31 Aug 2025
Data Leakage in Visual Datasets
Patrick Ramos
Ryan Ramos
Noa Garcia
PILM
222
1
0
24 Aug 2025
Perch 2.0: The Bittern Lesson for Bioacoustics
B. V. Merrienboer
Vincent Dumoulin
Jenny Hamer
Lauren Harrell
Andrea Burns
Tom Denton
MDE
177
5
0
06 Aug 2025
Learning in Focus: Detecting Behavioral and Collaborative Engagement Using Vision Transformers
Sindhuja Penchala
Saketh Reddy Kontham
Prachi Bhattacharjee
Sareh Karami
Mehdi Ghahremani
Noorbakhsh Amiri Golilarz
Shahram Rahimi
Andy D. Perkins
ViT
139
0
0
05 Aug 2025
Learning Partially-Decorrelated Common Spaces for Ad-hoc Video Search
Fan Hu
Zijie Xin
Xirong Li
126
0
0
04 Aug 2025
Scaling can lead to compositional generalization
Florian Redhardt
Yassir Akram
Simon Schug
GNN
CoGe
203
0
0
09 Jul 2025
Fast-DataShapley: Neural Modeling for Training Data Valuation
Haifeng Sun
Yu Xiong
Runze Wu
Xinyu Cai
Changjie Fan
Lan Zhang
Xiang-Yang Li
TDI
423
0
0
05 Jun 2025
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers
Yan Gong
Yiren Song
Yicheng Li
Chenglin Li
Yin Zhang
KELM
207
14
0
03 Jun 2025
The iNaturalist Sounds Dataset
Neural Information Processing Systems (NeurIPS), 2025
Mustafa Chasmai
Alexander Shepard
Subhransu Maji
Grant Van Horn
253
13
0
31 May 2025
Hierarchical Material Recognition from Local Appearance
Matthew Beveridge
Shree K. Nayar
344
3
0
28 May 2025
Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations
Yue Li Du
Ben Alexander
Mikhail Antonenka
Rohan Mahadev
Hao Wu
Dmitry Kislyuk
157
0
0
27 May 2025
Empowering Vision Transformers with Multi-Scale Causal Intervention for Long-Tailed Image Classification
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Xiaoshuo Yan
Zerui Li
Lei Meng
Zhuang Qi
Wei Wu
Zixuan Li
Xiangxu Meng
CML
BDL
305
2
0
13 May 2025
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Yulong Guo
Zilun Zhang
Yongheng Shang
Tiancheng Zhao
Shuiguang Deng
Yingchun Yang
Jianwei Yin
304
0
0
28 Apr 2025
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
Computer Vision and Pattern Recognition (CVPR), 2025
C. Kim
Jihwan Moon
Sangwoo Moon
Heeseung Yun
Sihaeng Lee
Aniruddha Kembhavi
Soonyoung Lee
Gunhee Kim
Sangho Lee
Christopher Clark
311
1
0
21 Apr 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
Computer Vision and Pattern Recognition (CVPR), 2025
Niu Lian
Jun Li
Jinpeng Wang
Ruisheng Luo
Yaowei Wang
Shu-Tao Xia
Bin Chen
877
0
0
04 Apr 2025
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Computer Vision and Pattern Recognition (CVPR), 2025
Dongseob Kim
Hyunjung Shim
VLM
327
0
0
21 Mar 2025
LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking
Journal of Artificial Intelligence Research (JAIR), 2025
Yan Yan
Junyuan Liu
Bo Zhang
170
1
0
11 Mar 2025
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Xavier Thomas
Deepti Ghadiyaram
DiffM
499
2
0
09 Mar 2025
LapSum - One Method to Differentiate Them All: Ranking, Sorting and Top-k Selection
Łukasz Struski
Michał B. Bednarczyk
Igor T. Podolak
Jacek Tabor
BDL
221
2
0
08 Mar 2025
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation
Vishal Thengane
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
L. Yin
Xiatian Zhu
Salman Khan
CLL
1.1K
0
0
24 Feb 2025
Privacy-Preserving Dataset Combination
Keren Fuentes
Mimee Xu
Irene Chen
343
0
0
09 Feb 2025
Training-Free Restoration of Pruned Neural Networks
Keonho Lee
Minsoo Kim
Dong-Wan Choi
319
2
0
06 Feb 2025
Contrastive Forward-Forward: A Training Algorithm of Vision Transformer
Neural Networks (NN), 2025
Hossein Aghagolzadeh
Mehdi Ezoji
ViT
442
2
0
01 Feb 2025
Making Reliable and Flexible Decisions in Long-tailed Classification
Bolian Li
Ruqi Zhang
919
0
0
23 Jan 2025
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?
Wenxuan Li
Yaoyao Liu
Zongwei Zhou
MedIm
315
16
0
20 Jan 2025
TipSegNet: Fingertip Segmentation in Contactless Fingerprint Imaging
Italian National Conference on Sensors (INS), 2025
L. Ruzicka
Bernhard Kohn
Clemens Heitzinger
332
3
0
10 Jan 2025
Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Junha Lee
Sojung An
Sujeong You
Namik Cho
237
0
0
08 Dec 2024
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Youssof Nawar
Nouran Soliman
Moustafa Wassel
Mohamed ElHabebe
Noha Adly
Marwan Torki
Ahmed Elmassry
Islam Ahmed
MedIm
284
0
0
04 Dec 2024
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
Neural Information Processing Systems (NeurIPS), 2024
Alexander C. Li
Yuandong Tian
Bin Chen
Deepak Pathak
Xinlei Chen
201
9
0
14 Nov 2024
Deploying Multi-task Online Server with Large Language Model
International Conference on Computational Linguistics (COLING), 2024
Yincen Qu
Chao Ma
Xiangying Dai
Hui Zhou
Yiting Wu
Hengyue Liu
225
0
0
06 Nov 2024
Visual Fourier Prompt Tuning
Neural Information Processing Systems (NeurIPS), 2024
Runjia Zeng
Cheng Han
Qifan Wang
Chunshu Wu
Tong Geng
Lifu Huang
Ying Nian Wu
Dongfang Liu
VPVLM
VLM
412
27
0
02 Nov 2024
Bayesian-guided Label Mapping for Visual Reprogramming
Neural Information Processing Systems (NeurIPS), 2024
C. Cai
Zesheng Ye
Bingquan Shen
Jianzhong Qi
Feng Liu
411
8
0
31 Oct 2024
Dataset Awareness is not Enough: Implementing Sample-level Tail Encouragement in Long-tailed Self-supervised Learning
Haowen Xiao
Guanghui Liu
Xinyi Gao
Yang Li
Fengmao Lv
Jielei Chu
390
0
0
30 Oct 2024
Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition
Neural Information Processing Systems (NeurIPS), 2024
Mengke Li
Yong Liu
Yang Lu
Yiqun Zhang
Yiu-ming Cheung
Hui Huang
VLM
157
15
0
28 Oct 2024
TIPS: Text-Image Pretraining with Spatial awareness
International Conference on Learning Representations (ICLR), 2024
Kevis-Kokitsi Maninis
Kaifeng Chen
Soham Ghosh
Arjun Karpur
Koert Chen
...
Jan Dlabal
Dan Gnanapragasam
Mojtaba Seyedhosseini
Howard Zhou
Andre Araujo
VLM
436
17
0
21 Oct 2024
Process Reward Model with Q-Value Rankings
International Conference on Learning Representations (ICLR), 2024
W. Li
Yixuan Li
LRM
591
58
0
15 Oct 2024
Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future
Long Chen
Yuzhi Huang
Junyu Dong
Qi Xu
Sam Kwong
Huimin Lu
Huchuan Lu
Chongyi Li
253
14
0
08 Oct 2024
Recent Advances of Multimodal Continual Learning: A Comprehensive Survey
Dianzhi Yu
Xinni Zhang
Yankai Chen
Aiwei Liu
Yifei Zhang
Philip S. Yu
Irwin King
VLM
CLL
350
30
0
07 Oct 2024
Feature Extractor or Decision Maker: Rethinking the Role of Visual Encoders in Visuomotor Policies
IEEE International Conference on Robotics and Automation (ICRA), 2024
Ruiyu Wang
Zheyu Zhuang
Shutong Jin
Nils Ingelhag
Danica Kragic
Florian T. Pokorny
366
0
0
30 Sep 2024
How Effective is Pre-training of Large Masked Autoencoders for Downstream Earth Observation Tasks?
Jose Sosa
Mohamed Aloulou
Danila Rukhovich
Rim Sleimi
Boonyarit Changaival
Anis Kacem
Djamila Aouada
238
4
0
27 Sep 2024
Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations
Samyak Rawlekar
Shubhang Bhatnagar
Narendra Ahuja
VLM
260
1
0
12 Sep 2024
Data Collection-free Masked Video Modeling
European Conference on Computer Vision (ECCV), 2024
Yuchi Ishikawa
Masayoshi Kondo
Yoshimitsu Aoki
ViT
202
1
0
10 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Neural Information Processing Systems (NeurIPS), 2024
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
520
32
0
05 Sep 2024