Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.12710
Cited By
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
24 November 2021
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers"
50 / 189 papers shown
Title
Concatenated Masked Autoencoders as Spatial-Temporal Learner
Zhouqiang Jiang
Bowen Wang
Tong Xiang
Zhaofeng Niu
Hong Tang
Guangshun Li
Liangzhi Li
14
2
0
02 Nov 2023
Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning
Zhiming Qian
VLM
SSL
8
0
0
11 Oct 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Guoxi Huang
Hongtao Fu
A. Bors
13
7
0
25 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
20
19
0
07 Sep 2023
Toward High Quality Facial Representation Learning
Yue Wang
Jinlong Peng
Jiangning Zhang
Ran Yi
L. Liu
Yabiao Wang
Chengjie Wang
CVBM
SSL
38
7
0
07 Sep 2023
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Qi Han
Yuxuan Cai
Xiangyu Zhang
25
7
0
02 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
6
9
0
31 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
Qidong Huang
Xiaoyi Dong
Dongdong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
11
8
0
20 Aug 2023
SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
Zhiming Wang
Lin Gu
Feng Lu
18
0
0
17 Aug 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
13
0
0
29 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL
ViT
11
3
0
18 Jul 2023
Stitched ViTs are Flexible Vision Backbones
Zizheng Pan
Jing Liu
Haoyu He
Jianfei Cai
Bohan Zhuang
11
2
0
30 Jun 2023
Inter-Instance Similarity Modeling for Contrastive Learning
Chen Shen
Dawei Liu
H. Tang
Zhe Qu
Jianxin Wang
SSL
14
4
0
21 Jun 2023
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue
Weihao Zeng
Keqing He
Yejie Wang
Chen Zeng
Jingang Wang
Yunsen Xian
Weiran Xu
13
1
0
17 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
36
2
0
12 Jun 2023
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian
Chenxin Tao
Jifeng Dai
Hao Li
Ziheng Li
Lewei Lu
Xiaogang Wang
Hongsheng Li
Gao Huang
Xizhou Zhu
DiffM
20
9
0
08 Jun 2023
Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis
Gongshu Wang
Ning Jiang
Yunxiao Ma
Tiantian Liu
Duanduan Chen
Jinglong Wu
Guoqi Li
Dong Liang
Tianyi Yan
MedIm
20
2
0
08 Jun 2023
Asymmetric Patch Sampling for Contrastive Learning
Chen Shen
Jianzhong Chen
Shu Wang
Hulin Kuang
Jin Liu
Jianxin Wang
SSL
12
4
0
05 Jun 2023
rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological Measurement
Xin Liu
Yuting Zhang
Zitong Yu
Hao Lu
Huanjing Yue
Jingyu Yang
13
24
0
04 Jun 2023
Masked Autoencoder for Unsupervised Video Summarization
Minho Shim
Taeoh Kim
Jinhyung Kim
Dongyoon Wee
15
1
0
02 Jun 2023
Exploring Open-Vocabulary Semantic Segmentation without Human Labels
Jun Chen
Deyao Zhu
Guocheng Qian
Bernard Ghanem
Zhicheng Yan
Chenchen Zhu
Fanyi Xiao
Mohamed Elhoseiny
Sean Culatana
VLM
19
11
0
01 Jun 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
22
2
0
25 May 2023
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training
Utku Ozbulak
Hyun Jung Lee
Beril Boga
Esla Timothy Anzaku
Ho-min Park
Arnout Van Messem
W. D. Neve
J. Vankerschaver
DiffM
11
35
0
23 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPC
ViT
16
38
0
15 May 2023
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
79
6
0
05 May 2023
Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Heng Pan
Chenyang Liu
Wenxiao Wang
Liejie Yuan
Hongfa Wang
Zhifeng Li
W. Liu
VLM
10
3
0
25 Apr 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
31
270
0
24 Apr 2023
CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding
Dilxat Muhtar
Xue-liang Zhang
P. Xiao
Zhenshi Li
Feng-Xue Gu
SSL
14
47
0
19 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
26
47
0
06 Apr 2023
DIME-FM: DIstilling Multimodal and Efficient Foundation Models
Ximeng Sun
Pengchuan Zhang
Peizhao Zhang
Hardik Shah
Kate Saenko
Xide Xia
VLM
10
19
0
31 Mar 2023
Mixed Autoencoder for Self-supervised Visual Representation Learning
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
SSL
27
42
0
30 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
25
155
0
28 Mar 2023
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
Jihao Liu
Tai Wang
Boxiao Liu
Qihang Zhang
Yu Liu
Hongsheng Li
25
16
0
20 Mar 2023
Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang
Fangneng Zhan
Christian Theobalt
Shijian Lu
DiffM
MQ
33
30
0
11 Mar 2023
Masked Image Modeling with Local Multi-Scale Reconstruction
Haoqing Wang
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhiwei Deng
Kai Han
56
45
0
09 Mar 2023
Centroid-centered Modeling for Efficient Vision Transformer Pre-training
Xin Yan
Zuchao Li
Lefei Zhang
Bo Du
Dacheng Tao
VLM
20
0
0
08 Mar 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
M. Sun
Weining Wang
Xinxin Zhu
Jing Liu
13
14
0
07 Mar 2023
Efficient Masked Autoencoders with Self-Consistency
Zhaowen Li
Yousong Zhu
Zhiyang Chen
Wei Li
Chaoyang Zhao
Rui Zhao
Ming Tang
Jinqiao Wang
40
2
0
28 Feb 2023
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Ziyu Jiang
Yinpeng Chen
Mengchen Liu
Dongdong Chen
Xiyang Dai
Lu Yuan
Zicheng Liu
Zhangyang Wang
SSL
VLM
CLIP
22
16
0
27 Feb 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Xiao Wang
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CE
VLM
24
195
0
20 Feb 2023
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis
Yankai Jiang
Ming Sun
Heng Guo
Xiaoyu Bai
K. Yan
Le Lu
Minfeng Xu
MedIm
19
19
0
11 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAML
VLM
21
4
0
02 Feb 2023
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning
Jie Zhu
Jiyang Qi
Mingyu Ding
Xiaokang Chen
Ping Luo
Xinggang Wang
Wenyu Liu
Leye Wang
Jingdong Wang
SSL
17
7
0
27 Jan 2023
Regeneration Learning: A Learning Paradigm for Data Generation
Xu Tan
Tao Qin
Jiang Bian
Tie-Yan Liu
Yoshua Bengio
GAN
29
15
0
21 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
24
11
0
17 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
24
117
0
13 Jan 2023
Learning Trajectory-Word Alignments for Video-Language Tasks
Xu Yang
Zhang Li
Haiyang Xu
Hanwang Zhang
Qinghao Ye
Chenliang Li
Ming Yan
Yu Zhang
Fei Huang
Songfang Huang
20
7
0
05 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Sucheng Ren
Fangyun Wei
Zheng-Wei Zhang
Han Hu
16
34
0
03 Jan 2023
Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling
Xin Ma
Chang-Shu Liu
Chunyu Xie
Long Ye
Yafeng Deng
Xiang Ji
18
8
0
31 Dec 2022
Improving Visual Representation Learning through Perceptual Understanding
Samyakh Tukra
Frederick Hoffman
Ken Chatfield
12
5
0
30 Dec 2022
Previous
1
2
3
4
Next