Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.18577
Cited By
Motion Guided Token Compression for Efficient Masked Video Modeling
10 January 2024
Yukun Feng
Yangming Shi
Fengze Liu
Tan Yan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Motion Guided Token Compression for Efficient Masked Video Modeling"
5 / 5 papers shown
Title
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
52
35
0
19 Oct 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
141
631
0
26 May 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,939
0
09 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
193
419
0
01 Feb 2021
1