Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.10636
Cited By
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
19 November 2022
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
S. Hwang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens"
5 / 5 papers shown
Title
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
255
7,337
0
11 Nov 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
212
682
0
13 Oct 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
275
1,939
0
09 Feb 2021
Self-supervised Co-training for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
196
371
0
19 Oct 2020
AdaFrame: Adaptive Frame Selection for Fast Video Recognition
Zuxuan Wu
Caiming Xiong
Chih-Yao Ma
R. Socher
L. Davis
110
194
0
29 Nov 2018
1