Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.03597
Cited By
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
7 April 2019
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Yunhui Liu
W. Liu
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics"
44 / 44 papers shown
Title
SimMIL: A Universal Weakly Supervised Pre-Training Framework for Multi-Instance Learning in Whole Slide Pathology Images
Yicheng Song
Tiancheng Lin
Die Peng
Su Yang
Yi Xu
MedIm
31
0
0
10 May 2025
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Zeen Song
Jingyao Wang
Jianqi Zhang
Changwen Zheng
Wenwen Qiang
SSL
58
0
0
19 Jul 2024
Strategies for Pretraining Neural Operators
Anthony Y. Zhou
Cooper Lorsung
AmirPouya Hemmasian
Amir Barati Farimani
AI4CE
39
4
0
12 Jun 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie M. Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond
Junyu Chen
Yihao Liu
Shuwen Wei
Zhangxing Bian
Shalini Subramanian
A. Carass
Jerry L. Prince
Yong Du
OOD
39
36
0
28 Jul 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
25
38
0
31 Mar 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
14
6
0
21 Dec 2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
H. Yao
Yi-Xin Jiang
Xiatian Zhu
Zehuan Yuan
26
19
0
09 Oct 2022
Learning State-Aware Visual Representations from Audible Interactions
Himangi Mittal
Pedro Morgado
Unnat Jain
Abhinav Gupta
70
22
0
27 Sep 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
21
20
0
17 Jul 2022
Self-Supervised Learning for Videos: A Survey
Madeline Chantry Schiappa
Y. S. Rawat
M. Shah
SSL
34
131
0
18 Jun 2022
On Negative Sampling for Audio-Visual Contrastive Learning from Movies
Mahdi M. Kalayeh
Shervin Ardeshir
Lingyi Liu
Nagendra Kamath
Ashok Chandrashekar
SSL
22
3
0
29 Apr 2022
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
Guanhong Wang
Ke Lu
Yang Zhou
Zhanhao He
Gaoang Wang
SSL
19
3
0
27 Apr 2022
Contrastive Language-Action Pre-training for Temporal Localization
Mengmeng Xu
Erhan Gundogdu
⋆⋆ Maksim
Bernard Ghanem
M. Donoser
Loris Bazzani
33
27
0
26 Apr 2022
Probabilistic Representations for Video Contrastive Learning
Jungin Park
Jiyoung Lee
Ig-Jae Kim
K. Sohn
SSL
26
43
0
08 Apr 2022
Self-supervised Video Representation Learning with Cascade Positive Retrieval
Cheng-En Wu
Farley Lai
Yujie Hu
Asim Kadav
SSL
AI4TS
25
3
0
20 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Motion-Focused Contrastive Learning of Video Representations
Rui Li
Yiheng Zhang
Zhaofan Qiu
Ting Yao
Dong Liu
Tao Mei
SSL
21
34
0
11 Jan 2022
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Yinghao Xu
Fangyun Wei
Xiao Sun
Ceyuan Yang
Yujun Shen
Bo Dai
Bolei Zhou
Stephen Lin
VLM
23
52
0
17 Dec 2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Rui Qian
Yeqing Li
Liangzhe Yuan
Boqing Gong
Ting Liu
Matthew A. Brown
Serge J. Belongie
Ming-Hsuan Yang
Hartwig Adam
Yin Cui
AI4TS
43
6
0
08 Dec 2021
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Yang Liu
Keze Wang
Lingbo Liu
Hao Lan
Liang Lin
SSL
AI4TS
48
113
0
07 Dec 2021
Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
Shuangrui Ding
Maomao Li
Tianyu Yang
Rui Qian
Haohang Xu
Qingyi Chen
Jue Wang
Hongkai Xiong
SSL
21
49
0
30 Sep 2021
Self-supervised Representation Learning Framework for Remote Physiological Measurement Using Spatiotemporal Augmentation Loss
Hao Wang
E. Ahn
Jinman Kim
26
46
0
16 Jul 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
27
11
0
18 Jun 2021
3D Human Action Representation Learning via Cross-View Consistency Pursuit
Linguo Li
Minsi Wang
Bingbing Ni
Hang Wang
Jiancheng Yang
Wenjun Zhang
129
156
0
29 Apr 2021
Actor-centered Representations for Action Localization in Streaming Videos
Sathyanarayanan N. Aakur
Sudeep Sarkar
24
3
0
29 Apr 2021
Object Priors for Classifying and Localizing Unseen Actions
Pascal Mettes
William Thong
Cees G. M. Snoek
19
20
0
10 Apr 2021
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
Mandela Patrick
Yuki M. Asano
Bernie Huang
Ishan Misra
Florian Metze
Joao Henriques
Andrea Vedaldi
AI4TS
16
33
0
18 Mar 2021
Learning the Predictability of the Future
Dídac Surís
Ruoshi Liu
Carl Vondrick
16
71
0
01 Jan 2021
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
35
184
0
11 Dec 2020
Can Temporal Information Help with Contrastive Self-Supervised Learning?
Yutong Bai
Haoqi Fan
Ishan Misra
Ganesh Venkatesh
Yongyi Lu
Yuyin Zhou
Qihang Yu
Vikas Chandra
Alan Yuille
16
40
0
25 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Bernard Ghanem
30
123
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
23
23
0
23 Nov 2020
Self-supervised Human Activity Recognition by Learning to Predict Cross-Dimensional Motion
Setareh Rahimi Taghanaki
M. Rainbow
Ali Etemad
SSL
HAI
13
15
0
21 Oct 2020
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion
Jinpeng Wang
Yuting Gao
Ke Li
Jianguo Hu
Xinyang Jiang
Xiao-Wei Guo
Rongrong Ji
Xing Sun
26
62
0
12 Sep 2020
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
Ting Yao
Yiheng Zhang
Zhaofan Qiu
Yingwei Pan
Tao Mei
DRL
13
108
0
03 Aug 2020
Learning Video Representations from Textual Web Supervision
Jonathan C. Stroud
Zhichao Lu
Chen Sun
Jia Deng
Rahul Sukthankar
Cordelia Schmid
David A. Ross
SSL
27
48
0
29 Jul 2020
Video Playback Rate Perception for Self-supervisedSpatio-Temporal Representation Learning
Yuan Yao
Chang-rui Liu
Dezhao Luo
Yu Zhou
QiXiang Ye
18
169
0
20 Jun 2020
S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
Yizhe Zhu
Martin Renqiang Min
Asim Kadav
H. Graf
CoGe
DRL
19
95
0
23 May 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
25
54
0
30 Mar 2020
Self-supervised ECG Representation Learning for Emotion Recognition
Pritam Sarkar
Ali Etemad
SSL
22
259
0
04 Feb 2020
Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Dezhao Luo
Chang-rui Liu
Yu Zhou
Dongbao Yang
Can Ma
QiXiang Ye
Weiping Wang
SSL
11
160
0
02 Jan 2020
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Bernard Ghanem
Du Tran
SSL
23
428
0
28 Nov 2019
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
67
79
0
03 Aug 2018
1