Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.00419
Cited By
Self-Supervised Learning for Videos: A Survey
18 June 2022
Madeline Chantry Schiappa
Y. S. Rawat
M. Shah
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-Supervised Learning for Videos: A Survey"
31 / 81 papers shown
Title
N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition
Bashar Talafha
Abdul Waheed
Muhammad Abdul-Mageed
11
7
0
05 Jun 2023
Augmentation-aware Self-supervised Learning with Conditioned Projector
Marcin Przewike'zlikowski
Mateusz Pyla
Bartosz Zieliñski
Bartlomiej Twardowski
Jacek Tabor
Marek Śmieja
SSL
14
2
0
31 May 2023
S-JEA: Stacked Joint Embedding Architectures for Self-Supervised Visual Representation Learning
Alvzbveta Manová
A. Durrant
Georgios Leontidis
SSL
11
4
0
19 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
14
44
0
10 May 2023
Self-supervised dense representation learning for live-cell microscopy with time arrow prediction
Benjamin Gallusser
Max Stieber
Martin Weigert
8
3
0
09 May 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
31
270
0
24 Apr 2023
To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review
Ravid Shwartz-Ziv
Yann LeCun
SSL
6
71
0
19 Apr 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
23
38
0
31 Mar 2023
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition
I. Dave
Mamshad Nayeem Rizve
C. L. P. Chen
M. Shah
TTA
26
13
0
28 Mar 2023
Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
ViT
30
9
0
20 Mar 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
16
117
0
13 Jan 2023
Test of Time: Instilling Video-Language Models with a Sense of Time
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
70
36
0
05 Jan 2023
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning
Pritam Sarkar
Ali Etemad
14
20
0
25 Nov 2022
Motion Aware Self-Supervision for Generic Event Boundary Detection
Ayush Rai
Tarun Krishna
J. Dietlmeier
Kevin McGuinness
A. Smeaton
Noel E. O'Connor
19
2
0
11 Oct 2022
Self-Supervised Face Presentation Attack Detection with Dynamic Grayscale Snippets
Usman Muhammad
Mourad Oussalah
AAML
CVBM
18
6
0
27 Aug 2022
SVGraph: Learning Semantic Graphs from Instructional Videos
Madeline Chantry Schiappa
Y. S. Rawat
11
4
0
16 Jul 2022
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations
Madeline Chantry Schiappa
Shruti Vyas
Hamid Palangi
Y. S. Rawat
Vibhav Vineet
VLM
109
17
0
05 Jul 2022
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?
Fida Mohammad Thoker
Hazel Doughty
Piyush Bagad
Cees G. M. Snoek
SSL
25
19
0
27 Mar 2022
Self-Training: A Survey
Massih-Reza Amini
Vasilii Feofanov
Loïc Pauletto
Lies Hadjadj
Emilie Devijver
Yury Maximov
SSL
8
100
0
24 Feb 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
20
101
0
16 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
245
554
0
28 Sep 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
231
573
0
22 Apr 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,939
0
09 Feb 2021
Self-supervised Co-training for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
198
304
0
19 Oct 2020
Video Representation Learning by Recognizing Temporal Transformations
Simon Jenni
Givi Meishvili
Paolo Favaro
117
133
0
21 Jul 2020
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
401
594
0
21 Jul 2020
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
214
2,224
0
14 Jun 2018
Boosting Self-Supervised Learning via Knowledge Transfer
M. Noroozi
Ananth Vinjimoor
Paolo Favaro
Hamed Pirsiavash
SSL
207
291
0
01 May 2018
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
782
0
16 Nov 2016
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013
Previous
1
2