Self-Supervised Learning for Videos: A Survey

18 June 2022

Madeline Chantry Schiappa

Papers citing "Self-Supervised Learning for Videos: A Survey"

31 / 81 papers shown

Title
N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition Bashar Talafha Abdul Waheed Muhammad Abdul-Mageed 11 7 0 05 Jun 2023
Augmentation-aware Self-supervised Learning with Conditioned Projector Marcin Przewięźlikowski Mateusz Pyla Bartosz Zieliński Bartłomiej Twardowski Jacek Tabor Marek Śmieja SSL 14 2 0 31 May 2023
S-JEA: Stacked Joint Embedding Architectures for Self-Supervised Visual Representation Learning Alžběta Manová A. Durrant Georgios Leontidis SSL 11 4 0 19 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps Yanfang Li Huan Wang Muxia Sun LM&MA AI4TS AI4CE 14 44 0 10 May 2023
Self-supervised dense representation learning for live-cell microscopy with time arrow prediction Benjamin Gallusser Max Stieber Martin Weigert 8 3 0 09 May 2023
A Cookbook of Self-Supervised Learning Randall Balestriero Mark Ibrahim Vlad Sobal Ari S. Morcos Shashank Shekhar ... Pierre Fernandez Amir Bar Hamed Pirsiavash Yann LeCun Micah Goldblum SyDa FedML SSL 31 270 0 24 Apr 2023
To Compress or Not to Compress - Self-Supervised Learning and Information Theory: A Review Ravid Shwartz-Ziv Yann LeCun SSL 6 71 0 19 Apr 2023
Procedure-Aware Pretraining for Instructional Video Understanding Honglu Zhou Roberto Martín-Martín Mubbasir Kapadia Silvio Savarese Juan Carlos Niebles 23 38 0 31 Mar 2023
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition I. Dave Mamshad Nayeem Rizve C. L. P. Chen M. Shah TTA 26 13 0 28 Mar 2023
Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization Fida Mohammad Thoker Hazel Doughty Cees G. M. Snoek ViT 30 9 0 20 Mar 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends Jie Gui Tuo Chen Jing Zhang Qiong Cao Zhe Sun Haoran Luo Dacheng Tao 16 117 0 13 Jan 2023
Test of Time: Instilling Video-Language Models with a Sense of Time Piyush Bagad Makarand Tapaswi Cees G. M. Snoek 70 36 0 05 Jan 2023
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning Pritam Sarkar Ali Etemad 14 20 0 25 Nov 2022
Motion Aware Self-Supervision for Generic Event Boundary Detection Ayush Rai Tarun Krishna J. Dietlmeier Kevin McGuinness A. Smeaton Noel E. O'Connor 19 2 0 11 Oct 2022
Self-Supervised Face Presentation Attack Detection with Dynamic Grayscale Snippets Usman Muhammad Mourad Oussalah AAML CVBM 18 6 0 27 Aug 2022
SVGraph: Learning Semantic Graphs from Instructional Videos Madeline Chantry Schiappa Y. S. Rawat 11 4 0 16 Jul 2022
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations Madeline Chantry Schiappa Shruti Vyas Hamid Palangi Y. S. Rawat Vibhav Vineet VLM 109 17 0 05 Jul 2022
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning? Fida Mohammad Thoker Hazel Doughty Piyush Bagad Cees G. M. Snoek SSL 25 19 0 27 Mar 2022
Self-Training: A Survey Massih-Reza Amini Vasilii Feofanov Loïc Pauletto Lies Hadjadj Emilie Devijver Yury Maximov SSL 8 100 0 24 Feb 2022
Video Transformers: A Survey Javier Selva A. S. Johansen Sergio Escalera Kamal Nasrollahi T. Moeslund Albert Clapés ViT 20 101 0 16 Jan 2022
Masked Autoencoders Are Scalable Vision Learners Kaiming He Xinlei Chen Saining Xie Yanghao Li Piotr Dollár Ross B. Girshick ViT TPM 258 7,337 0 11 Nov 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding Hu Xu Gargi Ghosh Po-Yao (Bernie) Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer Christoph Feichtenhofer CLIP VLM 245 554 0 28 Sep 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text Hassan Akbari Liangzhe Yuan Rui Qian Wei-Hong Chuang Shih-Fu Chang Yin Cui Boqing Gong ViT 231 573 0 22 Apr 2021
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius Heng Wang Lorenzo Torresani ViT 278 1,939 0 09 Feb 2021
Self-supervised Co-training for Video Representation Learning Tengda Han Weidi Xie Andrew Zisserman SSL 198 304 0 19 Oct 2020
Video Representation Learning by Recognizing Temporal Transformations Simon Jenni Givi Meishvili Paolo Favaro 117 133 0 21 Jul 2020
Multi-modal Transformer for Video Retrieval Valentin Gabeur Chen Sun Alahari Karteek Cordelia Schmid ViT 401 594 0 21 Jul 2020
VoxCeleb2: Deep Speaker Recognition Joon Son Chung Arsha Nagrani Andrew Zisserman 214 2,224 0 14 Jun 2018
Boosting Self-Supervised Learning via Knowledge Transfer M. Noroozi Ananth Vinjimoor Paolo Favaro Hamed Pirsiavash SSL 207 291 0 01 May 2018
Lip Reading Sentences in the Wild Joon Son Chung A. Senior Oriol Vinyals Andrew Zisserman 162 782 0 16 Nov 2016
Efficient Estimation of Word Representations in Vector Space Tomáš Mikolov Kai Chen G. Corrado J. Dean 3DV 228 29,632 0 16 Jan 2013