Evolving Losses for Unsupervised Video Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2020

26 February 2020

Papers citing "Evolving Losses for Unsupervised Video Representation Learning"

50 / 95 papers shown

Temporally Heterogeneous Graph Contrastive Learning for Multimodal Acoustic event Classification

Yuanjian Chen

Yang Xiao

Jinjie Huang

141

18 Sep 2025

Aligning Moments in Time using Video Queries

363

21 Aug 2025

TrajSV: A Trajectory-based Model for Sports Video Representations and Applications

Zheng Wang

Shihao Xu

Wei Shi

201

15 Aug 2025

Improving population size adapting CMA-ES algorithm on step-size blow-up in weakly-structured multimodal functions

Chandula Fernando

Kushani De Silva

172

01 Jun 2025

Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey

547

09 Apr 2025

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

429

08 Apr 2025

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

Computer Vision and Pattern Recognition (CVPR), 2025

472

01 Apr 2025

Towards evolution of Deep Neural Networks through contrastive Self-Supervised learning

Adriano Vinhas

João Correia

Penousal Machado

SSL

199

20 Jun 2024

Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model

258

12 Jun 2024

Learning text-to-video retrieval from image captioning

421

26 Apr 2024

Collaboratively Self-supervised Video Representation Learning for Action Recognition

IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024

516

15 Jan 2024

Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

391

31 Oct 2023

Video Timeline Modeling For News Story Understanding

Neural Information Processing Systems (NeurIPS), 2023

243

23 Sep 2023

TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification

ACM Multimedia (ACM MM), 2023

345

21 Sep 2023

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023

345

15 Sep 2023

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Neural Information Processing Systems (NeurIPS), 2023

Kanchana Ranasinghe

Michael S. Ryoo

SSL VLM

498

20 Jul 2023

Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition

Neurocomputing, 2023

Hubert P. H. Shum

295

03 Apr 2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Computer Vision and Pattern Recognition (CVPR), 2023

Yi Wang

Yu Qiao

511

623

29 Mar 2023

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Computer Vision and Pattern Recognition (CVPR), 2023

280

28 Mar 2023

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding

International Conference on Learning Representations (ICLR), 2023

Ming-Hsuan Yang

352

28 Mar 2023

Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences

IEEE Robotics and Automation Letters (RA-L), 2023

Abhinav Valada

376

17 Feb 2023

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

654

460

13 Jan 2023

Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning

Machine Vision and Applications (MVA), 2022

302

21 Dec 2022

XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning

AAAI Conference on Artificial Intelligence (AAAI), 2022

Pritam Sarkar

Ali Etemad

480

25 Nov 2022

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens

International Conference on Machine Learning (ICML), 2022

463

19 Nov 2022

Learning State-Aware Visual Representations from Audible Interactions

Neural Information Processing Systems (NeurIPS), 2022

321

27 Sep 2022

ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization

353

19 Aug 2022

Static and Dynamic Concepts for Self-supervised Video Representation Learning

European Conference on Computer Vision (ECCV), 2022

279

26 Jul 2022

LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training

334

16 Jul 2022

Federated Self-supervised Learning for Video Understanding

European Conference on Computer Vision (ECCV), 2022

Yasar Abbas Ur Rehman

Yan Gao

Jiajun Shen

Pedro Porto Buarque de Gusmão

Nicholas D. Lane

FedML

288

05 Jul 2022

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

F. Saleh

Fuwen Tan

Adrian Bulat

Georgios Tzimiropoulos

Brais Martínez

SSL

367

16 Jun 2022

Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

294

06 Jun 2022

Multimodal Conversational AI: A Survey of Datasets and Approaches

Anirudh S. Sundar

Larry Heck

186

13 May 2022

A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities

ACM Computing Surveys (ACM CSUR), 2022

476

641

13 May 2022

On Negative Sampling for Audio-Visual Contrastive Learning from Movies

213

29 Apr 2022

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval

European Conference on Computer Vision (ECCV), 2022

Ying Shan

Ping Luo

190

26 Apr 2022

A Survey of Video-based Action Quality Assessment

Shunli Wang

Dingkang Yang

Peng Zhai

Lihua Zhang

166

20 Apr 2022

Robust Cross-Modal Representation Learning with Progressive Self-Distillation

Computer Vision and Pattern Recognition (CVPR), 2022

320

10 Apr 2022

Controllable Augmentations for Video Representation Learning

350

30 Mar 2022

How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?

European Conference on Computer Vision (ECCV), 2022

266

27 Mar 2022

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Neural Information Processing Systems (NeurIPS), 2022

873

1,844

23 Mar 2022

Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition

IEEE Access, 2022

Christopher Mutschler

503

16 Feb 2022

Bridging Video-text Retrieval with Multiple Choice Questions

Computer Vision and Pattern Recognition (CVPR), 2022

Ying Shan

Ping Luo

394

126

13 Jan 2022

Exploring Temporal Granularity in Self-Supervised Video Representation Learning

Ming-Hsuan Yang

237

08 Dec 2021

Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning

Srijan Das

Michael S. Ryoo

SSL

334

07 Dec 2021

ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints

Srijan Das

Michael S. Ryoo

SSL

291

07 Dec 2021

TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning

Yang Liu

364

149

07 Dec 2021

Self-supervised Video Transformer

Salman Khan

370

114

02 Dec 2021

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity

AAAI Conference on Artificial Intelligence (AAAI), 2021

Pritam Sarkar

Ali Etemad

SSL

392

09 Nov 2021

Constrained Mean Shift for Representation Learning

Ajinkya Tejankar

Soroush Abbasi Koohpayegani

Hamed Pirsiavash

SSL

211

19 Oct 2021