ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.12177
  4. Cited By
Evolving Losses for Unsupervised Video Representation Learning

Evolving Losses for Unsupervised Video Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2020
26 February 2020
A. Piergiovanni
A. Angelova
Michael S. Ryoo
    SSL
ArXiv (abs)PDFHTML

Papers citing "Evolving Losses for Unsupervised Video Representation Learning"

50 / 95 papers shown
Temporally Heterogeneous Graph Contrastive Learning for Multimodal Acoustic event Classification
Temporally Heterogeneous Graph Contrastive Learning for Multimodal Acoustic event Classification
Yuanjian Chen
Yang Xiao
Jinjie Huang
141
0
0
18 Sep 2025
Aligning Moments in Time using Video Queries
Aligning Moments in Time using Video Queries
Yogesh Kumar
Uday Agarwal
Manish Gupta
Anand Mishra
363
1
0
21 Aug 2025
TrajSV: A Trajectory-based Model for Sports Video Representations and Applications
TrajSV: A Trajectory-based Model for Sports Video Representations and Applications
Zheng Wang
Shihao Xu
Wei Shi
201
0
0
15 Aug 2025
Improving population size adapting CMA-ES algorithm on step-size blow-up in weakly-structured multimodal functions
Improving population size adapting CMA-ES algorithm on step-size blow-up in weakly-structured multimodal functions
Chandula Fernando
Kushani De Silva
172
0
0
01 Jun 2025
Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey
Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey
Adriano Vinhas
João Correia
Penousal Machado
SSLSyDa
547
0
0
09 Apr 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViTSSL
429
2
0
08 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
SMILE: Infusing Spatial and Motion Semantics in Masked Video LearningComputer Vision and Pattern Recognition (CVPR), 2025
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
472
6
0
01 Apr 2025
Towards evolution of Deep Neural Networks through contrastive
  Self-Supervised learning
Towards evolution of Deep Neural Networks through contrastive Self-Supervised learning
Adriano Vinhas
João Correia
Penousal Machado
SSL
199
0
0
20 Jun 2024
Labeling Comic Mischief Content in Online Videos with a Multimodal
  Hierarchical-Cross-Attention Model
Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model
Elaheh Baharlouei
Mahsa Shafaei
Yigeng Zhang
Hugo Jair Escalante
Thamar Solorio
258
2
0
12 Jun 2024
Learning text-to-video retrieval from image captioning
Learning text-to-video retrieval from image captioning
Lucas Ventura
Cordelia Schmid
Gül Varol
3DV
421
11
0
26 Apr 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Collaboratively Self-supervised Video Representation Learning for Action RecognitionIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
516
3
0
15 Jan 2024
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked
  Autoencoders
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked AutoencodersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
391
25
0
31 Oct 2023
Video Timeline Modeling For News Story Understanding
Video Timeline Modeling For News Story UnderstandingNeural Information Processing Systems (NeurIPS), 2023
Meng Liu
Ruotong Wang
Jialu Liu
H. Dai
Mingming Yang
Shilin Xu
Zheyun Feng
Boqing Gong
243
5
0
23 Sep 2023
TMac: Temporal Multi-Modal Graph Learning for Acoustic Event
  Classification
TMac: Temporal Multi-Modal Graph Learning for Acoustic Event ClassificationACM Multimedia (ACM MM), 2023
Meng Liu
K. Liang
Dayu Hu
Hao Yu
Yue Liu
Lingyuan Meng
Wenxuan Tu
Sihang Zhou
Xinwang Liu
345
41
0
21 Sep 2023
AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual
  Masked Autoencoder
AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked AutoencoderIEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023
Xingjian Diao
Ming Cheng
Shitong Cheng
VGen
345
12
0
15 Sep 2023
Language-based Action Concept Spaces Improve Video Self-Supervised
  Learning
Language-based Action Concept Spaces Improve Video Self-Supervised LearningNeural Information Processing Systems (NeurIPS), 2023
Kanchana Ranasinghe
Michael S. Ryoo
SSLVLM
498
16
0
20 Jul 2023
Focalized Contrastive View-invariant Learning for Self-supervised
  Skeleton-based Action Recognition
Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action RecognitionNeurocomputing (Neurocomputing), 2023
Qianhui Men
Edmond S. L. Ho
Hubert P. H. Shum
Howard Leung
SSL
295
24
0
03 Apr 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
VideoMAE V2: Scaling Video Masked Autoencoders with Dual MaskingComputer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
511
623
0
29 Mar 2023
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Language-Guided Audio-Visual Source Separation via Trimodal ConsistencyComputer Vision and Pattern Recognition (CVPR), 2023
Reuben Tan
Arijit Ray
Andrea Burns
Bryan A. Plummer
Justin Salamon
Oriol Nieto
Bryan C. Russell
Kate Saenko
280
31
0
28 Mar 2023
Structured Video-Language Modeling with Temporal Grouping and Spatial
  Grounding
Structured Video-Language Modeling with Temporal Grouping and Spatial GroundingInternational Conference on Learning Representations (ICLR), 2023
Yuanhao Xiong
Long Zhao
Boqing Gong
Ming-Hsuan Yang
Florian Schroff
Ting Liu
Cho-Jui Hsieh
Liangzhe Yuan
VLM
352
0
0
28 Mar 2023
Self-Supervised Representation Learning from Temporal Ordering of
  Automated Driving Sequences
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving SequencesIEEE Robotics and Automation Letters (RA-L), 2023
Christopher Lang
Alexander Braun
Lars Schillingmann
Karsten Haug
Abhinav Valada
SSL
376
14
0
17 Feb 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and
  Future Trends
A Survey on Self-supervised Learning: Algorithms, Applications, and Future TrendsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
654
460
0
13 Jan 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive
  Self-Supervised Learning
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised LearningMachine Vision and Applications (MVA), 2022
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
302
6
0
21 Dec 2022
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video
  Representation Learning
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Pritam Sarkar
Ali Etemad
480
44
0
25 Nov 2022
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant
  Spatiotemporal Tokens
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal TokensInternational Conference on Machine Learning (ICML), 2022
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
Sung Ju Hwang
463
15
0
19 Nov 2022
Learning State-Aware Visual Representations from Audible Interactions
Learning State-Aware Visual Representations from Audible InteractionsNeural Information Processing Systems (NeurIPS), 2022
Himangi Mittal
Pedro Morgado
Unnat Jain
Abhinav Gupta
321
29
0
27 Sep 2022
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain
  Generalization
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization
Zdravko Marinov
Alina Roitberg
David Schneider
Rainer Stiefelhagen
353
6
0
19 Aug 2022
Static and Dynamic Concepts for Self-supervised Video Representation
  Learning
Static and Dynamic Concepts for Self-supervised Video Representation LearningEuropean Conference on Computer Vision (ECCV), 2022
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
SSL
279
28
0
26 Jul 2022
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
Sumanth Gurram
An Fang
David M. Chan
John F. Canny
VLMAI4TS
334
2
0
16 Jul 2022
Federated Self-supervised Learning for Video Understanding
Federated Self-supervised Learning for Video UnderstandingEuropean Conference on Computer Vision (ECCV), 2022
Yasar Abbas Ur Rehman
Yan Gao
Jiajun Shen
Pedro Porto Buarque de Gusmão
Nicholas D. Lane
FedML
288
22
0
05 Jul 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
367
1
0
16 Jun 2022
Beyond Just Vision: A Review on Self-Supervised Representation Learning
  on Multimodal and Temporal Data
Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data
Shohreh Deldari
Hao Xue
Aaqib Saeed
Jiayuan He
Daniel V. Smith
Flora D. Salim
AI4TS
294
45
0
06 Jun 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches
Multimodal Conversational AI: A Survey of Datasets and Approaches
Anirudh S. Sundar
Larry Heck
186
35
0
13 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications,
  Challenges, and Opportunities
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and OpportunitiesACM Computing Surveys (ACM CSUR), 2022
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
476
641
0
13 May 2022
On Negative Sampling for Audio-Visual Contrastive Learning from Movies
On Negative Sampling for Audio-Visual Contrastive Learning from Movies
Mahdi M. Kalayeh
Shervin Ardeshir
Lingyi Liu
Nagendra Kamath
Ashok Chandrashekar
SSL
213
3
0
29 Apr 2022
MILES: Visual BERT Pre-training with Injected Language Semantics for
  Video-text Retrieval
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text RetrievalEuropean Conference on Computer Vision (ECCV), 2022
Yuying Ge
Yixiao Ge
Xihui Liu
Alex Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
VLM
190
49
0
26 Apr 2022
A Survey of Video-based Action Quality Assessment
A Survey of Video-based Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Qing Yu
Tao Suo
Zhan Sun
Ka Li
Lihua Zhang
166
22
0
20 Apr 2022
Robust Cross-Modal Representation Learning with Progressive
  Self-Distillation
Robust Cross-Modal Representation Learning with Progressive Self-DistillationComputer Vision and Pattern Recognition (CVPR), 2022
A. Andonian
Shixing Chen
Raffay Hamid
VLM
320
72
0
10 Apr 2022
Controllable Augmentations for Video Representation Learning
Controllable Augmentations for Video Representation Learning
Rui Qian
Weiyao Lin
John See
Dian Li
SSLAI4TS
350
17
0
30 Mar 2022
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?European Conference on Computer Vision (ECCV), 2022
Fida Mohammad Thoker
Hazel Doughty
Piyush Bagad
Cees G. M. Snoek
SSL
266
22
0
27 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for
  Self-Supervised Video Pre-Training
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-TrainingNeural Information Processing Systems (NeurIPS), 2022
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
873
1,844
0
23 Mar 2022
Auxiliary Cross-Modal Representation Learning with Triplet Loss
  Functions for Online Handwriting Recognition
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting RecognitionIEEE Access (IEEE Access), 2022
Felix Ott
David Rügamer
Lucas Heublein
B. Bischl
Christopher Mutschler
503
13
0
16 Feb 2022
Bridging Video-text Retrieval with Multiple Choice Questions
Bridging Video-text Retrieval with Multiple Choice QuestionsComputer Vision and Pattern Recognition (CVPR), 2022
Yuying Ge
Yixiao Ge
Xihui Liu
Dian Li
Ying Shan
Xiaohu Qie
Ping Luo
BDL
394
126
0
13 Jan 2022
Exploring Temporal Granularity in Self-Supervised Video Representation
  Learning
Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Rui Qian
Yeqing Li
Liangzhe Yuan
Boqing Gong
Ting Liu
Matthew A. Brown
Serge Belongie
Ming-Hsuan Yang
Hartwig Adam
Huayu Chen
AI4TS
237
7
0
08 Dec 2021
Cross-modal Manifold Cutmix for Self-supervised Video Representation
  Learning
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Srijan Das
Michael S. Ryoo
SSL
334
1
0
07 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen
  Viewpoints
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
291
34
0
07 Dec 2021
TCGL: Temporal Contrastive Graph for Self-supervised Video
  Representation Learning
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Yang Liu
Keze Wang
Lingbo Liu
Hao Lan
Liang Lin
SSLAI4TS
364
149
0
07 Dec 2021
Self-supervised Video Transformer
Self-supervised Video Transformer
Kanchana Ranasinghe
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Michael S. Ryoo
ViT
370
114
0
02 Dec 2021
Self-Supervised Audio-Visual Representation Learning with Relaxed
  Cross-Modal Synchronicity
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal SynchronicityAAAI Conference on Artificial Intelligence (AAAI), 2021
Pritam Sarkar
Ali Etemad
SSL
392
16
0
09 Nov 2021
Constrained Mean Shift for Representation Learning
Constrained Mean Shift for Representation Learning
Ajinkya Tejankar
Soroush Abbasi Koohpayegani
Hamed Pirsiavash
SSL
211
0
0
19 Oct 2021
12
Next
Page 1 of 2