Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.12177
Cited By
Evolving Losses for Unsupervised Video Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2020
26 February 2020
A. Piergiovanni
A. Angelova
Michael S. Ryoo
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Evolving Losses for Unsupervised Video Representation Learning"
50 / 95 papers shown
Temporally Heterogeneous Graph Contrastive Learning for Multimodal Acoustic event Classification
Yuanjian Chen
Yang Xiao
Jinjie Huang
141
0
0
18 Sep 2025
Aligning Moments in Time using Video Queries
Yogesh Kumar
Uday Agarwal
Manish Gupta
Anand Mishra
363
1
0
21 Aug 2025
TrajSV: A Trajectory-based Model for Sports Video Representations and Applications
Zheng Wang
Shihao Xu
Wei Shi
201
0
0
15 Aug 2025
Improving population size adapting CMA-ES algorithm on step-size blow-up in weakly-structured multimodal functions
Chandula Fernando
Kushani De Silva
172
0
0
01 Jun 2025
Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey
Adriano Vinhas
João Correia
Penousal Machado
SSL
SyDa
547
0
0
09 Apr 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViT
SSL
429
2
0
08 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
472
6
0
01 Apr 2025
Towards evolution of Deep Neural Networks through contrastive Self-Supervised learning
Adriano Vinhas
João Correia
Penousal Machado
SSL
199
0
0
20 Jun 2024
Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model
Elaheh Baharlouei
Mahsa Shafaei
Yigeng Zhang
Hugo Jair Escalante
Thamar Solorio
258
2
0
12 Jun 2024
Learning text-to-video retrieval from image captioning
Lucas Ventura
Cordelia Schmid
Gül Varol
3DV
421
11
0
26 Apr 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
516
3
0
15 Jan 2024
Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Srijan Das
Tanmay Jain
Dominick Reilly
P. Balaji
Soumyajit Karmakar
Shyam Marjit
Xiang Li
Abhijit Das
Michael S. Ryoo
391
25
0
31 Oct 2023
Video Timeline Modeling For News Story Understanding
Neural Information Processing Systems (NeurIPS), 2023
Meng Liu
Ruotong Wang
Jialu Liu
H. Dai
Mingming Yang
Shilin Xu
Zheyun Feng
Boqing Gong
243
5
0
23 Sep 2023
TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification
ACM Multimedia (ACM MM), 2023
Meng Liu
K. Liang
Dayu Hu
Hao Yu
Yue Liu
Lingyuan Meng
Wenxuan Tu
Sihang Zhou
Xinwang Liu
345
41
0
21 Sep 2023
AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder
IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2023
Xingjian Diao
Ming Cheng
Shitong Cheng
VGen
345
12
0
15 Sep 2023
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Neural Information Processing Systems (NeurIPS), 2023
Kanchana Ranasinghe
Michael S. Ryoo
SSL
VLM
498
16
0
20 Jul 2023
Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition
Neurocomputing (Neurocomputing), 2023
Qianhui Men
Edmond S. L. Ho
Hubert P. H. Shum
Howard Leung
SSL
295
24
0
03 Apr 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Computer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
511
623
0
29 Mar 2023
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Computer Vision and Pattern Recognition (CVPR), 2023
Reuben Tan
Arijit Ray
Andrea Burns
Bryan A. Plummer
Justin Salamon
Oriol Nieto
Bryan C. Russell
Kate Saenko
280
31
0
28 Mar 2023
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
International Conference on Learning Representations (ICLR), 2023
Yuanhao Xiong
Long Zhao
Boqing Gong
Ming-Hsuan Yang
Florian Schroff
Ting Liu
Cho-Jui Hsieh
Liangzhe Yuan
VLM
352
0
0
28 Mar 2023
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences
IEEE Robotics and Automation Letters (RA-L), 2023
Christopher Lang
Alexander Braun
Lars Schillingmann
Karsten Haug
Abhinav Valada
SSL
376
14
0
17 Feb 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
654
460
0
13 Jan 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning
Machine Vision and Applications (MVA), 2022
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
302
6
0
21 Dec 2022
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Pritam Sarkar
Ali Etemad
480
44
0
25 Nov 2022
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
International Conference on Machine Learning (ICML), 2022
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
Sung Ju Hwang
463
15
0
19 Nov 2022
Learning State-Aware Visual Representations from Audible Interactions
Neural Information Processing Systems (NeurIPS), 2022
Himangi Mittal
Pedro Morgado
Unnat Jain
Abhinav Gupta
321
29
0
27 Sep 2022
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization
Zdravko Marinov
Alina Roitberg
David Schneider
Rainer Stiefelhagen
353
6
0
19 Aug 2022
Static and Dynamic Concepts for Self-supervised Video Representation Learning
European Conference on Computer Vision (ECCV), 2022
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
SSL
279
28
0
26 Jul 2022
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
Sumanth Gurram
An Fang
David M. Chan
John F. Canny
VLM
AI4TS
334
2
0
16 Jul 2022
Federated Self-supervised Learning for Video Understanding
European Conference on Computer Vision (ECCV), 2022
Yasar Abbas Ur Rehman
Yan Gao
Jiajun Shen
Pedro Porto Buarque de Gusmão
Nicholas D. Lane
FedML
288
22
0
05 Jul 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
367
1
0
16 Jun 2022
Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data
Shohreh Deldari
Hao Xue
Aaqib Saeed
Jiayuan He
Daniel V. Smith
Flora D. Salim
AI4TS
294
45
0
06 Jun 2022
Multimodal Conversational AI: A Survey of Datasets and Approaches
Anirudh S. Sundar
Larry Heck
186
35
0
13 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
ACM Computing Surveys (ACM CSUR), 2022
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
476
641
0
13 May 2022
On Negative Sampling for Audio-Visual Contrastive Learning from Movies
Mahdi M. Kalayeh
Shervin Ardeshir
Lingyi Liu
Nagendra Kamath
Ashok Chandrashekar
SSL
213
3
0
29 Apr 2022
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
European Conference on Computer Vision (ECCV), 2022
Yuying Ge
Yixiao Ge
Xihui Liu
Alex Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
VLM
190
49
0
26 Apr 2022
A Survey of Video-based Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Qing Yu
Tao Suo
Zhan Sun
Ka Li
Lihua Zhang
166
22
0
20 Apr 2022
Robust Cross-Modal Representation Learning with Progressive Self-Distillation
Computer Vision and Pattern Recognition (CVPR), 2022
A. Andonian
Shixing Chen
Raffay Hamid
VLM
320
72
0
10 Apr 2022
Controllable Augmentations for Video Representation Learning
Rui Qian
Weiyao Lin
John See
Dian Li
SSL
AI4TS
350
17
0
30 Mar 2022
How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?
European Conference on Computer Vision (ECCV), 2022
Fida Mohammad Thoker
Hazel Doughty
Piyush Bagad
Cees G. M. Snoek
SSL
266
22
0
27 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Neural Information Processing Systems (NeurIPS), 2022
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
873
1,844
0
23 Mar 2022
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition
IEEE Access (IEEE Access), 2022
Felix Ott
David Rügamer
Lucas Heublein
B. Bischl
Christopher Mutschler
503
13
0
16 Feb 2022
Bridging Video-text Retrieval with Multiple Choice Questions
Computer Vision and Pattern Recognition (CVPR), 2022
Yuying Ge
Yixiao Ge
Xihui Liu
Dian Li
Ying Shan
Xiaohu Qie
Ping Luo
BDL
394
126
0
13 Jan 2022
Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Rui Qian
Yeqing Li
Liangzhe Yuan
Boqing Gong
Ting Liu
Matthew A. Brown
Serge Belongie
Ming-Hsuan Yang
Hartwig Adam
Huayu Chen
AI4TS
237
7
0
08 Dec 2021
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Srijan Das
Michael S. Ryoo
SSL
334
1
0
07 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
291
34
0
07 Dec 2021
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Yang Liu
Keze Wang
Lingbo Liu
Hao Lan
Liang Lin
SSL
AI4TS
364
149
0
07 Dec 2021
Self-supervised Video Transformer
Kanchana Ranasinghe
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Michael S. Ryoo
ViT
370
114
0
02 Dec 2021
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
AAAI Conference on Artificial Intelligence (AAAI), 2021
Pritam Sarkar
Ali Etemad
SSL
392
16
0
09 Nov 2021
Constrained Mean Shift for Representation Learning
Ajinkya Tejankar
Soroush Abbasi Koohpayegani
Hamed Pirsiavash
SSL
211
0
0
19 Oct 2021
1
2
Next
Page 1 of 2