1611.06646
Self-Supervised Video Representation Learning With Odd-One-Out Networks
21 November 2016
Basura Fernando
Hakan Bilen
E. Gavves
Stephen Gould
SSL
Papers citing
"Self-Supervised Video Representation Learning With Odd-One-Out Networks"
50 / 277 papers shown
Title
Smooth regularization for efficient video recognition
Gil Goldman
Raja Giryes
Mahadev Satyanarayanan
AI4TS
167
0
0
25 Nov 2025
REALIGN: Regularized Procedure Alignment with Matching Video Embeddings via Partial Gromov-Wasserstein Optimal Transport
Soumyadeep Chandra
Kaushik Roy
85
0
0
29 Sep 2025
Multimodal Learning for Fake News Detection in Short Videos Using Linguistically Verified Data and Heterogeneous Modality Fusion
Shanghong Li
Chiam Wen Qi Ruth
Hong Xu
Fang Liu
88
0
0
19 Sep 2025
SpecBPP: A Self-Supervised Learning Approach for Hyperspectral Representation and Soil Organic Carbon Estimation
Daniel Laáh Ayuba
Jean-Yves Guillemaut
Belen Marti-Cardona
Oscar Mendez Maldonado
189
1
0
26 Jul 2025
Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport
Syed Ahmed Mahmood
Ali Shah Ali
Umer Ahmed
Fawad Javed Fateh
M. Zia
Quoc-Huy Tran
155
2
0
21 Jul 2025
Reinforcement Learning meets Masked Video Modeling: Trajectory-Guided Adaptive Token Selection
Ayush K. Rai
Kyle Min
Tarun Krishna
Feiyan Hu
Alan F. Smeaton
Noel E. O'Connor
VGen
314
0
0
13 May 2025
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
877
3
0
08 Apr 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViT
SSL
283
0
0
08 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
310
3
0
01 Apr 2025
Joint Self-Supervised Video Alignment and Action Segmentation
Ali Shah Ali
Syed Ahmed Mahmood
Mubin Saeed
Andrey Konin
M. Zia
Quoc-Huy Tran
OT
370
3
0
21 Mar 2025
Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey
ACM Computing Surveys (ACM CSUR), 2024
Zixuan Shangguan
Yanjie Dong
Song Guo
Victor C. M. Leung
M. Jamal Deen
Yan Wang
473
8
0
23 Dec 2024
Data Collection-free Masked Video Modeling
European Conference on Computer Vision (ECCV), 2024
Yuchi Ishikawa
Masayoshi Kondo
Yoshimitsu Aoki
ViT
166
1
0
10 Sep 2024
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets
European Conference on Computer Vision (ECCV), 2024
Ishan Rajendrakumar Dave
Fabian Caba Heilbron
Mubarak Shah
Simon Jenni
189
3
0
02 Sep 2024
How Effective are Self-Supervised Models for Contact Identification in Videos
Omri Herscovici
Limalka Sadith
Liel David
Daniel Harari
Muhammad Haris Khan
305
1
0
01 Aug 2024
SIGMA: Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi
Michael Dorkenwald
Fida Mohammad Thoker
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
233
14
0
22 Jul 2024
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Changwen Zheng
Wenwen Qiang
Jianqi Zhang
Changwen Zheng
Jingyao Wang
SSL
248
0
0
19 Jul 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
281
29
0
22 May 2024
JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos
Pietro Nardelli
Danilo Comminiello
257
1
0
05 May 2024
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering
Charig Yang
Weidi Xie
Andrew Zisserman
222
7
0
25 Apr 2024
Learning Group Activity Features Through Person Attribute Prediction
Chihiro Nakatani
Hiroaki Kawashima
Norimichi Ukita
199
3
0
05 Mar 2024
MV2MAE: Multi-View Video Masked Autoencoders
Ketul Shah
Robert Crandall
Jie Xu
Peng Zhou
Marian George
Mayank Bansal
Rama Chellappa
227
6
0
29 Jan 2024
Learning to Visually Connect Actions and their Effects
Eric Peh
Paritosh Parmar
Basura Fernando
360
2
0
19 Jan 2024
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
187
6
0
21 Dec 2023
United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Siddhant Bansal
Chetan Arora
C. V. Jawahar
277
11
0
06 Nov 2023
Concatenated Masked Autoencoders as Spatial-Temporal Learner
Zhouqiang Jiang
Bowen Wang
Tong Xiang
Zhaofeng Niu
Hong Tang
Guangshun Li
Liangzhi Li
147
4
0
02 Nov 2023
Video Timeline Modeling For News Story Understanding
Neural Information Processing Systems (NeurIPS), 2023
Meng Liu
Ruotong Wang
Jialu Liu
H. Dai
Mingming Yang
Shilin Xu
Zheyun Feng
Boqing Gong
172
5
0
23 Sep 2023
Representation Learning Dynamics of Self-Supervised Models
Pascal Esser
Satyaki Mukherjee
Debarghya Ghoshdastidar
SSL
223
3
0
05 Sep 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
239
4
0
23 Aug 2023
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations
IEEE International Conference on Computer Vision (ICCV), 2023
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
VOS
192
27
0
22 Aug 2023
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems
Momojit Biswas
Himanshu Buckchash
Dilip K. Prasad
SSL
257
10
0
14 Aug 2023
Set Learning for Accurate and Calibrated Models
International Conference on Learning Representations (ICLR), 2023
Lukas Muttenthaler
Robert A. Vandermeulen
Qiuyi Zhang
Thomas Unterthiner
Klaus-Robert Müller
274
4
0
05 Jul 2023
A Large-Scale Analysis on Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
245
3
0
09 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Neural Information Processing Systems (NeurIPS), 2023
Zihui Xue
Kristen Grauman
EgoV
234
47
0
08 Jun 2023
HomE: Homography-Equivariant Video Representation Learning
Anirudh Sriram
Adrien Gaidon
Jiajun Wu
Juan Carlos Niebles
L. Fei-Fei
Ehsan Adeli
SSL
AI4TS
135
2
0
02 Jun 2023
Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion
European Conference on Computer Vision (ECCV), 2023
Quoc-Huy Tran
Muhammad Ahmed
Murad Popattia
M. Hassan Ahmed
Andrey Konin
M. Zeeshan Zia
AI4TS
558
5
0
31 May 2023
Siamese Masked Autoencoders
Neural Information Processing Systems (NeurIPS), 2023
Agrim Gupta
Jiajun Wu
Gaowen Liu
Li Fei-Fei
140
80
0
23 May 2023
Self-Supervised Video Similarity Learning
Giorgos Kordopatis-Zilos
Giorgos Tolias
Christos Tzelepis
I. Kompatsiaris
Ioannis Patras
Symeon Papadopoulos
SSL
214
14
0
06 Apr 2023
Diffusion Models as Masked Autoencoders
IEEE International Conference on Computer Vision (ICCV), 2023
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
179
75
0
06 Apr 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
275
54
0
31 Mar 2023
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Jun Cen
Shiwei Zhang
Xiang Wang
Yixuan Pei
Zhiwu Qing
Yingya Zhang
Qifeng Chen
165
5
0
25 Mar 2023
Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization
IEEE International Conference on Computer Vision (ICCV), 2023
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
ViT
276
12
0
20 Mar 2023
Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos
D. Fan
De-Yun Yang
Xinyu Li
Vimal Bhat
M. Rohith
SSL
160
1
0
13 Mar 2023
DPPMask: Masked Image Modeling with Determinantal Point Processes
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Junde Xu
Zikai Lin
Donghao Zhou
Yao-Cheng Yang
Xiangyun Liao
Bian Wu
Guangyong Chen
Pheng-Ann Heng
268
3
0
13 Mar 2023
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences
IEEE Robotics and Automation Letters (RA-L), 2023
Christopher Lang
Alexander Braun
Lars Schillingmann
Karsten Haug
Abhinav Valada
SSL
261
12
0
17 Feb 2023
Audio-Visual Contrastive Learning with Temporal Self-Supervision
AAAI Conference on Artificial Intelligence (AAAI), 2023
Simon Jenni
Alexander Black
John Collomosse
SSL
182
24
0
15 Feb 2023
Event-guided Multi-patch Network with Self-supervision for Non-uniform Motion Deblurring
International Journal of Computer Vision (IJCV), 2022
Hongguang Zhang
Limeng Zhang
Yuchao Dai
Hongdong Li
Piotr Koniusz
132
24
0
14 Feb 2023
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images
Machine Intelligence Research (MIR), 2023
Wei-Chien Wang
Euijoon Ahn
Da-wei Feng
Jinman Kim
MedIm
556
36
0
10 Feb 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
504
329
0
13 Jan 2023
Learning to Summarize Videos by Contrasting Clips
Ivan Sosnovik
A. Moskalev
Cees Kaandorp
A. Smeulders
234
1
0
12 Jan 2023
Test of Time: Instilling Video-Language Models with a Sense of Time
Computer Vision and Pattern Recognition (CVPR), 2023
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
436
47
0
05 Jan 2023