RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning

AAAI Conference on Artificial Intelligence (AAAI), 2020

27 October 2020

Chuang Gan

Papers citing "RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning"

Advancing Video Self-Supervised Learning via Image Foundation Models

Pattern Recognition Letters (Pattern Recogn. Lett.), 2025

Jingwei Wu

Zhewei Huang

Chang Liu

234

25 May 2025

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

429

08 Apr 2025

A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning

971

08 Apr 2025

Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA

253

08 Apr 2025

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

Computer Vision and Pattern Recognition (CVPR), 2025

472

01 Apr 2025

LocoMotion: Learning Motion-Focused Video-Language Representations

Asian Conference on Computer Vision (ACCV), 2024

Hazel Doughty

Fida Mohammad Thoker

Cees G. M. Snoek

422

15 Oct 2024

FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition

European Conference on Computer Vision (ECCV), 2024

Ishan Rajendrakumar Dave

Mamshad Nayeem Rizve

Mubarak Shah

AI4TS

241

02 Sep 2024

Enhancing Sound Source Localization via False Negative Elimination

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

Zhaoxiang Zhang

378

29 Aug 2024

How Effective are Self-Supervised Models for Contact Identification in Videos

384

01 Aug 2024

SIGMA: Sinkhorn-Guided Masked Video Modeling

321

22 Jul 2024

Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective

427

19 Jul 2024

Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition

IEEE Transactions on Image Processing (TIP), 2024

Wengang Zhou

320

15 Jun 2024

Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model

258

12 Jun 2024

The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol

Konstanty Subbotko

Wojciech Jablonski

Piotr Bilinski

351

26 May 2024

BIMM: Brain Inspired Masked Modeling for Video Representation Learning

283

21 May 2024

Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting

392

18 Mar 2024

Collaboratively Self-supervised Video Representation Learning for Action Recognition

IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024

516

15 Jan 2024

Universal Time-Series Representation Learning: A Survey

447

08 Jan 2024

PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Amirhossein Dadashzadeh

Shuchao Duan

Alan Whone

Majid Mirmehdi

297

11 Nov 2023

FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation

Neural Information Processing Systems (NeurIPS), 2023

326

11 Oct 2023

Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning

ACM Multimedia (ACM MM), 2023

243

01 Sep 2023

Unsupervised Representation Learning for Time Series: A Review

Yong Liu

280

03 Aug 2023

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Neural Information Processing Systems (NeurIPS), 2023

Kanchana Ranasinghe

Michael S. Ryoo

SSL VLM

498

20 Jul 2023

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023

345

20 Jul 2023

A Large-Scale Analysis on Self-Supervised Video Representation Learning

361

09 Jun 2023

Masked Autoencoder for Unsupervised Video Summarization

224

02 Jun 2023

PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos

Computer Vision and Pattern Recognition (CVPR), 2023

288

06 May 2023

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Computer Vision and Pattern Recognition (CVPR), 2023

Yi Wang

Yu Qiao

515

623

29 Mar 2023

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding

International Conference on Learning Representations (ICLR), 2023

Ming-Hsuan Yang

353

28 Mar 2023

TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2023

281

28 Mar 2023

Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization

IEEE International Conference on Computer Vision (ICCV), 2023

399

20 Mar 2023

Multi-Task Self-Supervised Time-Series Representation Learning

Information Sciences (Inf. Sci.), 2023

Heejeong Choi

Pilsung Kang

AI4TS SSL

322

02 Mar 2023

Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning

Machine Vision and Applications (MVA), 2022

302

21 Dec 2022

MoQuad: Motion-focused Quadruple Construction for Video Contrastive Learning

Yuan Liu

Jiacheng Chen

Hao Wu

264

21 Dec 2022

MHCCL: Masked Hierarchical Cluster-Wise Contrastive Learning for Multivariate Time Series

AAAI Conference on Artificial Intelligence (AAAI), 2022

Yong Liu

485

02 Dec 2022

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens

International Conference on Machine Learning (ICML), 2022

463

19 Nov 2022

Semantic Video Moments Retrieval at Scale: A New Task and a Baseline

Na Li

294

15 Oct 2022

Masked Motion Encoding for Self-Supervised Video Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2022

Chuang Gan

403

12 Oct 2022

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

175

09 Oct 2022

Learning Transferable Spatiotemporal Representations from Natural Script Knowledge

Computer Vision and Pattern Recognition (CVPR), 2022

Ping Luo

272

30 Sep 2022

Temporal Contrastive Learning with Curriculum

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Shuvendu Roy

Ali Etemad

323

02 Sep 2022

Motion Sensitive Contrastive Learning for Self-supervised Video Representation

European Conference on Computer Vision (ECCV), 2022

251

12 Aug 2022

DAS: Densely-Anchored Sampling for Deep Metric Learning

European Conference on Computer Vision (ECCV), 2022

Yaowei Wang

325

30 Jul 2022

Static and Dynamic Concepts for Self-supervised Video Representation Learning

European Conference on Computer Vision (ECCV), 2022

279

26 Jul 2022

Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection

European Conference on Computer Vision (ECCV), 2022

Yibing Zhan

184

24 Jul 2022

MAR: Masked Autoencoders for Efficient Action Recognition

IEEE Transactions on Multimedia (IEEE TMM), 2022

321

24 Jul 2022

LocVTP: Video-Text Pre-training for Temporal Localization

European Conference on Computer Vision (ECCV), 2022

236

21 Jul 2022

Dual Contrastive Learning for Spatio-temporal Representation

ACM Multimedia (ACM MM), 2022

175

12 Jul 2022

Exploring Temporally Dynamic Data Augmentation for Video Recognition

International Conference on Learning Representations (ICLR), 2022

302

30 Jun 2022

SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos

Computer Vision and Pattern Recognition (CVPR), 2022

287

25 Jun 2022