ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.07949
  4. Cited By
RSPNet: Relative Speed Perception for Unsupervised Video Representation
  Learning
v1v2 (latest)

RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning

AAAI Conference on Artificial Intelligence (AAAI), 2020
27 October 2020
Peihao Chen
Deng Huang
Dongliang He
Xiang Long
Runhao Zeng
Shilei Wen
Zhuliang Yu
Chuang Gan
    SSL
ArXiv (abs)PDFHTMLGithub (37★)

Papers citing "RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning"

50 / 81 papers shown
Advancing Video Self-Supervised Learning via Image Foundation Models
Advancing Video Self-Supervised Learning via Image Foundation ModelsPattern Recognition Letters (Pattern Recogn. Lett.), 2025
Jingwei Wu
Zhewei Huang
Chang Liu
234
0
0
25 May 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViTSSL
429
2
0
08 Apr 2025
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
971
4
0
08 Apr 2025
Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA
Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA
Zijie Song
Zhenzhen Hu
Yixiao Ma
Jia Li
Richang Hong
253
2
0
08 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
SMILE: Infusing Spatial and Motion Semantics in Masked Video LearningComputer Vision and Pattern Recognition (CVPR), 2025
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
472
6
0
01 Apr 2025
LocoMotion: Learning Motion-Focused Video-Language Representations
LocoMotion: Learning Motion-Focused Video-Language RepresentationsAsian Conference on Computer Vision (ACCV), 2024
Hazel Doughty
Fida Mohammad Thoker
Cees G. M. Snoek
422
4
0
15 Oct 2024
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for
  Semi-Supervised Fine-Grained Action Recognition
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action RecognitionEuropean Conference on Computer Vision (ECCV), 2024
Ishan Rajendrakumar Dave
Mamshad Nayeem Rizve
Mubarak Shah
AI4TS
241
5
0
02 Sep 2024
Enhancing Sound Source Localization via False Negative Elimination
Enhancing Sound Source Localization via False Negative EliminationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zengjie Song
Jiangshe Zhang
Yuxi Wang
Junsong Fan
Zhaoxiang Zhang
378
4
0
29 Aug 2024
How Effective are Self-Supervised Models for Contact Identification in
  Videos
How Effective are Self-Supervised Models for Contact Identification in Videos
Omri Herscovici
Limalka Sadith
Liel David
Daniel Harari
Muhammad Haris Khan
384
1
0
01 Aug 2024
SIGMA:Sinkhorn-Guided Masked Video Modeling
SIGMA:Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi
Michael Dorkenwald
Fida Mohammad Thoker
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
321
20
0
22 Jul 2024
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Changwen Zheng
Wenwen Qiang
Jianqi Zhang
Changwen Zheng
Jingyao Wang
SSL
427
0
0
19 Jul 2024
Self-Supervised Representation Learning with Spatial-Temporal
  Consistency for Sign Language Recognition
Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language RecognitionIEEE Transactions on Image Processing (TIP), 2024
Weichao Zhao
Wengang Zhou
Hezhen Hu
Min Wang
Houqiang Li
SLR
320
16
0
15 Jun 2024
Labeling Comic Mischief Content in Online Videos with a Multimodal
  Hierarchical-Cross-Attention Model
Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model
Elaheh Baharlouei
Mahsa Shafaei
Yigeng Zhang
Hugo Jair Escalante
Thamar Solorio
258
2
0
12 Jun 2024
The devil is in discretization discrepancy. Robustifying Differentiable
  NAS with Single-Stage Searching Protocol
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol
Konstanty Subbotko
Wojciech Jablonski
Piotr Bilinski
351
0
0
26 May 2024
BIMM: Brain Inspired Masked Modeling for Video Representation Learning
BIMM: Brain Inspired Masked Modeling for Video Representation Learning
Zhifan Wan
Jie Zhang
Chang-bo Li
Shiguang Shan
283
0
0
21 May 2024
Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting
Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting
Zhuliang Yu
Guohao Chen
Jiaxiang Wu
Yifan Zhang
Yaofo Chen
Peilin Zhao
Shuaicheng Niu
TTAOOD
392
16
0
18 Mar 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Collaboratively Self-supervised Video Representation Learning for Action RecognitionIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
516
3
0
15 Jan 2024
Universal Time-Series Representation Learning: A Survey
Universal Time-Series Representation Learning: A Survey
Patara Trirat
Yooju Shin
Junhyeok Kang
Youngeun Nam
Jihye Na
Minyoung Bae
Joeun Kim
Byunghyun Kim
Jae-Gil Lee
AI4TS
447
41
0
08 Jan 2024
PECoP: Parameter Efficient Continual Pretraining for Action Quality
  Assessment
PECoP: Parameter Efficient Continual Pretraining for Action Quality AssessmentIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Amirhossein Dadashzadeh
Shuchao Duan
Alan Whone
Majid Mirmehdi
297
25
0
11 Nov 2023
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
FGPrompt: Fine-grained Goal Prompting for Image-goal NavigationNeural Information Processing Systems (NeurIPS), 2023
Xinyu Sun
Peihao Chen
Jugang Fan
Thomas H. Li
Jian Chen
Zhuliang Yu
326
33
0
11 Oct 2023
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video
  Representation Learning
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation LearningACM Multimedia (ACM MM), 2023
Minghao Zhu
Xiao Lin
Ronghao Dang
Chengju Liu
Qi Chen
VGen
243
11
0
01 Sep 2023
Unsupervised Representation Learning for Time Series: A Review
Unsupervised Representation Learning for Time Series: A Review
Qianwen Meng
Hangwei Qian
Yong Liu
Yonghui Xu
Zhiqi Shen
Li-zhen Cui
AI4TS
280
31
0
03 Aug 2023
Language-based Action Concept Spaces Improve Video Self-Supervised
  Learning
Language-based Action Concept Spaces Improve Video Self-Supervised LearningNeural Information Processing Systems (NeurIPS), 2023
Kanchana Ranasinghe
Michael S. Ryoo
SSLVLM
498
16
0
20 Jul 2023
Vesper: A Compact and Effective Pretrained Model for Speech Emotion
  Recognition
Vesper: A Compact and Effective Pretrained Model for Speech Emotion RecognitionIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2023
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
345
75
0
20 Jul 2023
A Large-Scale Analysis on Self-Supervised Video Representation Learning
A Large-Scale Analysis on Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
361
3
0
09 Jun 2023
Masked Autoencoder for Unsupervised Video Summarization
Masked Autoencoder for Unsupervised Video Summarization
Minho Shim
Taeoh Kim
Jinhyung Kim
Dongyoon Wee
224
4
0
02 Jun 2023
PointCMP: Contrastive Mask Prediction for Self-supervised Learning on
  Point Cloud Videos
PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud VideosComputer Vision and Pattern Recognition (CVPR), 2023
Zhiqiang Shen
Xiaoxiao Sheng
Longguang Wang
Yike Guo
Qiong Liu
Xiaoping Zhou
3DPCSSL
288
26
0
06 May 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
VideoMAE V2: Scaling Video Masked Autoencoders with Dual MaskingComputer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
515
623
0
29 Mar 2023
Structured Video-Language Modeling with Temporal Grouping and Spatial
  Grounding
Structured Video-Language Modeling with Temporal Grouping and Spatial GroundingInternational Conference on Learning Representations (ICLR), 2023
Yuanhao Xiong
Long Zhao
Boqing Gong
Ming-Hsuan Yang
Florian Schroff
Ting Liu
Cho-Jui Hsieh
Liangzhe Yuan
VLM
353
0
0
28 Mar 2023
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video
  Representations for Semi-Supervised Action Recognition
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2023
I. Dave
Mamshad Nayeem Rizve
Chong Chen
M. Shah
TTA
281
28
0
28 Mar 2023
Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization
Tubelet-Contrastive Self-Supervision for Video-Efficient GeneralizationIEEE International Conference on Computer Vision (ICCV), 2023
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
ViT
399
13
0
20 Mar 2023
Multi-Task Self-Supervised Time-Series Representation Learning
Multi-Task Self-Supervised Time-Series Representation LearningInformation Sciences (Inf. Sci.), 2023
Heejeong Choi
Pilsung Kang
AI4TSSSL
322
21
0
02 Mar 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive
  Self-Supervised Learning
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised LearningMachine Vision and Applications (MVA), 2022
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
302
6
0
21 Dec 2022
MoQuad: Motion-focused Quadruple Construction for Video Contrastive
  Learning
MoQuad: Motion-focused Quadruple Construction for Video Contrastive Learning
Yuan Liu
Jiacheng Chen
Hao Wu
264
3
0
21 Dec 2022
MHCCL: Masked Hierarchical Cluster-Wise Contrastive Learning for
  Multivariate Time Series
MHCCL: Masked Hierarchical Cluster-Wise Contrastive Learning for Multivariate Time SeriesAAAI Conference on Artificial Intelligence (AAAI), 2022
Qianwen Meng
Hangwei Qian
Yong Liu
Li-zhen Cui
Yonghui Xu
Zhiqi Shen
AI4TS
485
53
0
02 Dec 2022
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant
  Spatiotemporal Tokens
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal TokensInternational Conference on Machine Learning (ICML), 2022
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
Sung Ju Hwang
463
15
0
19 Nov 2022
Semantic Video Moments Retrieval at Scale: A New Task and a Baseline
Semantic Video Moments Retrieval at Scale: A New Task and a Baseline
Na Li
294
0
0
15 Oct 2022
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation LearningComputer Vision and Pattern Recognition (CVPR), 2022
Xinyu Sun
Peihao Chen
Liang-Chieh Chen
Chan Li
Thomas H. Li
Zhuliang Yu
Chuang Gan
403
47
0
12 Oct 2022
Self-supervised Video Representation Learning with Motion-Aware Masked
  Autoencoders
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
Huanjin Yao
Yi Jiang
Xiatian Zhu
Zehuan Yuan
175
32
0
09 Oct 2022
Learning Transferable Spatiotemporal Representations from Natural Script
  Knowledge
Learning Transferable Spatiotemporal Representations from Natural Script KnowledgeComputer Vision and Pattern Recognition (CVPR), 2022
Ziyun Zeng
Yuying Ge
Xihui Liu
Bin Chen
Ping Luo
Shutao Xia
Yixiao Ge
AI4TS
272
9
0
30 Sep 2022
Temporal Contrastive Learning with Curriculum
Temporal Contrastive Learning with CurriculumIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Shuvendu Roy
Ali Etemad
323
4
0
02 Sep 2022
Motion Sensitive Contrastive Learning for Self-supervised Video
  Representation
Motion Sensitive Contrastive Learning for Self-supervised Video RepresentationEuropean Conference on Computer Vision (ECCV), 2022
Jingcheng Ni
Nana Zhou
Jie Qin
Qianrun Wu
Junqi Liu
Boxun Li
Di Huang
SSL
251
20
0
12 Aug 2022
DAS: Densely-Anchored Sampling for Deep Metric Learning
DAS: Densely-Anchored Sampling for Deep Metric LearningEuropean Conference on Computer Vision (ECCV), 2022
Lizhao Liu
Shan Huang
Zhuangwei Zhuang
Ran Yang
Zhuliang Yu
Yaowei Wang
325
14
0
30 Jul 2022
Static and Dynamic Concepts for Self-supervised Video Representation
  Learning
Static and Dynamic Concepts for Self-supervised Video Representation LearningEuropean Conference on Computer Vision (ECCV), 2022
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
SSL
279
28
0
26 Jul 2022
Hierarchical Semi-Supervised Contrastive Learning for
  Contamination-Resistant Anomaly Detection
Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly DetectionEuropean Conference on Computer Vision (ECCV), 2022
Gaoang Wang
Yibing Zhan
Xinchao Wang
Min-Gyoo Song
Klara Nahrstedt
184
14
0
24 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
MAR: Masked Autoencoders for Efficient Action RecognitionIEEE transactions on multimedia (IEEE TMM), 2022
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
321
64
0
24 Jul 2022
LocVTP: Video-Text Pre-training for Temporal Localization
LocVTP: Video-Text Pre-training for Temporal LocalizationEuropean Conference on Computer Vision (ECCV), 2022
Meng Cao
Tianyu Yang
Junwu Weng
Can Zhang
Jue Wang
Yuexian Zou
236
72
0
21 Jul 2022
Dual Contrastive Learning for Spatio-temporal Representation
Dual Contrastive Learning for Spatio-temporal RepresentationACM Multimedia (ACM MM), 2022
Shuangrui Ding
Rui Qian
H. Xiong
AI4TSSSL
175
26
0
12 Jul 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Exploring Temporally Dynamic Data Augmentation for Video RecognitionInternational Conference on Learning Representations (ICLR), 2022
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
302
18
0
30 Jun 2022
SLIC: Self-Supervised Learning with Iterative Clustering for Human
  Action Videos
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action VideosComputer Vision and Pattern Recognition (CVPR), 2022
S. H. Khorasgani
Yuxuan Chen
Florian Shkurti
SSL
287
32
0
25 Jun 2022
12
Next
Page 1 of 2