Learning to See by Moving
arXiv:1505.01596, versions v1 and v2 (latest)

7 May 2015
Pulkit Agrawal
João Carreira
Jitendra Malik
    SSL

Papers citing "Learning to See by Moving"

50 / 326 papers shown
VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models
Jesimon Barreto
C. Caetano
A. Araújo
William Robson Schwartz
VLM
120
0
0
23 Oct 2025
Learning to Navigate Socially Through Proactive Risk Perception
Erjia Xiao
Lingfeng Zhang
Yingbo Tang
Hao Cheng
Zhanchen Zhu
Wenbo Ding
Lei Zhou
L. Chen
Hangjun Ye
Xiaoshuai Hao
180
0
0
09 Oct 2025
Self-supervised Representation Learning with Local Aggregation for Image-based Profiling
Siran Dai
Qianqian Xu
Peisong Wen
Yang Liu
Qingming Huang
255
2
0
17 Jun 2025
Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection
Ayush K. Rai
Kyle Min
Tarun Krishna
Feiyan Hu
Alan F. Smeaton
Noel E. O'Connor
VGen
306
0
0
13 May 2025
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
Aviv Slobodkin
Hagai Taitelbaum
Yonatan Bitton
Brian Gordon
Michal Sokolik
Nitzan Bitton-Guetta
Almog Gueta
Royi Rassin
Itay Laish
Dani Lischinski
EGVM VGen
332
1
0
24 Apr 2025
Random Walks in Self-supervised Learning for Triangular Meshes
Gal Yefet
A. Tal
SSL
258
0
0
02 Mar 2025
General Intelligence Requires Reward-based Pretraining
Seungwook Han
Jyothish Pari
Samuel J. Gershman
Pulkit Agrawal
LRM
773
2
0
26 Feb 2025
Scaling 4D Representations
João Carreira
Dilara Gokay
Michael King
Chuhan Zhang
Ignacio Rocco
...
Viorica Patraucean
Dima Damen
Pauline Luc
Mehdi S. M. Sajjadi
Andrew Zisserman
387
18
0
19 Dec 2024
From Cognition to Precognition: A Future-Aware Framework for Social Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Zeying Gong
Tianshuai Hu
Ronghe Qiu
Junwei Liang
782
9
0
20 Sep 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
International Conference on Learning Representations (ICLR), 2024
Alex N. Wang
Christopher Hoang
Yuwen Xiong
Yann LeCun
Mengye Ren
443
4
0
20 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
International Journal of Computer Vision (IJCV), 2024
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
352
4
0
29 Jul 2024
Self-supervised visual learning from interactions with objects
A. Aubret
Céline Teulière
Jochen Triesch
210
9
0
09 Jul 2024
Semantic Graph Consistency: Going Beyond Patches for Regularizing Self-Supervised Vision Transformers
Chaitanya Devaguptapu
Sumukh K. Aithal
Shrinivas Ramasubramanian
Moyuru Yamada
Manohar Kaul
ViT
277
0
0
18 Jun 2024
The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning
Dulhan Jayalath
Gilad Landau
Brendan Shillingford
M. Woolrich
Oiwi Parker Jones
SSL
398
15
0
06 Jun 2024
Unsupervised learning based object detection using Contrastive Learning
Chandan Kumar
Jansel Herrera-Gerena
John Just
Matthew J. Darr
Ali Jannesari
SSL
180
2
0
21 Feb 2024
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
183
6
0
21 Dec 2023
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
International Conference on Learning Representations (ICLR), 2023
Shashanka Venkataramanan
Mamshad Nayeem Rizve
João Carreira
Yuki M. Asano
Yannis Avrithis
SSL
222
33
0
12 Oct 2023
Efficient Planning with Latent Diffusion
International Conference on Learning Representations (ICLR), 2023
Wenhao Li
DiffM
346
10
0
30 Sep 2023
Flow Factorized Representation Learning
Neural Information Processing Systems (NeurIPS), 2023
Yue Song
Thomas Anderson Keller
Andrii Zadaianchuk
Max Welling
DRL OOD
302
5
0
22 Sep 2023
DeViL: Decoding Vision features into Language
Meghal Dani
Isabel Rio-Torto
Stephan Alaniz
Zeynep Akata
VLM
147
11
0
04 Sep 2023
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Neural Information Processing Systems (NeurIPS), 2023
Kanchana Ranasinghe
Michael S. Ryoo
SSL VLM
390
15
0
20 Jul 2023
LowDINO -- A Low Parameter Self Supervised Learning Model
Sai Krishna Prathapaneni
Shvejan Shashank
K. Srikar Reddy
217
0
0
28 May 2023
Siamese Masked Autoencoders
Neural Information Processing Systems (NeurIPS), 2023
Agrim Gupta
Jiajun Wu
Gaowen Liu
Li Fei-Fei
136
80
0
23 May 2023
Self-Supervised Learning for Point Clouds Data: A Survey
Expert Systems with Applications (ESWA), 2023
Changyu Zeng
Wei Wang
A. Nguyen
Yutao Yue
3DPC
167
0
0
09 May 2023
Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image
Heng Yu
Z. Á. Milacski
László A. Jeni
3DV 3DH
169
2
0
24 Apr 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa FedML SSL
363
356
0
24 Apr 2023
Self-Supervised Video Similarity Learning
Giorgos Kordopatis-Zilos
Giorgos Tolias
Christos Tzelepis
I. Kompatsiaris
Ioannis Patras
Symeon Papadopoulos
SSL
214
14
0
06 Apr 2023
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
J. Hernandez
Ruben Villegas
Vicente Ordonez
SSL
152
2
0
21 Mar 2023
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images
Machine Intelligence Research (MIR), 2023
Wei-Chien Wang
Euijoon Ahn
Da-wei Feng
Jinman Kim
MedIm
540
35
0
10 Feb 2023
Advancing Radiograph Representation Learning with Masked Record Modeling
International Conference on Learning Representations (ICLR), 2023
Hong-Yu Zhou
Chenyu Lian
Lian-cheng Wang
Yizhou Yu
MedIm
244
83
0
30 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
500
325
0
13 Jan 2023
MoQuad: Motion-focused Quadruple Construction for Video Contrastive Learning
Yuan Liu
Jiacheng Chen
Hao Wu
213
3
0
21 Dec 2022
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
196
12
0
20 Nov 2022
Local Manifold Augmentation for Multiview Semantic Consistency
Yuzhou Nie
Wing Yin Cheung
Chang-rui Liu
Xiang Ji
211
3
0
05 Nov 2022
Inference and Denoise: Causal Inference-based Neural Speech Enhancement
International Workshop on Machine Learning for Signal Processing (MLSP), 2022
Tsun-An Hsieh
Chao-Han Huck Yang
Pin-Yu Chen
Sabato Marco Siniscalchi
Yu Tsao
CML
189
2
0
02 Nov 2022
Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future
Guo-Jun Qi
M. Shah
SSL
126
8
0
23 Oct 2022
MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose
International Conference on Machine Learning (ICML), 2022
Yang Fu
Ishan Misra
Xiaolong Wang
MDE
164
12
0
13 Oct 2022
Extraneousness-Aware Imitation Learning
IEEE International Conference on Robotics and Automation (ICRA), 2022
Rachel Zheng
Kaizhe Hu
Zhecheng Yuan
Boyuan Chen
Huazhe Xu
SSL
244
4
0
04 Oct 2022
Leveraging Self-Supervised Training for Unintentional Action Recognition
Enea Duka
Anna Kukleva
Bernt Schiele
135
2
0
23 Sep 2022
Pixel-Wise Prediction based Visual Odometry via Uncertainty Estimation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Haoming Chen
Tingbo Liao
Hsuan-Kung Yang
Chun-Yi Lee
166
2
0
18 Aug 2022
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Xufeng Zhao
C. Weber
Muhammad Burhan Hafez
S. Wermter
140
10
0
04 Aug 2022
MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior
International Conference on Machine Learning (ICML), 2022
Jennifer J. Sun
Markus Marks
Andrew Ulmer
Dipam Chakraborty
Brian Geuther
...
Joseph Parker
Pietro Perona
Yisong Yue
K. Branson
Ann Kennedy
195
13
0
21 Jul 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
European Conference on Computer Vision (ECCV), 2022
Yaqian Liang
Shanshan Zhao
Baosheng Yu
Jing Zhang
Fazhi He
ViT
154
53
0
20 Jul 2022
Structural Causal 3D Reconstruction
European Conference on Computer Vision (ECCV), 2022
Weiyang Liu
Zhen Liu
Liam Paull
Adrian Weller
Bernhard Schölkopf
3DV CML
268
17
0
20 Jul 2022
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping
European Conference on Computer Vision (ECCV), 2022
Bo Pang
Yifan Zhang
Yaoyi Li
Jia Cai
Cewu Lu
SSL
174
33
0
13 Jul 2022
Pixel-level Correspondence for Self-Supervised Learning from Video
Yash Sharma
Yi Zhu
Chris Russell
Thomas Brox
SSL
163
4
0
08 Jul 2022
Visual Pre-training for Navigation: What Can We Learn from Noise?
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Yanwei Wang
Ching-Yun Ko
Pulkit Agrawal
SSL
368
7
0
30 Jun 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
International Journal of Computer Vision (IJCV), 2022
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
198
8
0
21 Jun 2022
Learning Behavior Representations Through Multi-Timescale Bootstrapping
Mehdi Azabou
Michael J. Mendelson
Maks Sorokin
S. Thakoor
Nauman Ahad
Carolina Urzay
Eva L. Dyer
AI4CE
195
8
0
14 Jun 2022
Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Liulei Li
Tianfei Zhou
Wenguan Wang
Pu Cao
Jian-Wei Li
Yi Yang
SSL
262
55
0
27 Mar 2022