RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements

Abstract
Recent advances in camera-controllable video generation have been constrained by the reliance on static-scene datasets with relative-scale camera annotations, such as RealEstate10K. While these datasets enable basic viewpoint control, they fail to capture dynamic scene interactions and lack metric-scale geometric consistency-critical for synthesizing realistic object motions and precise camera trajectories in complex environments. To bridge this gap, we introduce the first fully open-source, high-resolution dynamic-scene dataset with metric-scale camera annotations inthis https URL.
View on arXiv@article{zheng2025_2504.08212, title={ RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements }, author={ Guangcong Zheng and Teng Li and Xianpan Zhou and Xi Li }, journal={arXiv preprint arXiv:2504.08212}, year={ 2025 } }
Comments on this paper