2312.16256
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision
26 December 2023
Lu Ling
Yichen Sheng
Zhi Tu
Wentian Zhao
Cheng Xin
Kun Wan
Lantao Yu
Qianyu Guo
Zixun Yu
Yawen Lu
Xuanmao Li
Xingpeng Sun
Rohan Ashok
Aniruddha Mukherjee
Hao Kang
Xiangrui Kong
Gang Hua
Tianti Zhang
Bedrich Benes
Aniket Bera
VGen
Papers citing
"DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision"
50 / 72 papers shown
Title
Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation
Lu Ling
C. Lin
Tsung-Yi Lin
Yifan Ding
Y. Zeng
Yichen Sheng
Yunhao Ge
Ming-Yu Liu
Aniket Bera
Zhaoshuo Li
VGen
3DV
42
0
0
05 May 2025
Controllable Weather Synthesis and Removal with Video Diffusion Models
Chih-Hao Lin
Z. Wang
Ruofan Liang
Yuxuan Zhang
Sanja Fidler
Shenlong Wang
Zan Gojcic
DiffM
VGen
40
0
0
01 May 2025
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
60
0
0
01 May 2025
Dynamic Camera Poses and Where to Find Them
C. Rockwell
Joseph Tung
Tsung-Yi Lin
Ming-Yu Liu
David Fouhey
Chen-Hsuan Lin
35
0
0
24 Apr 2025
Towards Understanding Camera Motions in Any Video
Zhiqiu Lin
Siyuan Cen
Daniel Jiang
Jay Karhade
Hewei Wang
...
Rushikesh Zawar
Xue Bai
Yilun Du
Chuang Gan
Deva Ramanan
VGen
23
0
0
21 Apr 2025
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Chenjie Cao
Jingkai Zhou
Shikai Li
Jingyun Liang
Chaohui Yu
Fan Wang
Xiangyang Xue
Yanwei Fu
DiffM
VGen
61
0
0
21 Apr 2025
RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
Guangcong Zheng
Teng Li
Xianpan Zhou
Xi Li
VGen
3DV
53
1
0
11 Apr 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
Mengchen Zhang
Tong Wu
Jing Tan
Ziwei Liu
Gordon Wetzstein
D. Lin
VGen
21
0
0
09 Apr 2025
3D Scene Understanding Through Local Random Access Sequence Modeling
Wanhee Lee
Klemen Kotar
R. Venkatesh
Jared Watrous
Honglin Chen
Khai Loong Aw
Daniel L. K. Yamins
3DV
34
0
0
04 Apr 2025
OmniCam: Unified Multimodal Video Generation via Camera Control
Xiaoda Yang
Jiayang Xu
Kaixuan Luan
Xinyu Zhan
Hongshun Qiu
...
Shuai Yang
Li Zhang
Checheng Yu
Cewu Lu
Lixin Yang
DiffM
VGen
62
0
0
03 Apr 2025
Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis
Niluthpol Chowdhury Mithun
Tuan Pham
Qiao Wang
Ben Southall
Kshitij Minhas
Bogdan Matei
Stephan Mandt
S. Samarasekera
Rakesh Kumar
3DGS
43
0
0
02 Apr 2025
FlowR: Flowing from Sparse to Dense 3D Reconstructions
Tobias Fischer
Samuel Rota Buló
Yung-Hsu Yang
Nikhil Varma Keetha
Lorenzo Porzi
Norman Muller
Katja Schwarz
Jonathon Luiten
Marc Pollefeys
Peter Kontschieder
3DGS
39
0
0
02 Apr 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu
Xiangjun Gao
Wenbo Hu
Xiaoyu Li
Song-Hai Zhang
Ying Shan
VGen
MDE
56
1
0
01 Apr 2025
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
Sibo Wu
Congrong Xu
Binbin Huang
Andreas Geiger
Anpei Chen
VGen
48
0
0
27 Mar 2025
HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Xinpeng Liu
Zeyi Huang
Fumio Okura
Y. Matsushita
39
0
0
25 Mar 2025
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Zihang Lai
Andrea Vedaldi
34
0
0
25 Mar 2025
Accenture-NVS1: A Novel View Synthesis Dataset
Thomas Sugg
Kyle O'Brien
Lekh Poudel
Alex Dumouchelle
Michelle Jou
Marc Bosch
Deva Ramanan
S. Narasimhan
Shubham Tulsiani
32
1
0
24 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
74
1
0
20 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Christos Sakaridis
Mattia Segu
Y. Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
37
0
0
20 Mar 2025
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
Hyojun Go
Byeongjun Park
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
3DGS
VGen
92
1
0
20 Mar 2025
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Pietro Michiardi
60
0
0
18 Mar 2025
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
54
0
0
18 Mar 2025
Bolt3D: Generating 3D Scenes in Seconds
Stanislaw Szymanowicz
Jason Y. Zhang
P. Srinivasan
Ruiqi Gao
Arthur Brussee
Aleksander Holynski
Ricardo Martín Brualla
Jonathan T. Barron
Philipp Henzler
90
4
0
18 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGen
LLMSV
47
1
0
15 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
46
6
0
14 Mar 2025
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He
Ceyuan Yang
Shanchuan Lin
Yinghao Xu
Meng Wei
Liangke Gui
Qi Zhao
Gordon Wetzstein
Lu Jiang
Hongsheng Li
DiffM
VGen
95
5
0
13 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
65
0
0
12 Mar 2025
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
Runze Zhang
Guoguang Du
Xiaochuan Li
Qi Jia
Liang Jin
...
Zhenhua Guo
Yaqian Zhao
Xiaoli Gong
Rengang Li
Baoyu Fan
VGen
70
0
0
08 Mar 2025
StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams
Yang LI
Jinglu Wang
Lei Chu
Xiao Li
Shiu-hong Kao
Ying Chen
Yan Lu
3DGS
47
0
0
08 Mar 2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Mark YU
Wenbo Hu
Jinbo Xing
Ying Shan
VGen
85
3
0
07 Mar 2025
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Xuanchi Ren
Tianchang Shen
Jiahui Huang
Huan Ling
Yifan Lu
Merlin Nimier-David
Thomas Muller
Alexander Keller
Sanja Fidler
Jun Gao
DiffM
VGen
69
8
0
05 Mar 2025
MUSt3R: Multi-view Network for Stereo 3D Reconstruction
Yohann Cabon
Lucas Stoffl
L. Antsfeld
G. Csurka
Boris Chidlovskii
Jérôme Revaud
Vincent Leroy
3DGS
3DV
48
2
0
03 Mar 2025
Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
Jay Zhangjie Wu
Yuxuan Zhang
Haithem Turki
Xuanchi Ren
Jun Gao
Mike Zheng Shou
Sanja Fidler
Zan Gojcic
Huan Ling
44
1
0
03 Mar 2025
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Luigi Piccinelli
Christos Sakaridis
Y. Yang
Mattia Segu
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
41
6
0
27 Feb 2025
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
Shangzhan Zhang
Jianyuan Wang
Yinghao Xu
Nan Xue
Christian Rupprecht
Xiaowei Zhou
Yujun Shen
Gordon Wetzstein
101
7
0
17 Feb 2025
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
Xingyi He
Hao Yu
Sida Peng
Dongli Tan
Zehong Shen
Hujun Bao
Xiaowei Zhou
39
3
0
13 Jan 2025
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Z. Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
105
1
0
18 Dec 2024
Wonderland: Navigating 3D Scenes from a Single Image
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
125
11
0
16 Dec 2024
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Yue Chen
Xingyu Chen
Anpei Chen
Gerard Pons-Moll
Yuliang Xiu
3DGS
83
2
0
12 Dec 2024
LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors
Yabo Chen
Chen-Ning Yang
Jiemin Fang
Xiaopeng Zhang
Lingxi Xie
Wei-Ming Shen
Wenrui Dai
Hongkai Xiong
Q. Tian
3DGS
91
2
0
12 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffM
VGen
102
14
0
09 Dec 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
93
3
0
26 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
59
5
0
25 Nov 2024
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Chenjie Cao
Chaohui Yu
Shang Liu
Fan Wang
Xiangyang Xue
Yanwei Fu
75
1
0
25 Nov 2024
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
Hyojun Go
Byeongjun Park
Jiho Jang
Jin-Young Kim
Soonwoo Kwon
Changick Kim
3DGS
111
2
0
25 Nov 2024
Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors
Soumava Paul
Prakhar Kaushik
Alan L. Yuille
3DGS
DiffM
86
0
0
24 Nov 2024
Generating 3D-Consistent Videos from Unposed Internet Photos
Gene Chou
Kai Zhang
Sai Bi
Hao Tan
Zexiang Xu
Fujun Luan
Bharath Hariharan
Noah Snavely
3DGS
VGen
75
3
0
20 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
F. Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
46
31
0
07 Nov 2024
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Yuedong Chen
Chuanxia Zheng
Haofei Xu
Bohan Zhuang
Andrea Vedaldi
Tat-Jen Cham
Jianfei Cai
3DGS
53
14
0
07 Nov 2024
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Botao Ye
Sifei Liu
Haofei Xu
Xueting Li
Marc Pollefeys
Ming Yang
Songyou Peng
25
21
0
31 Oct 2024