Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.12781
Cited By
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
17 July 2024
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
Michael Vasilkovsky
Hsin-Ying Lee
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control"
48 / 48 papers shown
Title
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Chenjie Cao
Jingkai Zhou
Shikai Li
Jingyun Liang
Chaohui Yu
Fan Wang
Xiangyang Xue
Yanwei Fu
DiffM
VGen
61
0
0
21 Apr 2025
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
Ruineng Li
Daitao Xing
Huiming Sun
Yuanzhou Ha
Jinglin Shen
C. Ho
DiffM
VGen
32
0
0
11 Apr 2025
Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization
Yikai Wang
Guangce Liu
Xinzhou Wang
Zilong Chen
Jiafang Li
Xin Liang
F. Sun
J. Zhu
3DGS
VGen
20
0
0
05 Apr 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Y. Wang
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVM
VGen
69
3
0
27 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
70
0
0
27 Mar 2025
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
Jinwei Li
Huan-ang Gao
Wenyi Li
Haohan Chi
Chenyu Liu
...
Yao Yao
Jingwei Zhao
Hongyang Li
Yikai Wang
Hao Zhao
61
0
0
26 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
34
1
0
25 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGen
LLMSV
39
1
0
15 Mar 2025
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Jianhong Bai
Menghan Xia
Xiao Fu
Xintao Wang
Lianrui Mu
...
Zuozhu Liu
Haoji Hu
Xiang Bai
Pengfei Wan
Di Zhang
DiffM
VGen
38
3
0
14 Mar 2025
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He
Ceyuan Yang
Shanchuan Lin
Yinghao Xu
Meng Wei
Liangke Gui
Qi Zhao
Gordon Wetzstein
Lu Jiang
Hongsheng Li
DiffM
VGen
95
5
0
13 Mar 2025
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
Runze Zhang
Guoguang Du
Xiaochuan Li
Qi Jia
Liang Jin
...
Zhenhua Guo
Yaqian Zhao
Xiaoli Gong
Rengang Li
Baoyu Fan
VGen
67
0
0
08 Mar 2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Mark YU
Wenbo Hu
Jinbo Xing
Ying Shan
VGen
79
3
0
07 Mar 2025
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Jinbo Xing
Long Mai
Cusuh Ham
Jiahui Huang
Aniruddha Mahapatra
Chi-Wing Fu
T. Wong
Feng Liu
DiffM
VGen
98
2
0
06 Feb 2025
AKiRa: Augmentation Kit on Rays for optical video generation
Xi Wang
Robin Courant
Marc Christie
Vicky Kalogeiton
VGen
98
3
0
31 Dec 2024
Wonderland: Navigating 3D Scenes from a Single Image
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
125
11
0
16 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffM
VGen
90
14
0
09 Dec 2024
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Chaoyang Wang
Peiye Zhuang
Tuan Duc Ngo
Willi Menapace
Aliaksandr Siarohin
Michael Vasilkovsky
Ivan Skorokhodov
Sergey Tulyakov
Peter Wonka
Hsin-Ying Lee
DiffM
VGen
82
3
0
05 Dec 2024
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Yifan Lu
Xuanchi Ren
Jiawei Yang
Tianchang Shen
Zhangjie Wu
...
Y. Wang
Siheng Chen
Mike Chen
Sanja Fidler
Jiahui Huang
VGen
74
3
0
05 Dec 2024
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
Yuelei Wang
Jian Zhang
Pengtao Jiang
H. Zhang
Jinwei Chen
Bo Li
VGen
DiffM
105
2
0
02 Dec 2024
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
Rundi Wu
Ruiqi Gao
Ben Poole
Alex Trevithick
Changxi Zheng
Jonathan T. Barron
Aleksander Holyñski
VGen
68
17
0
27 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
55
1
0
12 Nov 2024
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
David Junhao Zhang
Roni Paiss
Shiran Zada
Nikhil Karnad
David E. Jacobs
Yael Pritch
Inbar Mosseri
Mike Zheng Shou
Neal Wadhwa
Nataniel Ruiz
DiffM
VGen
66
14
0
07 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
F. Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
28
27
0
07 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
41
13
0
07 Nov 2024
Framer: Interactive Frame Interpolation
Wen Wang
Qiuyu Wang
Kecheng Zheng
Hao Ouyang
Zhekai Chen
Biao Gong
Hao Chen
Yujun Shen
Chunhua Shen
VGen
34
4
0
24 Oct 2024
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Aakanksha
Arash Ahmadian
Seraphina Goldfarb-Tarrant
B. Ermiş
Marzieh Fadaee
Sara Hooker
MoMe
55
10
0
14 Oct 2024
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning
Etai Littwin
Vimal Thilak
Anand Gopalakrishnan
19
8
0
14 Oct 2024
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Wangbo Yu
Jinbo Xing
Li Yuan
Wenbo Hu
Xiaoyu Li
Zhipeng Huang
Xiangjun Gao
T. Wong
Ying Shan
Yonghong Tian
VGen
DiffM
28
66
0
03 Sep 2024
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Monika Zimmermann
Jacek Naruniec
Christopher Schroers
Markus Gross
Romann M. Weber
VGen
DiffM
27
3
0
01 Aug 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
45
27
0
10 Jul 2024
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
Haiyu Zhang
Xinyuan Chen
Yaohui Wang
Xihui Liu
Yunhong Wang
Yu Qiao
VGen
58
27
0
31 May 2024
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation
Tianyuan Zhang
Hong-Xing Yu
Rundi Wu
Brandon Yushan Feng
Changxi Zheng
Noah Snavely
Jiajun Wu
William T. Freeman
AI4CE
VGen
54
9
0
19 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
56
54
0
10 Apr 2024
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
Hao He
Yinghao Xu
Yuwei Guo
Gordon Wetzstein
Bo Dai
Hongsheng Li
Ceyuan Yang
DiffM
VGen
83
115
0
02 Apr 2024
TC4D: Trajectory-Conditioned Text-to-4D Generation
Sherwin Bahmani
Xian Liu
Yifan Wang
Ivan Skorokhodov
Victor Rong
...
Jeong Joon Park
Sergey Tulyakov
Gordon Wetzstein
Andrea Tagliasacchi
David B. Lindell
94
9
0
26 Mar 2024
STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Yifei Zeng
Yanqin Jiang
Siyu Zhu
Yuanxun Lu
Youtian Lin
Hao Zhu
Weiming Hu
Xun Cao
Yao Yao
3DGS
65
14
0
22 Mar 2024
DreamReward: Text-to-3D Generation with Human Preference
Junliang Ye
Fangfu Liu
Qixiu Li
Zhengyi Wang
Yikai Wang
Xinzhou Wang
Yueqi Duan
Jun Zhu
51
20
0
21 Mar 2024
GVGEN: Text-to-3D Generation with Volumetric Representation
Xianglong He
Junyi Chen
Sida Peng
Di Huang
Yangguang Li
Xiaoshui Huang
Chun Yuan
Wanli Ouyang
Tong He
3DGS
DiffM
54
7
0
19 Mar 2024
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han
Filippos Kokkinos
Philip H. S. Torr
VGen
60
16
0
18 Mar 2024
BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis
Lutao Jiang
Lin Wang
3DGS
56
13
0
17 Mar 2024
TripoSR: Fast 3D Object Reconstruction from a Single Image
Dmitry Tochilkin
David Pankratz
Zexiang Liu
Zixuan Huang
Adam Letts
Yangguang Li
Ding Liang
Christian Laforte
Varun Jampani
Yan-Pei Cao
89
128
0
04 Mar 2024
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
Jiaxiang Tang
Zhaoxi Chen
Xiaokang Chen
Tengfei Wang
Gang Zeng
Ziwei Liu
3DGS
3DV
79
97
0
07 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGen
DiffM
64
21
0
05 Feb 2024
CAD: Photorealistic 3D Generation via Adversarial Distillation
Ziyu Wan
Despoina Paschalidou
Ian Huang
Hongyu Liu
Bokui Shen
Xiaoyu Xiang
Jing Liao
Leonidas J. Guibas
DiffM
54
7
0
11 Dec 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
150
985
0
25 Nov 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
86
52
0
16 Oct 2023
Light Field Diffusion for Single-View Novel View Synthesis
Yifeng Xiong
Haoyu Ma
Shanlin Sun
Kun Han
Hao Tang
Xiaohui Xie
DiffM
18
2
0
20 Sep 2023
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Junshu Tang
Tengfei Wang
Bo Zhang
Ting Zhang
Ran Yi
Lizhuang Ma
Dong Chen
DiffM
179
218
0
24 Mar 2023
1