Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.10314
Cited By
CAT3D: Create Anything in 3D with Multi-View Diffusion Models
16 May 2024
Ruiqi Gao
Aleksander Holynski
Philipp Henzler
Arthur Brussee
Ricardo Martín Brualla
Pratul P. Srinivasan
Jonathan T. Barron
Ben Poole
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CAT3D: Create Anything in 3D with Multi-View Diffusion Models"
31 / 31 papers shown
Title
Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis
Michal Geyer
Omer Tov
Linyi Jin
Richard Tucker
Inbar Mosseri
Tali Dekel
Noah Snavely
DiffM
VGen
88
0
0
30 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Xiaohui Zeng
VGen
MDE
31
0
0
15 Apr 2025
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
Sibo Wu
Congrong Xu
Binbin Huang
Andreas Geiger
Anpei Chen
VGen
55
0
0
27 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
69
0
0
12 Mar 2025
View-Invariant Policy Learning via Zero-Shot Novel View Synthesis
Stephen Tian
Blake Wulfe
Kyle Sargent
Katherine Liu
Sergey Zakharov
Vitor Campagnolo Guizilini
Jiajun Wu
70
10
0
21 Feb 2025
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
Kaixin Yao
Longwen Zhang
Xinhao Yan
Yan Zeng
Qixuan Zhang
Wei Yang
Lan Xu
Jiayuan Gu
Jingyi Yu
22
2
0
18 Feb 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
103
4
0
11 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
109
5
0
10 Feb 2025
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze
Zhenbo Song
Qiwei Wang
Jianfeng Lu
Yujiao Shi
46
0
0
05 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
69
7
0
28 Jan 2025
CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation
Alex Berian
Daniel Brignac
JhihYang Wu
Natnael Daba
Abhijit Mahalanobis
DiffM
50
1
0
20 Jan 2025
Wonderland: Navigating 3D Scenes from a Single Image
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
125
11
0
16 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffM
VGen
107
14
0
09 Dec 2024
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
106
2
0
02 Dec 2024
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Chenjie Cao
Chaohui Yu
Shang Liu
Fan Wang
Xiangyang Xue
Yanwei Fu
81
1
0
25 Nov 2024
Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors
Soumava Paul
Prakhar Kaushik
Alan L. Yuille
3DGS
DiffM
88
0
0
24 Nov 2024
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng
Jiawei Liu
Pengqi Tu
Tianhao Qi
Mingzhen Sun
Tianxiang Ma
Songtao Zhao
Siyu Zhou
Qian He
VGen
47
7
0
10 Nov 2024
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Guosheng Zhao
Chaojun Ni
Xiaofeng Wang
Zheng Zhu
X. Zhang
...
Xinze Chen
Boyuan Wang
Youyi Zhang
Wenjun Mei
Xingang Wang
VGen
73
24
0
17 Oct 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
72
41
0
17 Jul 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
50
27
0
10 Jul 2024
WonderWorld: Interactive 3D Scene Generation from a Single Image
Hong-Xing Yu
Haoyi Duan
Charles Herrmann
William T. Freeman
Jiajun Wu
3DGS
VGen
41
37
0
13 Jun 2024
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Hao Wen
Zehuan Huang
Yaohui Wang
Xinyuan Chen
Yu Qiao
97
7
0
05 Jun 2024
Images that Sound: Composing Images and Sounds on a Single Canvas
Ziyang Chen
Daniel Geng
Andrew Owens
DiffM
44
8
0
20 May 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
62
54
0
10 Apr 2024
Lumiere: A Space-Time Diffusion Model for Video Generation
Omer Bar-Tal
Hila Chefer
Omer Tov
Charles Herrmann
Roni Paiss
...
T. Michaeli
Oliver Wang
Deqing Sun
Tali Dekel
Inbar Mosseri
VGen
101
214
0
23 Jan 2024
NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber
Aleksander Holyñski
Varun Jampani
Saurabh Saxena
Noah Snavely
Abhishek Kar
Angjoo Kanazawa
60
30
0
07 Dec 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
150
985
0
25 Nov 2023
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Junshu Tang
Tengfei Wang
Bo Zhang
Ting Zhang
Ran Yi
Lizhuang Ma
Dong Chen
DiffM
179
307
0
24 Mar 2023
FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization
Jiawei Yang
Marco Pavone
Yue Wang
113
260
0
13 Mar 2023
3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Anchit Gupta
Wenhan Xiong
Yixin Nie
Anchit Gupta
Barlas Oğuz
DiffM
91
156
0
09 Mar 2023
RealFusion: 360° Reconstruction of Any Object from a Single Image
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
Andrea Vedaldi
85
288
0
21 Feb 2023
1