arXiv:2403.12008
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
European Conference on Computer Vision (ECCV), 2024
18 March 2024
Vikram S. Voleti
Chun-Han Yao
Mark Boss
Adam Letts
David Pankratz
Dmitry Tochilkin
Christian Laforte
Robin Rombach
Varun Jampani
DiffM
VGen
Papers citing "SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion"
50 / 130 papers shown
Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object
Yuxuan Lin
Ruihang Chu
Zhenyu Chen
Xiao Tang
Lei Ke
...
Zhihao Li
Shiyong Liu
Xiaofei Wu
Jianzhuang Liu
Yujiu Yang
184
0
0
29 May 2025
Is Single-View Mesh Reconstruction Ready for Robotics?
Frederik Nolte
Andreas Geiger
Bernhard Schölkopf
Ingmar Posner
449
2
0
23 May 2025
PhyMAGIC: Physical Motion-Aware Generative Inference with Confidence-guided LLM
Siwei Meng
Yawei Luo
Ping Liu
DiffM
VGen
261
1
0
22 May 2025
DiMeR: Disentangled Mesh Reconstruction Model
Lutao Jiang
Jiantao Lin
Kanghao Chen
Wenhang Ge
Xin Yang
Yifan Jiang
Yuanhuiyi Lyu
Xu Zheng
Yinchuan Li
Yingcong Chen
3DV
540
6
0
24 Apr 2025
R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors
Haoyang Wang
Liming Liu
Peiheng Wang
Junlin Hao
Jiangkai Wu
Xinggong Zhang
217
0
0
16 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
407
4
0
15 Apr 2025
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models
Stathis Galanakis
Alexandros Lattas
Stylianos Moschoglou
Bernhard Kainz
Stefanos Zafeiriou
DiffM
398
0
0
14 Apr 2025
H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models
Yushu Wu
Yanyu Li
Ivan Skorokhodov
Vidit Goel
Willi Menapace
Sharath Girish
Aliaksandr Siarohin
Yanzhi Wang
Sergey Tulyakov
DiffM
VGen
376
5
0
14 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
429
37
0
10 Apr 2025
Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Yikai Wang
Guangce Liu
Xinzhou Wang
Zilong Chen
Jiafang Li
Xin Liang
F. Sun
J. Zhu
3DGS
VGen
365
3
0
05 Apr 2025
Distilling Multi-view Diffusion Models into 3D Generators
Hao Qin
Luyuan Chen
Ming Kong
Mengxu Lu
Qiang Zhu
3DGS
540
1
0
01 Apr 2025
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min
Daehyeon Choi
Kyeongmin Yeo
Jihyun Lee
Minhyuk Sung
478
1
0
28 Mar 2025
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
350
10
0
28 Mar 2025
3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models
Yujiao Shi
Mengchen Zhang
Tong Wu
Tengfei Wang
Gordon Wetzstein
Dahua Lin
Yu Qiao
ELM
600
6
0
27 Mar 2025
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
Computer Vision and Pattern Recognition (CVPR), 2025
Zhiyuan Ma
Xinyue Liang
Rongyuan Wu
Xiangyu Zhu
Zhen Lei
Lei Zhang
300
2
0
27 Mar 2025
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Sangwon Beak
Hyeonwoo Kim
Hanbyul Joo
301
3
0
25 Mar 2025
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
Zhiqiang Yuan
Ting Zhang
Ying Deng
Jiapei Zhang
Yeshuang Zhu
Zexi Jia
Jie Zhou
Jinchao Zhang
VGen
240
2
0
22 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGS
VGen
533
22
0
20 Mar 2025
CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image
Arindam Dutta
Meng Zheng
Zhongpai Gao
Benjamin Planche
Anwesha Choudhuri
Terrence Chen
Amit K. Roy-Chowdhury
Ziyan Wu
3DH
277
4
0
19 Mar 2025
Bolt3D: Generating 3D Scenes in Seconds
Stanislaw Szymanowicz
Jason Y. Zhang
P. Srinivasan
Ruiqi Gao
Arthur Brussee
Aleksander Holynski
Ricardo Martín Brualla
Jonathan T. Barron
Philipp Henzler
406
25
0
18 Mar 2025
SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2025
Yucheng Mao
Boyang Wang
Nilesh Kulkarni
Jeong Joon Park
DiffM
344
1
0
18 Mar 2025
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Ping Liu
Yawei Luo
562
2
0
18 Mar 2025
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
Ruijie Lu
Yixin Chen
Yu Liu
Jiaxiang Tang
Junfeng Ni
Diwen Wan
Gang Zeng
Siyuan Huang
DiffM
VGen
458
8
0
15 Mar 2025
V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes
Yanming Zhang
Jun-Kun Chen
Jipeng Lyu
Yu-Xiong Wang
DiffM
VGen
327
2
0
13 Mar 2025
CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction
Z. Wu
Xibin Song
Senbo Wang
Weizhe Liu
Jiayu Yang
...
Shenzhou Chen
Taizhang Shang
Weixuan Sun
Shan Luo
Pan Ji
DiffM
232
2
0
13 Mar 2025
WonderVerse: Extendable 3D Scene Generation with Video Generative Models
Hao Feng
Zhi Zuo
Jia-Hui Pan
Ka-Hei Hui
Yihua Shao
Qi Dou
Wei Xie
Zhengzhe Liu
VGen
450
4
0
12 Mar 2025
V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video
Jianqi Chen
Biao Zhang
Xiangjun Tang
Peter Wonka
VGen
303
13
0
11 Mar 2025
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Computer Vision and Pattern Recognition (CVPR), 2025
Yuhan Wang
Fangzhou Hong
Shuai Yang
Liming Jiang
Wayne Wu
Chen Change Loy
VGen
277
2
0
11 Mar 2025
High-Quality 3D Head Reconstruction from Any Single Portrait Image
Jianfu Zhang
Yujie Gao
Jiahui Zhan
Wentao Wang
Yiyi Zhang
H. Zhao
Liqing Zhang
3DH
283
0
0
11 Mar 2025
GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation
Ye Tao
Jiawei Zhang
Yahao Shi
Dongqing Zou
Bin Zhou
3DGS
369
1
0
08 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
285
9
0
06 Mar 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
381
8
0
03 Mar 2025
GenVDM: Generating Vector Displacement Maps From a Single Image
Computer Vision and Pattern Recognition (CVPR), 2025
Yuezhi Yang
Qimin Chen
Vladimir G. Kim
S. Chaudhuri
Qixing Huang
Zheyu Chen
3DGS
VGen
276
2
0
01 Mar 2025
3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation
Hansheng Chen
Bokui Shen
Yulin Liu
Ruoxi Shi
Linqi Zhou
Connor Z. Lin
Jiayuan Gu
H. Su
Gordon Wetzstein
Leonidas Guibas
410
10
0
21 Feb 2025
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
ACM Transactions on Graphics (TOG), 2025
Kaixin Yao
Longwen Zhang
Xinhao Yan
Yan Zeng
Qixuan Zhang
Wei Yang
Lan Xu
Jiayuan Gu
Jingyi Yu
421
40
0
18 Feb 2025
When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Pingping Zhang
Jinlong Li
Kecheng Chen
Meng Wang
Long Xu
Haoliang Li
Andrii Zadaianchuk
Sam Kwong
Shiqi Wang
VGen
328
13
0
17 Feb 2025
Matrix3D: Large Photogrammetry Model All-in-One
Computer Vision and Pattern Recognition (CVPR), 2025
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
686
20
0
11 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
International Conference on Learning Representations (ICLR), 2025
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
400
37
0
28 Jan 2025
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Weikang Bian
Zhaoyang Huang
Xiaoyu Shi
Yijin Li
Fu-Yun Wang
Jiaming Song
3DGS
VGen
DiffM
344
30
0
05 Jan 2025
Edicho: Consistent Image Editing in the Wild
Qingyan Bai
Hao Ouyang
Yinghao Xu
Qiuyu Wang
Ceyuan Yang
Ka Leong Cheng
Yujun Shen
Qifeng Chen
DiffM
546
5
0
30 Dec 2024
Wonderland: Navigating 3D Scenes from a Single Image
Computer Vision and Pattern Recognition (CVPR), 2025
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
454
53
0
16 Dec 2024
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Rick Akkerman
Haiwen Feng
M. Black
Dimitrios Tzionas
Victoria Fernandez-Abrevaya
VGen
AI4CE
630
5
0
16 Dec 2024
GenLit: Reformulating Single-Image Relighting as Video Generation
Shrisha Bharadwaj
Haiwen Feng
Giorgio Becherini
Victoria Fernandez-Abrevaya
Michael J. Black
VGen
523
5
0
15 Dec 2024
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Computer Vision and Pattern Recognition (CVPR), 2025
Yushu Wu
Zhixing Zhang
Yanyu Li
Yanwu Xu
Vidit Goel
...
Ju Hu
Dimitris N. Metaxas
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VGen
DiffM
406
19
0
13 Dec 2024
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
Computer Vision and Pattern Recognition (CVPR), 2025
Hyojun Go
Byeongjun Park
Jiho Jang
Jin-Young Kim
Soonwoo Kwon
Changick Kim
3DGS
882
18
0
25 Nov 2024
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2025
Chenjie Cao
Chaohui Yu
Shang Liu
Fan Wang
Xiangyang Xue
Yanwei Fu
460
11
0
25 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2025
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
435
4
0
23 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
Fan Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
360
101
0
07 Nov 2024
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation
Neural Information Processing Systems (NeurIPS), 2024
Hao Phung
Quan Dao
T. Dao
Hoang Phan
Dimitris Metaxas
Anh Tran
Mamba
730
14
0
06 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
International Conference on Learning Representations (ICLR), 2025
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Zhiyong Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
371
41
0
04 Nov 2024