ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.18613
  4. Cited By
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
v1v2 (latest)

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2024
27 November 2024
Rundi Wu
Ruiqi Gao
Ben Poole
Alex Trevithick
Changxi Zheng
Jonathan T. Barron
Aleksander Holyñski
    VGen
ArXiv (abs)PDFHTMLHuggingFace (58 upvotes)Github

Papers citing "CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models"

50 / 62 papers shown
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
Yanran Zhang
Ziyi Wang
Wenzhao Zheng
Zheng Zhu
Jie Zhou
Jiwen Lu
VGen3DV
311
1
0
04 Dec 2025
YingVideo-MV: Music-Driven Multi-Stage Video Generation
YingVideo-MV: Music-Driven Multi-Stage Video Generation
Jiahui Chen
Weida Wang
Runhua Shi
Huan Yang
Chaofan Ding
Zihao Chen
DiffMVGen
296
0
0
02 Dec 2025
ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling
ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling
Qisen Wang
Yifan Zhao
Peisen Shen
Jialu Li
Jia Li
3DGSVGen
214
1
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffMVGen3DPC
352
4
0
01 Dec 2025
RemedyGS: Defend 3D Gaussian Splatting against Computation Cost Attacks
RemedyGS: Defend 3D Gaussian Splatting against Computation Cost Attacks
Y. Li
Zhening Liu
Zijian Li
Zehong Lin
Jun Zhang
3DGS
83
1
0
27 Nov 2025
ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
Byeongjun Park
Byung-Hoon Kim
Hyungjin Chung
Jong Chul Ye
VGen
290
1
0
25 Nov 2025
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
Zhenxing Mi
Yuxin Wang
Dan Xu
VGen
212
1
0
24 Nov 2025
Generative Photographic Control for Scene-Consistent Video Cinematic Editing
Generative Photographic Control for Scene-Consistent Video Cinematic Editing
Huiqiang Sun
Liao Shen
Zhan Peng
Kun Wang
Size Wu
...
Z. Huang
Xingyu Zeng
Zhiguo Cao
Wei Li
Chen Change Loy
DiffMVGen
233
0
0
17 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
214
2
0
10 Nov 2025
Gait Recognition via Collaborating Discriminative and Generative Diffusion Models
Gait Recognition via Collaborating Discriminative and Generative Diffusion Models
H. Xiong
Bin Feng
Bang Wang
Xinggang Wang
Wenyu Liu
DiffM
217
0
0
09 Nov 2025
MotionStream: Real-Time Video Generation with Interactive Motion Controls
MotionStream: Real-Time Video Generation with Interactive Motion Controls
Joonghyuk Shin
Zhengqi Li
Richard Zhang
Jun-Yan Zhu
Jaesik Park
Eli Schechtman
Xun Huang
DiffMVGen
480
31
0
03 Nov 2025
DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
Yaokun Li
Lihe Ding
Xiao Chen
Guang Tan
Tianfan Xue
174
0
0
25 Oct 2025
From Volume Rendering to 3D Gaussian Splatting: Theory and Applications
From Volume Rendering to 3D Gaussian Splatting: Theory and Applications
Vitor Matias
Daniel Perazzo
V. Silva
Alberto Raposo
Luiz Velho
Afonso Paiva
Tiago Novello
3DGS
273
1
0
20 Oct 2025
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Jie-Ying Lee
Yi-Ruei Liu
Shr-Ruei Tsai
Wei-Cheng Chang
Chung-Ho Wu
Jiewen Chan
Zhenjun Zhao
Chieh Hubert Lin
Yu-Lun Liu
3DGS
370
10
0
17 Oct 2025
iMoWM: Taming Interactive Multi-Modal World Model for Robotic Manipulation
iMoWM: Taming Interactive Multi-Modal World Model for Robotic Manipulation
Chuanrui Zhang
Zhengxian Wu
Guanxing Lu
Yansong Tang
Ziwei Wang
VGen
153
2
0
10 Oct 2025
A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features
A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features
Axel Barroso-Laguna
Tommaso Cavallari
V. Prisacariu
Eric Brachmann
220
0
0
01 Oct 2025
UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
Guanjun Wu
Jiemin Fang
Chen Yang
Sikuang Li
Taoran Yi
...
Xiaopeng Zhang
Wei Wei
Wenyu Liu
Xinggang Wang
Qi Tian
226
6
0
29 Sep 2025
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
Chen Wang
Chuhao Chen
Yiming Huang
Zhiyang Dou
Yuan Liu
Jiatao Gu
Lingjie Liu
DiffMVGenPINN
725
20
0
24 Sep 2025
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
Sherwin Bahmani
Tianchang Shen
Jiawei Ren
Jiahui Huang
Yifeng Jiang
...
Zan Gojcic
Sanja Fidler
Huan Ling
Jun Gao
Xuanchi Ren
VGen
190
11
0
23 Sep 2025
T2Bs: Text-to-Character Blendshapes via Video Generation
T2Bs: Text-to-Character Blendshapes via Video Generation
Jiahao Luo
Chaoyang Wang
Michael Vasilkovsky
V. Shakhrai
Di Liu
...
Sergey Tulyakov
Peter Wonka
Hsin-Ying Lee
James Davis
Jian Wang
DiffM
267
1
0
12 Sep 2025
Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data
Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data
Nithin Gopalakrishnan Nair
Srinivas Kaza
Xuan Luo
Vishal M. Patel
Stephen Lombardi
Jungyeon Park
126
0
0
08 Sep 2025
CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis
CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis
Xin Kong
Daniel Watson
Yannick Strümpler
Michael Niemeyer
F. Tombari
DiffM
181
3
0
08 Sep 2025
LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding
LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding
Julian Ost
Andrea Ramazzina
Amogh Joshi
Maximilian Bömer
Mario Bijelic
Felix Heide
3DV
263
6
0
26 Aug 2025
Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing
Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing
Feng-Lin Liu
Shi-Yang Li
Yan-Pei Cao
Hongbo Fu
Lin Gao
DiffMVGen
259
1
0
19 Aug 2025
4DNeX: Feed-Forward 4D Generative Modeling Made Easy
4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Zhaoxi Chen
Tianqi Liu
Long Zhuo
Jiawei Ren
Zeng Tao
He Zhu
Fangzhou Hong
Liang Pan
Ziwei Liu
VGen
213
14
0
18 Aug 2025
ViPE: Video Pose Engine for 3D Geometric Perception
ViPE: Video Pose Engine for 3D Geometric Perception
Jiahui Huang
Qunjie Zhou
Hesam Rabeti
Aleksandr Korovko
Huan Ling
...
Jiawei Ren
Kevin Xie
Joydeep Biswas
Laura Leal-Taixe
Sanja Fidler
288
66
0
12 Aug 2025
Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation
Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation
Xiaoyan Liu
Kangrui Li
Yuehao Song
Jiaxin Liu
VGen3DGS
196
0
0
11 Aug 2025
Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation
Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation
Xunzhi Xiang
Y. Chen
Guiyu Zhang
Zhongyu Wang
Zhe Gao
...
Haibin Huang
Yang Gao
C. Zhang
Qi Fan
Xuelong Li
DiffMVGen
274
9
0
05 Aug 2025
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
Jiahui Zhang
Yuelei Li
Anpei Chen
Muyu Xu
Kunhao Liu
...
Hanspeter Pfister
Paul Liang
Shijian Lu
Fangneng Zhan
Fangneng Zhan
789
17
0
19 Jul 2025
Cameras as Relative Positional Encoding
Cameras as Relative Positional Encoding
Ruilong Li
Brent Yi
Junchen Liu
Hang Gao
Yi Ma
Angjoo Kanazawa
ViT
313
39
0
14 Jul 2025
Voyaging into Perpetual Dynamic Scenes from a Single View
Voyaging into Perpetual Dynamic Scenes from a Single View
Fengrui Tian
Tianjiao Ding
Jinqi Luo
Hancheng Min
René Vidal
VGen
318
0
0
05 Jul 2025
Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy
Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy
Yuhao Liu
Tengfei Wang
Fang Liu
Zhenwei Wang
Rynson W. H. Lau
DiffMVGen
238
6
0
27 Jun 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
428
17
0
20 Jun 2025
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Donghoon Ahn
Jiwon Kang
Sanghyun Lee
Minjae Kim
Jaewon Min
Wooseok Jang
Saungwu Lee
Sayak Paul
S. Hong
Seungryong Kim
DiffMAAML
532
0
0
12 Jun 2025
4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos
4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos
Zhen Xu
Zhengqin Li
Zhao Dong
Xiaowei Zhou
Richard Newcombe
Zhaoyang Lv
3DGSViT
297
25
0
09 Jun 2025
Restereo: Diffusion stereo video generation and restoration
Restereo: Diffusion stereo video generation and restoration
Xingchang Huang
Ashish Kumar Singh
Florian Dubost
C. N. Vasconcelos
Sakar Khattar
Liang Shi
Christian Theobalt
Steven Chacko
Gurprit Singh
DiffMVGen
344
3
0
06 Jun 2025
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
Zizhang Li
Hong-Xing Yu
Wei Liu
Yin Yang
Charles Herrmann
Gordon Wetzstein
Jiajun Wu
VGen
329
23
0
23 May 2025
M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion
M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion
Nina Shvetsova
Goutam Bhat
Prune Truong
Hilde Kuehne
Federico Tombari
DiffMVGenMDE
366
5
0
22 May 2025
SOAP: Style-Omniscient Animatable Portraits
SOAP: Style-Omniscient Animatable Portraits
Tingting Liao
Yujian Zheng
Adilbek Karmanov
Liwen Hu
Leyang Jin
Yuliang Xiu
Hao Li
DiffM
1.1K
7
0
08 May 2025
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Jiaxin Huang
Sheng Miao
BangBnag Yang
Yuewen Ma
Yiyi Liao
VGenMDE
769
6
0
15 Apr 2025
GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting
GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting
Junlin Hao
Peiheng Wang
Haoyang Wang
Xinggong Zhang
Xinggong Zhang
3DGSVGen
743
3
0
14 Apr 2025
In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
Sauradip Nag
Daniel Cohen-Or
Hao Zhang
Ali Mahdavi-Amiri
DiffMVGen
525
7
0
11 Apr 2025
OmniCam: Unified Multimodal Video Generation via Camera Control
OmniCam: Unified Multimodal Video Generation via Camera Control
Xiaoda Yang
Jiayang Xu
Kaixuan Luan
Xinyu Zhan
Hongshun Qiu
...
Shuai Yang
Li Zhang
Checheng Yu
Cewu Lu
Lixin Yang
DiffMVGen
329
7
0
03 Apr 2025
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Jangho Park
Taesung Kwon
Jong Chul Ye
VGen
552
11
0
28 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffMVGen
569
72
0
24 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGSVGen
704
37
0
20 Mar 2025
Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models
Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models
Marc Benedí San Millán
Angela Dai
Matthias Nießner
DiffM
358
3
0
20 Mar 2025
Advances in 4D Generation: A Survey
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Ping Liu
Yawei Luo
628
2
0
18 Mar 2025
Bolt3D: Generating 3D Scenes in Seconds
Bolt3D: Generating 3D Scenes in Seconds
Stanislaw Szymanowicz
Jason Y. Zhang
P. Srinivasan
Ruiqi Gao
Arthur Brussee
Aleksander Holynski
Ricardo Martín Brualla
Jonathan T. Barron
Philipp Henzler
546
45
0
18 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGenLLMSV
511
7
0
15 Mar 2025
12
Next
Page 1 of 2