Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1707.04993
Cited By
v1
v2 (latest)
MoCoGAN: Decomposing Motion and Content for Video Generation
17 July 2017
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MoCoGAN: Decomposing Motion and Content for Video Generation"
50 / 673 papers shown
Title
Glad: A Streaming Scene Generator for Autonomous Driving
International Conference on Learning Representations (ICLR), 2025
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGS
VGen
271
11
0
26 Feb 2025
TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Menghao Li
Zhenghao Zhang
Junchao Liao
Long Qin
Weizhi Wang
DiffM
VGen
199
1
0
26 Feb 2025
ASurvey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
Junlin Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVM
VGen
AI4TS
234
0
0
25 Feb 2025
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni
Elias Ramzi
Victor Besnier
Shashanka Venkataramanan
Tuan-Hung Vu
...
Mickael Chen
Éloi Zablocki
Andrei Bursuc
Eduardo Valle
Matthieu Cord
VGen
288
11
0
24 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
633
5
0
18 Feb 2025
TextOCVP: Object-Centric Video Prediction with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
VGen
OCL
DiffM
464
1
0
17 Feb 2025
UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation
Wenzhang Sun
Qirui Hou
Donglin Di
Jiahui Yang
Yongjia Ma
Jianxun Cui
DiffM
VGen
299
6
0
06 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
232
7
0
02 Feb 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Deyu Zhou
Quan Sun
Yuang Peng
Kun Yan
Runpei Dong
...
Zheng Ge
Nan Duan
Xiangyu Zhang
L. Ni
H. Shum
VGen
337
19
0
21 Jan 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Yuanyang Yin
Yaqi Zhao
Mingwu Zheng
Ke Lin
Jiarong Ou
...
Pengfei Wan
Di Zhang
Baoqun Yin
Wentao Zhang
Kun Gai
369
9
0
03 Jan 2025
DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network
Xiangtian Li
Xiaobo Wang
Zhen Qi
Han Cao
Zhaoyang Zhang
Ao Xiang
GAN
TTA
220
13
0
22 Dec 2024
Can Generative Video Models Help Pose Estimation?
Computer Vision and Pattern Recognition (CVPR), 2024
Ruojin Cai
Jason Y. Zhang
Philipp Henzler
Zhengqi Li
Noah Snavely
Ricardo Martín Brualla
VGen
182
6
0
20 Dec 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xuelong Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGen
DiffM
343
2
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
811
7
0
14 Dec 2024
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGen
DiffM
508
11
0
10 Dec 2024
Navigation World Models
Computer Vision and Pattern Recognition (CVPR), 2024
Amir Bar
G. Zhou
Danny Tran
Trevor Darrell
Yann LeCun
VGen
EgoV
498
126
0
04 Dec 2024
Motion Dreamer: Boundary Conditional Motion Reasoning for Physically Coherent Video Generation
Tianshuo Xu
Zhifei Chen
Leyi Wu
Hao Lu
Yuying Chen
Lihui Jiang
Bingbing Liu
Yingcong Chen
VGen
324
2
0
30 Nov 2024
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Florinel-Alin Croitoru
Andrei Iulian Hiji
Vlad Hondru
Nicolae-Cătălin Ristea
Paul Irofti
Marius Popescu
Cristian Rusu
Radu Tudor Ionescu
Fahad Shahbaz Khan
Mubarak Shah
377
15
0
29 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Yu Xie
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
210
18
0
17 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
354
3
0
12 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
450
38
0
08 Nov 2024
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Yue Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
249
5
0
05 Nov 2024
Video to Video Generative Adversarial Network for Few-shot Learning Based on Policy Gradient
Yintai Ma
Diego Klabjan
J. Utke
VGen
GAN
141
3
0
28 Oct 2024
Unsupervised Representation Learning from Sparse Transformation Analysis
Yue Song
Thomas Anderson Keller
Yisong Yue
Pietro Perona
Max Welling
DRL
263
2
0
07 Oct 2024
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach
Yaofang Liu
Y. Ren
Xiaodong Cun
Aitor Artola
Yang Liu
Tieyong Zeng
Raymond H. Chan
Jean-Michel Morel
VGen
DiffM
244
8
0
04 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
332
65
0
03 Oct 2024
COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation
Mingzhen Sun
Weining Wang
Xinxin Zhu
Jing Liu
VGen
DiffM
150
0
0
02 Oct 2024
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
AAAI Conference on Artificial Intelligence (AAAI), 2024
Lingling Cai
Kang Zhao
Hangjie Yuan
Yingya Zhang
Shiwei Zhang
Kejie Huang
VGen
124
2
0
30 Sep 2024
MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling
Neural Information Processing Systems (NeurIPS), 2024
Weihao Yuan
Weichao Shen
Yisheng He
Yuan Dong
Xiaodong Gu
Zilong Dong
Liefeng Bo
Qixing Huang
MQ
261
18
0
26 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
219
0
0
21 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
IEEE Access (IEEE Access), 2024
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen
3DV
231
2
0
14 Sep 2024
SVS-GAN: Leveraging GANs for Semantic Video Synthesis
Khaled M. Seyam
Julian Wiederer
Markus Braun
Bin Yang
147
0
0
09 Sep 2024
Latent Space Energy-based Neural ODEs
Sheng Cheng
Deqian Kong
Jianwen Xie
Kookjin Lee
Ying Nian Wu
Yezhou Yang
DiffM
786
4
0
05 Sep 2024
AMG: Avatar Motion Guided Video Generation
Zhangsihao Yang
Mengyi Shan
Mohammad Farazi
Wenhui Zhu
Yanxi Chen
Xuanzhao Dong
Yalin Wang
VGen
DiffM
264
1
0
02 Sep 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
International Conference on Learning Representations (ICLR), 2024
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM
VGen
819
1,229
0
12 Aug 2024
Sequential Representation Learning via Static-Dynamic Conditional Disentanglement
European Conference on Computer Vision (ECCV), 2024
Mathieu Cyrille Simon
Pascal Frossard
Christophe De Vleeschouwer
CoGe
CML
191
4
0
10 Aug 2024
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seung-geun Chi
Hyung-Gun Chi
Hengbo Ma
Nakul Agarwal
Faizan Siddiqui
Karthik Ramani
Kwonjoon Lee
DiffM
308
17
0
19 Jul 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang
Haoxin Chen
Yong Zhang
Menghan Xia
Xiaodong Cun
Zhixun Su
Ying Shan
DiffM
187
3
0
14 Jul 2024
Graph Transformers: A Survey
Ahsan Shehzad
Xiwei Xu
Shagufta Abid
Ciyuan Peng
Shuo Yu
Dongyu Zhang
Karin Verspoor
AI4CE
366
36
0
13 Jul 2024
Bora: Biomedical Generalist Video Generation Model
Weixiang Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Xiang Li
Lifang He
Quanzheng Li
Lichao Sun
VGen
MedIm
199
13
0
12 Jul 2024
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Wentao Lei
Jinting Wang
Fengji Ma
Guanjie Huang
Li Liu
VGen
EGVM
257
16
0
11 Jul 2024
Guiding Video Prediction with Explicit Procedural Knowledge
Patrick Takenaka
Johannes Maucher
Marco F. Huber
204
2
0
26 Jun 2024
Sequential Disentanglement by Extracting Static Information From A Single Sequence Element
Nimrod Berman
Ilan Naiman
Idan Arbiv
Gal Fadlon
Omri Azencot
CoGe
253
9
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
293
32
0
26 Jun 2024
Do As I Do: Pose Guided Human Motion Copy
Sifan Wu
Zhenguang Liu
Beibei Zhang
Roger Zimmermann
Zhongjie Ba
Xiaosong Zhang
Kui Ren
196
15
0
24 Jun 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
145
0
0
23 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Neural Information Processing Systems (NeurIPS), 2024
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
385
7
0
19 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
301
5
0
15 Jun 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang
Yi Jiang
Zehuan Yuan
Binyue Peng
Zuxuan Wu
Yu-Gang Jiang
ViT
VGen
271
78
0
13 Jun 2024
FacEnhance: Facial Expression Enhancing with Recurrent DDPMs
Hamza Bouzid
Lahoucine Ballihi
DiffM
228
1
0
13 Jun 2024
Previous
1
2
3
4
5
...
12
13
14
Next