1707.04993
Cited By
v1
v2 (latest)
MoCoGAN: Decomposing Motion and Content for Video Generation
17 July 2017
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
Papers citing "MoCoGAN: Decomposing Motion and Content for Video Generation"
50 / 671 papers shown
Title
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
A. Davtyan
Paolo Favaro
VGen
211
7
0
06 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen
DiffM
118
18
0
05 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Neural Information Processing Systems (NeurIPS), 2023
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
414
445
0
03 Jun 2023
We never go out of Style: Motion Disentanglement by Subspace Decomposition of Latent Space
Rishubh Parihar
Raghav Magazine
P. Tiwari
R. Venkatesh Babu
DRL
170
1
0
01 Jun 2023
Sample and Predict Your Latent: Modality-free Sequential Disentanglement via Contrastive Estimation
International Conference on Machine Learning (ICML), 2023
Ilana D Naiman
Nimrod Berman
Omri Azencot
DRL
247
12
0
25 May 2023
DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
IEEE Access, 2023
Tae-Jung Yeom
Minhyeok Lee
DiffM
179
9
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
221
12
0
24 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
International Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
186
95
0
22 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
International Journal of Computer Vision (IJCV), 2023
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffM
VGen
508
151
0
18 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yuan Liu
Yogesh Balaji
DiffM
VGen
273
295
0
17 May 2023
LEO: Generative Latent Image Animator for Human Video Synthesis
International Journal of Computer Vision (IJCV), 2023
Yaohui Wang
Xin Ma
Xinyuan Chen
A. Dantcheva
Bo Dai
Yu Qiao
DiffM
462
43
0
06 May 2023
Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Computer Vision and Image Understanding (CVIU), 2023
Théophile Cabannes
Shreya Ghosh
Raphaël Marinier
Tom Gedeon
Alexandre M. Bayen
Munawar Hayat
282
47
0
03 May 2023
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Computer Vision and Pattern Recognition (CVPR), 2023
Sumith Kulal
Tim Brooks
A. Aiken
Jiajun Wu
Jimei Yang
Jingwan Lu
Alexei A. Efros
Krishna Kumar Singh
DiffM
160
56
0
27 Apr 2023
Motion-Conditioned Diffusion Model for Controllable Video Synthesis
Tsai-Shien Chen
C. Lin
Hung-Yu Tseng
Nayeon Lee
Ming-Hsuan Yang
DiffM
VGen
327
85
0
27 Apr 2023
LaMD: Latent Motion Diffusion for Image-Conditional Video Generation
International Journal of Computer Vision (IJCV), 2023
Yaosi Hu
Zhenzhong Chen
Chong Luo
DiffM
VGen
152
13
0
23 Apr 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Pattern Recognition (Pattern Recogn.), 2023
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
146
13
0
18 Apr 2023
Text2Performer: Text-Driven Human Video Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuming Jiang
Shuai Yang
Tong Liang Koh
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
VGen
205
66
0
17 Apr 2023
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction Domain
Applied Soft Computing (Appl. Soft Comput.), 2023
Zhifeng Ma
Hao Zhang
Jie Liu
356
10
0
16 Apr 2023
Video Generation Beyond a Single Clip
Hsin-Ping Huang
Yu-Chuan Su
Ming-Hsuan Yang
VLM
DiffM
VGen
255
3
0
15 Apr 2023
VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs
IEEE International Conference on Computer Vision (ICCV), 2023
Moayed Haji-Ali
Andrew Bond
Tolga Birdal
Duygu Ceylan
Levent Karacan
Erkut Erdem
Aykut Erdem
VGen
DiffM
415
2
0
12 Apr 2023
MoStGAN-V: Video Generation with Temporal Motion Styles
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaoqian Shen
Xiang Li
Mohamed Elhoseiny
VGen
145
39
0
05 Apr 2023
TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration
IEEE International Conference on Computer Vision (ICCV), 2023
Kehong Gong
Dongze Lian
Heng Chang
Chuan Guo
Zihang Jiang
Wei Ji
Michael Bi Mi
Xinchao Wang
291
82
0
05 Apr 2023
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
IEEE International Conference on Computer Vision (ICCV), 2023
Mingyuan Zhang
Xinying Guo
Liang Pan
Zhongang Cai
Fangzhou Hong
Huirong Li
Lei Yang
Ziwei Liu
DiffM
VGen
242
242
0
03 Apr 2023
Multifactor Sequential Disentanglement via Structured Koopman Autoencoders
International Conference on Learning Representations (ICLR), 2023
Nimrod Berman
Ilana D Naiman
Omri Azencot
CoGe
162
27
0
30 Mar 2023
4D Facial Expression Diffusion Model
K. Zou
S. Faisan
Boyang Yu
S. Valette
Hyewon Seo
203
19
0
29 Mar 2023
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation
IEEE Transactions on Multimedia (IEEE TMM), 2023
Jiawei Liu
Weining Wang
Sihan Chen
Xinxin Zhu
Qingbin Liu
DiffM
VGen
130
18
0
29 Mar 2023
Information-Theoretic GAN Compression with Variational Energy-based Model
Neural Information Processing Systems (NeurIPS), 2023
Minsoo Kang
Hyewon Yoo
Eunhee Kang
Sehwan Ki
Hyong-Euk Lee
Bohyung Han
GAN
177
4
0
28 Mar 2023
Seer: Language Instructed Video Prediction with Latent Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Xianfan Gu
Chuan Wen
Weirui Ye
Jiaming Song
Yang Gao
DiffM
VGen
222
51
0
27 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
111
1
0
24 Mar 2023
Persistent Nature: A Generative Model of Unbounded 3D Worlds
Computer Vision and Pattern Recognition (CVPR), 2023
Lucy Chai
Richard Tucker
Zhengqi Li
Phillip Isola
Noah Snavely
VGen
199
44
0
23 Mar 2023
Pix2Video: Video Editing using Image Diffusion
IEEE International Conference on Computer Vision (ICCV), 2023
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffM
VGen
362
329
0
22 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
185
177
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
271
193
0
21 Mar 2023
CoopInit: Initializing Generative Adversarial Networks via Cooperative Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yang Zhao
Jianwen Xie
Ping Li
GAN
275
3
0
21 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
193
6
0
20 Mar 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhengxiong Luo
Dayou Chen
Yingya Zhang
Yan Huang
Liangsheng Wang
Yujun Shen
Deli Zhao
Jingren Zhou
Tien-Ping Tan
DiffM
VGen
485
384
0
15 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model
Wendong Zhang
Geng Chen
Siyu Gao
Yunbo Wang
Xiaokang Yang
CLL
277
3
0
12 Mar 2023
Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE
Yucheng Xu
Nanbo Li
A. Goel
Zijian Guo
Zonghai Yao
Hamidreza Kasaei
Mohammad-Sajad Kasaei
Zhibin Li
186
5
0
09 Mar 2023
Pedestrian Attribute Editing for Gait Recognition and Anonymization
Jingzhe Ma
Dingqiang Ye
Chao Fan
Shiqi Yu
CVBM
177
7
0
09 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Computer Vision and Pattern Recognition (CVPR), 2023
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
328
300
0
08 Mar 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
Computer Vision and Pattern Recognition (CVPR), 2023
M. Sun
Weining Wang
Xinxin Zhu
Jing Liu
210
17
0
07 Mar 2023
MotionVideoGAN: A Novel Video Generator Based on the Motion Space Learned from Image Pairs
IEEE Transactions on Multimedia (IEEE TMM), 2023
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
VGen
126
7
0
06 Mar 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
313
5
0
26 Feb 2023
Video Probabilistic Diffusion Models in Projected Latent Space
Computer Vision and Pattern Recognition (CVPR), 2023
Sihyun Yu
Kihyuk Sohn
Subin Kim
Jinwoo Shin
VGen
DiffM
237
200
0
15 Feb 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM
VGen
363
655
0
06 Feb 2023
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
International Conference on Learning Representations (ICLR), 2023
Yuhta Takida
Masaaki Imaizumi
Takashi Shibuya
Chieh-Hsin Lai
Toshimitsu Uesaka
Naoki Murata
Yuki Mitsufuji
GAN
420
23
0
30 Jan 2023
Audio2Gestures: Generating Diverse Gestures from Audio
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Jing Li
Di Kang
Wenjie Pei
Xuefei Zhe
Ying Zhang
Linchao Bao
Zhenyu He
DiffM
SLR
197
9
0
17 Jan 2023
T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations
Computer Vision and Pattern Recognition (CVPR), 2023
Jianrong Zhang
Yangsong Zhang
Xiaodong Cun
Shaoli Huang
Yong Zhang
Hongwei Zhao
Hongtao Lu
Xiaodong Shen
351
500
0
15 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Image and Vision Computing (IVC), 2023
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Zięba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
235
39
0
10 Jan 2023
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Michał Stypułkowski
Konstantinos Vougioukas
Sen He
Maciej Zięba
Stavros Petridis
Maja Pantic
DiffM
226
177
0
06 Jan 2023