Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1707.04993
Cited By
v1
v2 (latest)
MoCoGAN: Decomposing Motion and Content for Video Generation
17 July 2017
Sergey Tulyakov
Ming-Yuan Liu
Xiaodong Yang
Jan Kautz
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MoCoGAN: Decomposing Motion and Content for Video Generation"
50 / 671 papers shown
Title
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Jingdong Sun
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
248
21
0
10 Jun 2024
GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications
S. Akhmedova
Nils Körber
GAN
MedIm
193
0
0
07 Jun 2024
SF-V: Single Forward Video Generation Model
Neural Information Processing Systems (NeurIPS), 2024
Zhixing Zhang
Yanyu Li
Yushu Wu
Yanwu Xu
Vidit Goel
...
Aliaksandr Siarohin
Junli Cao
Dimitris N. Metaxas
Sergey Tulyakov
Jian Ren
DiffM
VGen
211
23
0
06 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
87
1
0
05 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Fu Liu
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
211
41
0
03 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
278
72
0
03 Jun 2024
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
Zhengang Li
Yan Kang
Yuchen Liu
Difan Liu
Tobias Hinz
Feng Liu
Yanzhi Wang
DiffM
193
1
0
31 May 2024
Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling
Feiyang Cai
Chuchu Fan
Stanley Bak
445
9
0
28 May 2024
Controllable Longer Image Animation with Diffusion Models
Qiang Wang
Minghua Liu
Junjun Hu
Fan Jiang
Mu Xu
VGen
209
2
0
27 May 2024
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation
Shentong Mo
Yapeng Tian
Mamba
162
22
0
24 May 2024
Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 2024
Zhibo Chen
Heming Sun
Li Zhang
Fan Zhang
246
6
0
23 May 2024
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
Hritik Bansal
Yonatan Bitton
Michal Yarom
Idan Szpektor
Aditya Grover
Kai-Wei Chang
DiffM
359
21
0
07 May 2024
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
317
29
0
06 May 2024
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
341
27
0
05 May 2024
DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance
Linxuan Xin
Zheng Zhang
Jinfu Wei
Ge Li
Duan Gao
180
1
0
23 Apr 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
196
31
0
18 Apr 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
220
172
0
16 Apr 2024
Direct May Not Be the Best: An Incremental Evolution View of Pose Generation
Yuelong Li
Tengfei Xiao
Lei Geng
Jianming Wang
200
2
0
12 Apr 2024
Translation-based Video-to-Video Synthesis
Pratim Saha
Chengcui Zhang
DiffM
121
1
0
03 Apr 2024
MI-NeRF: Learning a Single Face NeRF from Multiple Identities
Aggelina Chatziagapi
Grigorios G. Chrysos
Dimitris Samaras
CVBM
224
4
0
29 Mar 2024
AnimateMe: 4D Facial Expressions via Diffusion Models
Dimitrios Gerogiannis
Foivos Paraperas-Papantoniou
Rolandos Alexandros Potamias
Alexandros Lattas
Stylianos Moschoglou
Stylianos Ploumpis
Stefanos Zafeiriou
225
9
0
25 Mar 2024
StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN
Jongwoo Choi
Kwanggyoon Seo
Amirsaman Ashtari
Junyong Noh
GAN
174
5
0
21 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen
DiffM
222
24
0
21 Mar 2024
Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail
Mingjin Chen
Junhao Chen
Xiaojun Ye
Huan-ang Gao
Xiaoxue Chen
Zhaoxin Fan
Hao Zhao
3DH
206
15
0
18 Mar 2024
Spatio-Temporal Fluid Dynamics Modeling via Physical-Awareness and Parameter Diffusion Guidance
Hao Wu
Fan Xu
Yifan Duan
Ziwei Niu
Weiyan Wang
Gaofeng Lu
Kun Wang
Yuxuan Liang
Yang Wang
DiffM
AI4CE
193
14
0
18 Mar 2024
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Yuta Oshima
Shohei Taniguchi
Masahiro Suzuki
Yutaka Matsuo
263
7
0
12 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
129
9
0
12 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
181
128
0
11 Mar 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
271
94
0
22 Feb 2024
RNDiff: Rainfall nowcasting with Condition Diffusion Model
Xudong Ling
Chaorong Li
Fengqing Qin
Peng Yang
Yuanyuan Huang
AI4Cl
DiffM
214
7
0
21 Feb 2024
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen
Chenfei Wu
Xiao Liu
Sheng-Siang Yin
Yixuan Pei
Jinglong Yang
Qifeng Chen
Nan Duan
Jianguo Zhang
214
9
0
16 Feb 2024
Extreme Video Compression with Pre-trained Diffusion Models
Bohan Li
Yiming Liu
Xueyan Niu
Bo Bai
Lei Deng
Deniz Gündüz
DiffM
VGen
138
5
0
14 Feb 2024
Modeling Spatio-temporal Dynamical Systems with Neural Discrete Learning and Levels-of-Experts
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024
Kun Wang
Hao Wu
Guibin Zhang
Cunchun Li
Yuxuan Liang
Yuankai Wu
Roger Zimmermann
Yang Wang
145
18
0
06 Feb 2024
Anything in Any Scene: Photorealistic Video Object Insertion
Chen Bai
Zeman Shao
Guoxiang Zhang
Di Liang
Jie Yang
...
Zhendong Wang
Yichen Guan
Xiaoyin Zheng
Tao Wang
Cheng Lu
DiffM
VGen
153
6
0
30 Jan 2024
Personality Perception in Human Videos Altered by Motion Transfer Networks
Computers & graphics (CG), 2024
Ayda Yurtoğlu
Sinan Sonlu
Yalim Dogan
U. Güdükbay
198
3
0
26 Jan 2024
DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations
International Conference on Learning Representations (ICLR), 2024
Dogyun Park
S. Kim
Sojin Lee
Hyunwoo J. Kim
DiffM
284
11
0
23 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
289
6
0
21 Jan 2024
ActAnywhere: Subject-Aware Video Background Generation
Boxiao Pan
Zhan Xu
Chun-Hao Paul Huang
Krishna Kumar Singh
Yang Zhou
Leonidas Guibas
Jimei Yang
VGen
DiffM
155
8
0
19 Jan 2024
Continuous Piecewise-Affine Based Motion Model for Image Animation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Hexiang Wang
Fengqi Liu
Qianyu Zhou
Ran Yi
Xin Tan
Lizhuang Ma
VGen
124
11
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
381
476
0
17 Jan 2024
RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks
International Conference on Information Photonics (ICIP), 2024
Partha Ghosh
Soubhik Sanyal
Cordelia Schmid
Bernhard Scholkopf
VGen
211
1
0
11 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
766
411
0
05 Jan 2024
GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Xuehao Gao
Yang Yang
Zhenyu Xie
Shaoyi Du
Zhongqian Sun
Yang Wu
DiffM
232
21
0
04 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
195
34
0
03 Jan 2024
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
392
14
0
31 Dec 2023
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei
Le Chen
Caiwen Ding
VGen
135
2
0
30 Dec 2023
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
DiffM
VGen
232
49
0
25 Dec 2023
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model
Zhenyu Xie
Yang Wu
Xuehao Gao
Zhongqian Sun
Wei Yang
Xiaodan Liang
DiffM
255
15
0
18 Dec 2023
T2M-HiFiGPT: Generating High Quality Human Motion from Textual Descriptions with Residual Discrete Representations
Congyi Wang
227
10
0
17 Dec 2023
VideoLCM: Video Latent Consistency Model
Xiang Wang
Shiwei Zhang
Han Zhang
Yu Liu
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
284
66
0
14 Dec 2023
Previous
1
2
3
4
5
6
...
12
13
14
Next