ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
Controllable Radiance Fields for Dynamic Face Synthesis
Controllable Radiance Fields for Dynamic Face SynthesisInternational Conference on 3D Vision (3DV), 2022
Peiye Zhuang
Liqian Ma
Oluwasanmi Koyejo
Alex Schwing
CVBM3DH
192
13
0
11 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual
  Description
Phenaki: Variable Length Video Generation From Open Domain Textual DescriptionInternational Conference on Learning Representations (ICLR), 2022
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffMVGen
364
486
0
05 Oct 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image
  Generator
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image GeneratorInternational Conference on Information Photonics (ICIP), 2022
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
239
33
0
15 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid
  generalization
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
158
1
0
07 Sep 2022
Modelling Latent Dynamics of StyleGAN using Neural ODEs
Modelling Latent Dynamics of StyleGAN using Neural ODEs
Weihao Xia
Yujiu Yang
Jing-Hao Xue
168
0
0
23 Aug 2022
FaceOff: A Video-to-Video Face Swapping System
FaceOff: A Video-to-Video Face Swapping SystemIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Aditya Agarwal
Bipasha Sen
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
PICVCVBM
187
4
0
21 Aug 2022
StyleFaceV: Face Video Generation via Decomposing and Recomposing
  Pretrained StyleGAN3
StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3
Haonan Qiu
Yuming Jiang
Hang Zhou
Wayne Wu
Ziwei Liu
CVBM
191
13
0
16 Aug 2022
Free-HeadGAN: Neural Talking Head Synthesis with Explicit Gaze Control
Free-HeadGAN: Neural Talking Head Synthesis with Explicit Gaze ControlIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
M. Doukas
Evangelos Ververas
V. Sharmanska
Stefanos Zafeiriou
CVBM
202
23
0
03 Aug 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene GenerationNeural Information Processing Systems (NeurIPS), 2022
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa3DGS
253
155
0
27 Jul 2022
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
CelebV-HQ: A Large-Scale Video Facial Attributes DatasetEuropean Conference on Computer Vision (ECCV), 2022
Haoning Zhu
Wayne Wu
Wentao Zhu
Liming Jiang
Siwei Tang
Li Zhang
Ziwei Liu
Chen Change Loy
518
259
0
25 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for
  Infinite Visual Synthesis
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual SynthesisNeural Information Processing Systems (NeurIPS), 2022
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
223
95
0
20 Jul 2022
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video SynthesisEuropean Conference on Computer Vision (ECCV), 2022
Long Zhuo
Guangcong Wang
Shikai Li
Wayne Wu
Ziwei Liu
VGen
207
23
0
11 Jul 2022
A Probabilistic Model Of Interaction Dynamics for Dyadic Face-to-Face
  Settings
A Probabilistic Model Of Interaction Dynamics for Dyadic Face-to-Face Settings
Renke Wang
Ifeoma Nwogu
CVBM
137
0
0
10 Jul 2022
Interaction Transformer for Human Reaction Generation
Interaction Transformer for Human Reaction GenerationIEEE transactions on multimedia (IEEE TMM), 2022
Baptiste Chopin
Hao Tang
N. Otberdout
Mohamed Daoudi
Andrii Zadaianchuk
ViT
220
46
0
04 Jul 2022
Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature
  Decoupling
Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature DecouplingInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Jiamin Liang
Xin Yang
Yuhao Huang
Kai Liu
Xinrui Zhou
...
Zehui Lin
H. Luo
Yuanji Zhang
Yi Xiong
Dong Ni
99
7
0
01 Jul 2022
3D-Aware Video Generation
3D-Aware Video Generation
Sherwin Bahmani
Jeong Joon Park
Despoina Paschalidou
Hao Tang
Gordon Wetzstein
Leonidas Guibas
Luc Van Gool
Radu Timofte
421
24
0
29 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video PredictionInternational Conference on Learning Representations (ICLR), 2022
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
360
137
0
23 Jun 2022
Diffusion Models for Video Prediction and Infilling
Diffusion Models for Video Prediction and Infilling
Tobias Höppe
Arash Mehrjou
Stefan Bauer
Didrik Nielsen
Andrea Dittadi
DiffMVGen
312
157
0
15 Jun 2022
Generating Long Videos of Dynamic Scenes
Generating Long Videos of Dynamic ScenesNeural Information Processing Systems (NeurIPS), 2022
Tim Brooks
Janne Hellsten
M. Aittala
Ting-Chun Wang
Timo Aila
J. Lehtinen
Xuan Li
Alexei A. Efros
Tero Karras
SyDa
284
130
0
07 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Cascaded Video Generation for Videos In-the-WildInternational Conference on Pattern Recognition (ICPR), 2022
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
191
0
0
01 Jun 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via TransformersInternational Conference on Learning Representations (ICLR), 2022
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
716
901
0
29 May 2022
Flexible Diffusion Modeling of Long Videos
Flexible Diffusion Modeling of Long VideosNeural Information Processing Systems (NeurIPS), 2022
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian D. Weilbach
Frank Wood
DiffMBDLVGen
523
347
0
23 May 2022
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and
  Interpolation
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Vikram S. Voleti
Alexia Jolicoeur-Martineau
Christopher Pal
DiffMVGen
426
378
0
19 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
STAU: A SpatioTemporal-Aware Unit for Video Prediction and BeyondIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
247
4
0
20 Apr 2022
Sound-Guided Semantic Video Generation
Sound-Guided Semantic Video GenerationEuropean Conference on Computer Vision (ECCV), 2022
Seung Hyun Lee
Gyeongrok Oh
Wonmin Byeon
Chanyoung Kim
Wonjae Ryoo
Sang Ho Yoon
Hyunjun Cho
Jihyun Bae
Jinkyu Kim
Sangpil Kim
VGen
331
42
0
20 Apr 2022
Controllable Video Generation through Global and Local Motion Dynamics
Controllable Video Generation through Global and Local Motion DynamicsEuropean Conference on Computer Vision (ECCV), 2022
A. Davtyan
Paolo Favaro
135
9
0
13 Apr 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive
  Transformer
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive TransformerEuropean Conference on Computer Vision (ECCV), 2022
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
531
271
0
07 Apr 2022
Video Diffusion Models
Video Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffMVGen
854
2,218
0
07 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and
  Uncertainty
FoV-Net: Field-of-View Extrapolation Using Self-Attention and UncertaintyIEEE Robotics and Automation Letters (RA-L), 2021
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
195
7
0
04 Apr 2022
Expressive Talking Head Video Encoding in StyleGAN2 Latent-Space
Expressive Talking Head Video Encoding in StyleGAN2 Latent-Space
Trevine Oorloff
Yaser Yacoob
VGen
206
5
0
28 Mar 2022
Stochastic Video Prediction with Structure and Motion
Stochastic Video Prediction with Structure and Motion
Adil Kaan Akan
Sadra Safadoust
Fatma Guney
VGen
179
10
0
20 Mar 2022
Transframer: Arbitrary Frame Prediction with Generative Models
Transframer: Arbitrary Frame Prediction with Generative Models
C. Nash
João Carreira
Jacob Walker
Iain Barr
Andrew Jaegle
Mateusz Malinowski
Peter W. Battaglia
ViT
277
44
0
17 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffMVGen
597
314
0
16 Mar 2022
Show Me What and Tell Me How: Video Synthesis via Multimodal
  Conditioning
Show Me What and Tell Me How: Video Synthesis via Multimodal ConditioningComputer Vision and Pattern Recognition (CVPR), 2022
Ligong Han
Jian Ren
Hsin-Ying Lee
Francesco Barbieri
Kyle Olszewski
Shervin Minaee
Dimitris N. Metaxas
Sergey Tulyakov
DiffMVGen
226
46
0
04 Mar 2022
Playable Environments: Video Manipulation in Space and Time
Playable Environments: Video Manipulation in Space and TimeComputer Vision and Pattern Recognition (CVPR), 2022
Willi Menapace
Stéphane Lathuilière
Aliaksandr Siarohin
Christian Theobalt
Sergey Tulyakov
Vladislav Golyanik
Elisa Ricci
VGen
270
28
0
03 Mar 2022
Inkorrect: Online Handwriting Spelling Correction
Inkorrect: Online Handwriting Spelling Correction
Andrii Maksai
H. Rowley
Jesse Berent
C. Musat
131
3
0
28 Feb 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial
  Networks
Generating Videos with Dynamics-aware Implicit Generative Adversarial NetworksInternational Conference on Learning Representations (ICLR), 2022
Sihyun Yu
Jihoon Tack
Sangwoo Mo
Hyunsu Kim
Junho Kim
Jung-Woo Ha
Jinwoo Shin
DiffMVGen
248
220
0
21 Feb 2022
Finding Directions in GAN's Latent Space for Neural Face Reenactment
Finding Directions in GAN's Latent Space for Neural Face ReenactmentBritish Machine Vision Conference (BMVC), 2022
Stella Bounareli
Vasileios Argyriou
Georgios Tzimiropoulos
3DHCVBM
379
40
0
31 Jan 2022
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality
  and Perks of StyleGAN2
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2Computer Vision and Pattern Recognition (CVPR), 2021
Ivan Skorokhodov
Sergey Tulyakov
Mohamed Elhoseiny
VGen
419
341
0
29 Dec 2021
Controllable Animation of Fluid Elements in Still Images
Controllable Animation of Fluid Elements in Still ImagesComputer Vision and Pattern Recognition (CVPR), 2021
Aniruddha Mahapatra
K. Kulkarni
VGen
419
54
0
06 Dec 2021
Make It Move: Controllable Image-to-Video Generation with Text
  Descriptions
Make It Move: Controllable Image-to-Video Generation with Text DescriptionsComputer Vision and Pattern Recognition (CVPR), 2021
Yaosi Hu
Chong Luo
Zhenzhong Chen
VGen
246
102
0
06 Dec 2021
Layered Controllable Video Generation
Layered Controllable Video Generation
Jiahui Huang
Yuhe Jin
K. M. Yi
Leonid Sigal
VGen
395
12
0
24 Nov 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViTVGen
298
343
0
24 Nov 2021
Image Comes Dancing with Collaborative Parsing-Flow Video Synthesis
Image Comes Dancing with Collaborative Parsing-Flow Video Synthesis
Bowen Wu
Zhenyu Xie
Xiaodan Liang
Yubei Xiao
Haoye Dong
Liang Lin
3DH
134
6
0
27 Oct 2021
Creating and Reenacting Controllable 3D Humans with Differentiable
  Rendering
Creating and Reenacting Controllable 3D Humans with Differentiable RenderingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Thiago L. Gomes
Thiago M. Coutinho
Rafael Azevedo
Renato Martins
Erickson R. Nascimento
3DH
147
3
0
22 Oct 2021
LARNet: Latent Action Representation for Human Action Synthesis
LARNet: Latent Action Representation for Human Action SynthesisBritish Machine Vision Conference (BMVC), 2021
Naman Biyani
A. J. Rana
Shruti Vyas
Yogesh S Rawat
168
4
0
21 Oct 2021
Pose-guided Generative Adversarial Net for Novel View Action Synthesis
Pose-guided Generative Adversarial Net for Novel View Action Synthesis
Xianhang Li
Junhao Zhang
Kunchang Li
Shruti Vyas
Yogesh S Rawat
GAN
299
3
0
15 Oct 2021
Towards Using Clothes Style Transfer for Scenario-aware Person Video
  Generation
Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation
Jingning Xu
Azade Farshad
Mingjie Wang
Siyuan Bian
Helisa Dhamo
Xiang Yin
Zejun Ma
VGen
249
0
0
14 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised
  Predictive Learning
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Zhiyu Yao
Yunbo Wang
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
236
12
0
08 Oct 2021
Physical Context and Timing Aware Sequence Generating GANs
Physical Context and Timing Aware Sequence Generating GANs
Hayato Futase
Tomoki Tsujimura
Tetsuya Kajimoto
Hajime Kawarazaki
Toshiyuki Suzuki
Makoto Miwa
Yutaka Sasaki
GAN
264
0
0
28 Sep 2021
Previous
123...12131415
Next
Page 13 of 15
Pageof 15