ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGenDiffM
535
16
0
06 Sep 2024
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
Weipeng Tan
Chuming Lin
Chengming Xu
Xiaozhong Ji
Junwei Zhu
Chengjie Wang
Yanwei Fu
DiffM
153
3
0
05 Sep 2024
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video
  Diffusion Model
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
Liuhan Chen
Zongjian Li
Bin Lin
Bin Zhu
Qian Wang
Shenghai Yuan
X. Zhou
Xinhua Cheng
Li Yuan
DiffM
361
22
0
02 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive
  Content Generation
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haobo Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wen Liu
DiffM
226
39
0
02 Sep 2024
DriveGenVLM: Real-world Video Generation for Vision Language Model based
  Autonomous Driving
DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving
Yongjie Fu
Anmol Jain
Xuan Di
Xu Chen
Chengbo Zang
VGen
218
10
0
29 Aug 2024
GenRec: Unifying Video Generation and Recognition with Diffusion Models
GenRec: Unifying Video Generation and Recognition with Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2024
Zejia Weng
Xitong Yang
Zhen Xing
Zuxuan Wu
Yu-Gang Jiang
VGenDiffM
336
14
0
27 Aug 2024
Empowering Sign Language Communication: Integrating Sentiment and
  Semantics for Facial Expression Synthesis
Empowering Sign Language Communication: Integrating Sentiment and Semantics for Facial Expression SynthesisComputers & graphics (CG), 2024
Rafael Azevedo
Thiago M. Coutinho
Joao Klock Ferreira
Thiago L. Gomes
Erickson R. Nascimento
SLR
214
8
0
27 Aug 2024
TC-PDM: Temporally Consistent Patch Diffusion Models for
  Infrared-to-Visible Video Translation
TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation
Anh-Dzung Doan
Vu Minh Hieu Phan
Surabhi Gupta
Markus Wagner
Tat-Jun Chin
Ian Reid
VGenDiffM
182
1
0
26 Aug 2024
SurGen: Text-Guided Diffusion Model for Surgical Video Generation
SurGen: Text-Guided Diffusion Model for Surgical Video Generation
Joseph Cho
Samuel Schmidgall
C. Zakka
Mrudang Mathur
Dhamanpreet Kaur
R. Shad
W. Hiesinger
VGenMedIm
308
18
0
26 Aug 2024
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human PreferencesComputer Vision and Pattern Recognition (CVPR), 2024
Zhikai Li
Xuewen Liu
Dongrong Fu
Jianquan Li
Qingyi Gu
Kurt Keutzer
Zhen Dong
EGVMVGenDiffM
351
8
0
26 Aug 2024
E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video
  Editing Quality Assessment
E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun
Xiaoyu Liang
S. Fan
Wenxu Gao
Wei-Nan Gao
DiffM
262
6
0
21 Aug 2024
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
TrackGo: A Flexible and Efficient Method for Controllable Video GenerationAAAI Conference on Artificial Intelligence (AAAI), 2024
Haitao Zhou
Chuang Wang
Rui Nie
Jinxiao Lin
Dongdong Yu
Qian Yu
Changhu Wang
VGenDiffM
551
28
0
21 Aug 2024
Factorized-Dreamer: Training A High-Quality Video Generator with Limited
  and Low-Quality Data
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffMVGen
206
0
0
19 Aug 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffMVGen
461
10
0
19 Aug 2024
Quality Assessment in the Era of Large Models: A Survey
Quality Assessment in the Era of Large Models: A Survey
Zicheng Zhang
Yingjie Zhou
Chunyi Li
Baixuan Zhao
Xiaohong Liu
Guangtao Zhai
344
33
0
17 Aug 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
250
0
0
31 Jul 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
558
111
0
31 Jul 2024
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's
  Impact on Spatio-Temporal Cross-Attentions
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-AttentionsIEEE Access (IEEE Access), 2024
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Zinuo Li
Hamid Laga
F. Boussaïd
VGen
256
5
0
27 Jul 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image
  Animation
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Zhenzhi Wang
Shouqing Yang
Yanhong Zeng
Youqing Fang
Yuwei Guo
...
Jing Tan
Kai Chen
Tianfan Xue
Bo Dai
Dahua Lin
VGen3DH
385
50
0
24 Jul 2024
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie
Chun-Han Yao
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
VGen
384
99
0
24 Jul 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion
  Consistency in Videos
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGenEGVM
331
28
0
23 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGenDiffM
549
107
0
17 Jul 2024
QVD: Post-training Quantization for Video Diffusion Models
QVD: Post-training Quantization for Video Diffusion Models
Shilong Tian
Hong Chen
Chengtao Lv
Yu Liu
Jinyang Guo
Xianglong Liu
Shengxi Li
Hao Yang
Tao Xie
VGenMQ
280
12
0
16 Jul 2024
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint
  Video-Depth Generation
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Yuanhao Zhai
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Jianfeng Wang
Zhengyuan Yang
David Doermann
Junsong Yuan
Zicheng Liu
Lijuan Wang
DiffMVGen
241
11
0
15 Jul 2024
Towards Robust Event-based Networks for Nighttime via Unpaired
  Day-to-Night Event Translation
Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation
Yuhwan Jeong
Hoonhee Cho
Kuk-Jin Yoon
DiffM
202
8
0
15 Jul 2024
Kinetic Typography Diffusion Model
Kinetic Typography Diffusion Model
Seonmi Park
Inhwan Bae
Seunghyun Shin
Hae-Gon Jeon
DiffM
298
5
0
15 Jul 2024
TCAN: Animating Human Images with Temporally Consistent Pose Guidance
  using Diffusion Models
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
J. Kim
Min-Jung Kim
Junsoo Lee
Jaegul Choo
DiffM
204
11
0
12 Jul 2024
A Comprehensive Survey on Human Video Generation: Challenges, Methods,
  and Insights
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Wentao Lei
Jinting Wang
Fengji Ma
Guanjie Huang
Li Liu
VGenEGVM
289
16
0
11 Jul 2024
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse
  Disciplines
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Zidong Wang
Zeyu Lu
Di Huang
Tong He
Xihui Liu
Wanli Ouyang
Mengwei He
239
9
0
11 Jul 2024
Controlling Space and Time with Diffusion Models
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
458
54
0
10 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
340
7
0
10 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured
  Captions
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
291
104
0
08 Jul 2024
VIMI: Grounding Video Generation through Multi-modal Instruction
VIMI: Grounding Video Generation through Multi-modal Instruction
Yuwei Fang
Willi Menapace
Aliaksandr Siarohin
Tsai-Shien Chen
Kuan-Chien Wang
Ivan Skorokhodov
Graham Neubig
Sergey Tulyakov
VGen
332
10
0
08 Jul 2024
Towards a Scalable Reference-Free Evaluation of Generative Models
Towards a Scalable Reference-Free Evaluation of Generative Models
Azim Ospanov
Jingwei Zhang
Mohammad Jalali
Xuenan Cao
Andrej Bogdanov
Farzan Farnia
EGVM
251
18
0
03 Jul 2024
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang
Jiaxi Gu
L. Wang
Han Wang
Junqi Cheng
Yuefeng Zhu
Fangyuan Zou
VGen
429
153
0
28 Jun 2024
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables
  Open-World Instruction Following Agents
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
Zihao Wang
Shaofei Cai
Zhancun Mu
Haowei Lin
Ceyao Zhang
Xuejie Liu
Qing Li
Hoang Trung-Dung
Xiaojian Ma
Yitao Liang
LM&Ro
310
25
0
27 Jun 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
300
52
0
26 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis
  through Structure Guidance
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
688
26
0
26 Jun 2024
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
Haonan Qiu
Zhaoxi Chen
Zhouxia Wang
Yingqing He
Menghan Xia
Ziwei Liu
VGenDiffM
208
46
0
24 Jun 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video
  Generation
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
193
0
0
23 Jun 2024
Image Conductor: Precision Control for Interactive Video Synthesis
Image Conductor: Precision Control for Interactive Video Synthesis
Yaowei Li
Xintao Wang
Zhaoyang Zhang
Zhouxia Wang
Ziyang Yuan
Liangbin Xie
Yuexian Zou
Ying Shan
VGen
257
45
0
21 Jun 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human
  Feedback for Video Generation
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Xuan He
Dongfu Jiang
Ge Zhang
Max Ku
Achint Soni
...
Yaswanth Narsupalli
Rongqi Fan
Zhiheng Lyu
Yuchen Lin
Wenhu Chen
EGVMVGenALM
313
118
0
21 Jun 2024
Video Generation with Learned Action Prior
Video Generation with Learned Action Prior
Meenakshi Sarkar
Devansh Bhardwaj
Debasish Ghose
VGenGAN
306
0
0
20 Jun 2024
IRASim: A Fine-Grained World Model for Robot Manipulation
IRASim: A Fine-Grained World Model for Robot Manipulation
Fangqi Zhu
Hongtao Wu
Song Guo
Yuxiao Liu
Chilam Cheang
Tao Kong
329
25
0
20 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Neural Residual Diffusion Models for Deep Scalable Vision GenerationNeural Information Processing Systems (NeurIPS), 2024
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
412
8
0
19 Jun 2024
L4GM: Large 4D Gaussian Reconstruction Model
L4GM: Large 4D Gaussian Reconstruction ModelNeural Information Processing Systems (NeurIPS), 2024
Jiawei Ren
Kevin Xie
Ashkan Mirzaei
Hanxue Liang
Xiaohui Zeng
...
Ziwei Liu
Antonio Torralba
Sanja Fidler
Seung Wook Kim
Huan Ling
3DGS
261
95
0
14 Jun 2024
Training-free Camera Control for Video Generation
Training-free Camera Control for Video GenerationInternational Conference on Learning Representations (ICLR), 2024
Chen Hou
Guoqiang Wei
VGenDiffM
627
80
0
14 Jun 2024
SimGen: Simulator-conditioned Driving Scene Generation
SimGen: Simulator-conditioned Driving Scene Generation
Yunsong Zhou
Michael Simon
Zhenghao Peng
Sicheng Mo
Hongzi Zhu
Minyi Guo
Bolei Zhou
VGen
301
23
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing
  Reliability,Reproducibility, and Practicality
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
356
4
0
13 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
245
20
0
12 Jun 2024
Previous
123...789...131415
Next