Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM, VGen

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
Yatian Pang
Bin Zhu
Bin Lin
Mingzhe Zheng
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
VGen, 3DH
299
12
0
30 Nov 2024
Fleximo: Towards Flexible Text-to-Human Motion Video Generation
Yuhang Zhang
Yuan Zhou
Zeyu Liu
Yuxuan Cai
Qiuyue Wang
Aidong Men
Huan Yang
VGen, DiffM
262
3
0
29 Nov 2024
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM
Computer Vision and Pattern Recognition (CVPR), 2024
Jiarui Wang
Huiyu Duan
Guoquan Zheng
Juntong Wang
Xiongkuo Min
EGVM
257
24
0
26 Nov 2024
Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
Guanjie Chen
Xinyu Zhao
Yucheng Zhou
Tianlong Chen
Cheng Yu
Yu Cheng
514
1
0
26 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guoquan Zheng
EGVM
533
11
0
25 Nov 2024
AnimateAnything: Consistent and Controllable Animation for Video Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Guojun Lei
Chi-Yin Wang
Hong Li
Rong Zhang
Yikai Wang
W. Xu
VGen, DiffM
171
24
0
16 Nov 2024
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Computer Vision and Pattern Recognition (CVPR), 2024
Rang Meng
Xingyu Zhang
Yuming Li
Chenguang Ma
455
49
0
15 Nov 2024
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Xiaofeng Wang
Kang Zhao
Fan Liu
Jiayu Wang
Guosheng Zhao
Xiaoyi Bao
Zheng Hua Zhu
Yingya Zhang
Xingang Wang
VGen
279
24
0
13 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
402
3
0
12 Nov 2024
Improved Video VAE for Latent Video Diffusion Model
Computer Vision and Pattern Recognition (CVPR), 2024
Pingyu Wu
Kai Zhu
Yu Liu
Liming Zhao
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
VGen, DiffM
175
18
0
10 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
489
38
0
08 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGen, DiffM
373
18
0
07 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
International Conference on Learning Representations (ICLR), 2024
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
587
39
0
07 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
International Conference on Learning Representations (ICLR), 2024
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Zhiyong Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
371
41
0
04 Nov 2024
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
Zhenwei Zhang
VGen, MedIm
318
6
0
03 Nov 2024
Fashion-VDM: Video Diffusion Model for Virtual Try-On
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
J. Karras
Yingwei Li
Nan Liu
Luyang Zhu
Innfarn Yoo
Andreas Lugmayr
Chris Lee
Ira Kemelmacher-Shlizerman
DiffM, VGen
293
13
0
31 Oct 2024
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Neural Information Processing Systems (NeurIPS), 2024
Penghui Ruan
Pichao Wang
Divya Saxena
Jiannong Cao
Yuhui Shi
DiffM, VGen
220
1
0
31 Oct 2024
TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation
Neural Information Processing Systems (NeurIPS), 2024
Sunjae Yoon
Gwanhyeong Koo
Younghwan Lee
Chang D. Yoo
VGen
373
10
0
31 Oct 2024
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Shengkai Zhang
Nianhong Jiao
Tian Li
Chaojie Yang
Chenhui Xue
Boya Niu
Jun Gao
VGen, VLM, DiffM
153
8
0
30 Oct 2024
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
International Conference on Learning Representations (ICLR), 2024
Hanyu Wang
Saksham Suri
Yixuan Ren
Hao Chen
Abhinav Shrivastava
VGen
352
27
0
28 Oct 2024
VISAGE: Video Synthesis using Action Graphs for Surgery
Yousef Yeganeh
Rachmadio Lazuardi
Amir Shamseddin
Emine Dari
Yash Thirani
Nassir Navab
Azade Farshad
MedIm
162
7
0
23 Oct 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
DiffM, VGen
1.0K
13
0
20 Oct 2024
EVA: An Embodied World Model for Future Video Anticipation
Yatian Wang
Hengyuan Zhang
Chun-Kai Fan
Xingqun Qi
Rongyu Zhang
...
Chi-Min Chan
Wei Xue
Wenhan Luo
Shanghang Zhang
VGen
235
18
0
20 Oct 2024
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning
Etai Littwin
Vimal Thilak
Anand Gopalakrishnan
299
17
0
14 Oct 2024
Separation of Neural Drives to Muscles from Transferred Polyfunctional Nerves using Implanted Micro-electrode Arrays
Laura Ferrante
Anna Boesendorfer
D. Barsakcioglu
Benedikt Baumgartner
Yazan Al-Ajam
Alex Woollard
Norbert Venantius Kang
Oskar Aszmann
D. Farina
240
15
0
14 Oct 2024
Asymptotic Analysis of Sample-averaged Q-learning
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2024
Saunak Kumar Panda
Ruiqi Liu
Yisha Xiang
OnRL
389
17
0
14 Oct 2024
HARIVO: Harnessing Text-to-Image Models for Video Generation
European Conference on Computer Vision (ECCV), 2024
Mingi Kwon
Seoung Wug Oh
Yang Zhou
Difan Liu
Joon-Young Lee
Haoran Cai
Baqiao Liu
Feng Liu
Youngjung Uh
VGen
143
6
0
10 Oct 2024
Progressive Autoregressive Video Diffusion Models
Desai Xie
Zhan Xu
Yicong Hong
Hao Tan
Difan Liu
Feng Liu
Arie E. Kaufman
Yang Zhou
DiffM, VGen
314
39
0
10 Oct 2024
DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation
Neural Information Processing Systems (NeurIPS), 2024
Zhiqi Li
Yiming Chen
Peidong Liu
3DGS
165
38
0
09 Oct 2024
Restructuring Vector Quantization with the Rotation Trick
International Conference on Learning Representations (ICLR), 2024
Christopher Fifty
Ronald G. Junkins
Dennis Duan
Aniketh Iyer
Jerry W. Liu
Ehsan Amid
Sebastian Thrun
Christopher Ré
LLMSV
508
35
0
08 Oct 2024
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
Fanqing Meng
Jiaqi Liao
Xinyu Tan
Wenqi Shao
Quanfeng Lu
Kaipeng Zhang
Yu Cheng
Dianqi Li
Yu Qiao
Ping Luo
VGen, EGVM
262
68
0
07 Oct 2024
Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality
Ge Ya Luo
Gian Mario Favero
Zhi Hao Luo
Alexia Jolicoeur-Martineau
Christopher Pal
VGen
223
7
0
07 Oct 2024
Noise Crystallization and Liquid Noise: Zero-shot Video Generation using Image Diffusion Models
Muhammad Haaris Khan
Hadrien Reynaud
Bernhard Kainz
VGen, DiffM
151
0
0
05 Oct 2024
ECHOPulse: ECG controlled echocardio-grams video generation
International Conference on Learning Representations (ICLR), 2024
Yiwei Li
Sekeun Kim
Zihao Wu
Hanqi Jiang
Yi Pan
...
Sifan Song
Yucheng Shi
Tianming Liu
Quanzheng Li
Xiang Li
VGen
183
4
0
04 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
375
68
0
03 Oct 2024
COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation
Mingzhen Sun
Weining Wang
Xinxin Zhu
Jing Liu
VGen, DiffM
174
0
0
02 Oct 2024
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Yuanxing Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen, DiffM
367
2
0
30 Sep 2024
High Quality Human Image Animation using Regional Supervision and Motion Blur Condition
Zhongcong Xu
Chaoyue Song
Guoxian Song
Jianfeng Zhang
Jun Hao Liew
...
You Xie
Linjie Luo
Guosheng Lin
Jiashi Feng
Mike Zheng Shou
DiffM, 3DH, VGen
351
5
0
29 Sep 2024
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
European Conference on Computer Vision (ECCV), 2024
Shaowei Liu
Zhongzheng Ren
Saurabh Gupta
Shenlong Wang
VGen, DiffM, PINN
266
99
0
27 Sep 2024
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Yuki Mitsufuji
VGen, DiffM
403
16
0
26 Sep 2024
Pose-Guided Fine-Grained Sign Language Video Generation
European Conference on Computer Vision (ECCV), 2024
Tongkai Shi
Lianyu Hu
Fanhua Shang
Jichao Feng
Peidong Liu
Wei Feng
VGen, SLR, DiffM
335
5
0
25 Sep 2024
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Xinrui Zhou
Yuhao Huang
Haoran Dou
Shijing Chen
Ao Chang
...
Jie Jessie Ren
Ruobing Huang
Jun Cheng
Wufeng Xue
Dong Ni
MedIm
774
1
0
25 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Computer Vision and Pattern Recognition (CVPR), 2024
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
450
51
0
24 Sep 2024
MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning
Yue Han
Junwei Zhu
Yuxiang Feng
Xiaozhong Ji
Keke He
Xiangtai Li
Zhucun Xue
Yong Liu
178
2
0
23 Sep 2024
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
Donglin Di
Hao Feng
Wenzhang Sun
Yongjia Ma
Hao Li
Wei Chen
Xiaofei Gou
Tonghua Su
Xun Yang
CVBM
411
2
0
23 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
469
5
0
22 Sep 2024
OSV: One Step is Enough for High-Quality Image to Video Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Xiaofeng Mao
Zhengkai Jiang
Fu-Yun Wang
Wenbing Zhu
Hao Chen
Mingmin Chi
Yabiao Wang
Wenhan Luo
DiffM, VGen
410
22
0
17 Sep 2024
MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control
Yining Yao
Xi Guo
C. Ding
Wei Wu
VGen
204
3
0
10 Sep 2024
DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation
Wei Wu
Xi Guo
Weixuan Tang
Tingxuan Huang
Chiyu Wang
Dongyue Chen
C. Ding
VGen
333
13
0
09 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen, DiffM
535
16
0
06 Sep 2024