Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM, VGen
arXiv: 1812.01717

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
Donglin Huang
Yongyuan Li
Tianhang Liu
Junming Huang
Xiaoda Yang
Chi-Yin Wang
Weiwei Xu
VGen
150
1
0
11 Oct 2025
iMoWM: Taming Interactive Multi-Modal World Model for Robotic Manipulation
Chuanrui Zhang
Zhengxian Wu
Guanxing Lu
Yansong Tang
Ziwei Wang
VGen
100
0
0
10 Oct 2025
CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving
Tianrui Zhang
Yichen Liu
Zilin Guo
Yuxin Guo
Jingcheng Ni
Chenjing Ding
Dan Xu
Lewei Lu
Z. Wu
VGen
194
0
0
09 Oct 2025
Real-Time Motion-Controllable Autoregressive Video Diffusion
Kesen Zhao
Jiaxin Shi
B. Zhu
Junbao Zhou
Xiaolong Shen
Yuan Zhou
Qianru Sun
Hanwang Zhang
VGen
221
1
0
09 Oct 2025
FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control
Zhiyuan Zhang
Can Wang
Dongdong Chen
Jing Liao
VGen
244
2
0
09 Oct 2025
An approach for systematic decomposition of complex llm tasks
Tianle Zhou
Jiakai Xu
G. Liu
Jiaxiang Liu
Haonan Wang
Eugene Wu
147
0
0
09 Oct 2025
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning
Minghong Cai
Qiulin Wang
Zongli Ye
Wenze Liu
Quande Liu
Weicai Ye
X. Wang
Pengfei Wan
Kun Gai
Xiangyu Yue
VGen
92
0
0
09 Oct 2025
AVO: Amortized Value Optimization for Contact Mode Switching in Multi-Finger Manipulation
Adam Hung
Fan Yang
Abhinav Kumar
Sergio Aguilera Marinovic
Soshi Iba
Rana Soltani Zarrin
Dmitry Berenson
112
3
0
08 Oct 2025
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Siyoon Jin
S. Kim
Dahyun Chung
J. Lee
Hyunwook Choi
Jisu Nam
J. Kim
S. Kim
VGen
106
1
0
08 Oct 2025
Split Conformal Classification with Unsupervised Calibration
Santiago Mazuelas
209
1
0
08 Oct 2025
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation
Zezhong Qian
Xiaowei Chi
Yuming Li
Shizun Wang
Zhiyuan Qin
Xiaozhu Ju
Sirui Han
Shanghang Zhang
VGen
129
2
0
08 Oct 2025
Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
Andreas Maurer
Erfan Mirzaei
Massimiliano Pontil
AI4CE
131
0
0
07 Oct 2025
Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models
Jiahao Wang
Zhenpei Yang
Yijing Bai
Yingwei Li
Yuliang Zou
...
Zehao Zhu
Jyh-Jing Hwang
Dragomir Anguelov
Mingxing Tan
C. Jiang
VGen
101
0
0
07 Oct 2025
ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model
Luo Cheng
Song Siyang
Yan Siyuan
Yu Zhen
Ge Zongyuan
89
1
0
06 Oct 2025
Bridging Text and Video Generation: A Survey
Nilay Kumar
Priyansh Bhandari
G. Maragatham
VGen
264
0
0
06 Oct 2025
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction
Kaisi Guan
Xihua Wang
Zhengfeng Lai
Xin Cheng
Peng Zhang
Xiaojiang Liu
Ruihua Song
Meng Cao
DiffM
256
4
0
03 Oct 2025
Streaming Drag-Oriented Interactive Video Manipulation: Drag Anything, Anytime!
Junbao Zhou
Yuan Zhou
Kesen Zhao
Qingshan Xu
B. Zhu
Richang Hong
Hanwang Zhang
DiffM, VGen
234
2
0
03 Oct 2025
Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks
Bruno Corcuera
Carlos Eiras-Franco
Brais Cancela
108
0
0
02 Oct 2025
Arbitrary Generative Video Interpolation
Guozhen Zhang
Haiguang Wang
C. Wang
Yuan Zhou
Qinglin Lu
Limin Wang
VGen
143
0
0
01 Oct 2025
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
Jiahao Wang
Luoxin Ye
Taiming Lu
Junfei Xiao
Jiahan Zhang
...
Xijun Liu
Rama Chellappa
Cheng-Fang Peng
Alan Yuille
Jieneng Chen
VGen
129
2
0
01 Oct 2025
Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation
Agneet Chatterjee
Rahim Entezari
Maksym Zhuravinskyi
Maksim Lapin
Reshinth Adithyan
Amit Raj
Chitta Baral
Yezhou Yang
Varun Jampani
DiffM, EGVM, VGen
137
0
0
30 Sep 2025
UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark
Ailing Zhang
Lina Lei
Dehong Kong
Zhixin Wang
Jiaqi Xu
Fenglong Song
Chun-Le Guo
Chang Liu
Fan Li
Jie Chen
VGen
89
3
0
29 Sep 2025
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
Junyu Chen
Wenkun He
Yuchao Gu
Yuyang Zhao
Jincheng Yu
...
Haocheng Xi
Ligeng Zhu
Enze Xie
Song Han
Han Cai
VGen
174
2
0
29 Sep 2025
FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation
Yunyang Ge
Xinhua Cheng
ChengShu Zhao
Xianyi He
Shenghai Yuan
Bin Lin
Bin Zhu
Li Yuan
VGen, VLM
200
0
0
29 Sep 2025
Fidelity-Aware Data Composition for Robust Robot Generalization
Zizhao Tong
Di Chen
Sicheng Hu
Hongwei Fan
Liliang Chen
Maoqing Yao
Hao Tang
Hao Dong
Ling Shao
136
1
0
29 Sep 2025
NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis
Yixuan Ren
Hanyu Wang
Hao Chen
Bo He
Abhinav Shrivastava
DiffM, VGen
120
1
0
29 Sep 2025
PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos
Ting-Hsuan Liao
Haowen Liu
Yiran Xu
Songwei Ge
Gengshan Yang
Jia-Bin Huang
88
0
0
29 Sep 2025
Reinforcement Learning with Inverse Rewards for World Model Post-training
Yang Ye
Tianyu He
Shuo Yang
Jiang Bian
VGen
159
1
0
28 Sep 2025
WoW: Towards a World omniscient World model Through Embodied Interaction
Xiaowei Chi
Peidong Jia
Chun-Kai Fan
Xiaozhu Ju
Weishi Mi
...
Wei Xue
Sirui Han
Yike Guo
Shanghang Zhang
Yong Dai
VGen
160
2
0
26 Sep 2025
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
Liyang Chen
Tianze Zhou
Xu He
Boshi Tang
Zhiyong Wu
Yang Huang
Yang Wu
Zhongqian Sun
Wei Yang
Helen M. Meng
DiffM
193
0
0
26 Sep 2025
Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers
Jibin Song
Mingi Kwon
Jaeseok Jeong
Youngjung Uh
DiffM, VGen
1.4K
0
0
26 Sep 2025
Physically Plausible Multi-System Trajectory Generation and Symmetry Discovery
Jiayin Liu
Yulong Yang
Vineet Bansal
Christine Allen-Blanchette
115
0
0
26 Sep 2025
EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Yuan Xu
Jiabing Yang
X. Wang
Yixiang Chen
Zheng Zhu
...
Shuo Lu
Jing Liu
Nianfeng Liu
Yan Huang
Liang Wang
VGen
140
3
0
26 Sep 2025
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
Xingyu Fu
Siyi Liu
Yinuo Xu
Pan Lu
Guangqiuse Hu
...
Chung Un Lee
Yejin Choi
James Zou
Dan Roth
Chris Callison-Burch
141
0
0
26 Sep 2025
What Happens Next? Anticipating Future Motion by Generating Point Trajectories
Gabrijel Boduljak
Laurynas Karazija
Iro Laina
Christian Rupprecht
Andrea Vedaldi
VGen
112
1
0
25 Sep 2025
MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation
Guojun Lei
Chi-Yin Wang
Yikai Wang
Hong Li
Ying Song
W. Xu
DiffM, VGen
99
0
0
25 Sep 2025
CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion
Chenhao Ji
Chaohui Yu
Junyao Gao
Fan Wang
Cairong Zhao
DiffM, VGen
114
1
0
24 Sep 2025
World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation
Zhennan Jiang
Kai Liu
Yuxin Qin
Shuai Tian
Yupeng Zheng
Mingcai Zhou
Chao Yu
Haoran Li
Dongbin Zhao
105
3
0
23 Sep 2025
Echo-Path: Pathology-Conditioned Echo Video Generation
Kabir Hamzah Muhammad
Marawan Elbatel
Yi Qin
Xiaomeng Li
VGen, MedIm
88
0
0
21 Sep 2025
$\mathtt{M^3VIR}$: A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation
Y. Li
Lebin Zhou
Nam Ling
Zhenghao Chen
Wei Wang
Wei Jiang
VGen
161
0
0
21 Sep 2025
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
Yue Ma
Zexuan Yan
Hongyu Liu
H. Wang
Heng Pan
...
H. Shum
Zhifeng Li
Wei Liu
Linfeng Zhang
Qifeng Chen
VGen
263
13
0
20 Sep 2025
Neural Atlas Graphs for Dynamic Scene Decomposition and Editing
Jan Philipp Schneider
Pratik Singh Bisht
Ilya Chugunov
A. Kolb
Michael Moeller
Felix Heide
197
1
0
19 Sep 2025
SAMPO:Scale-wise Autoregression with Motion PrOmpt for generative world models
Sen Wang
Jingyi Tian
Le Wang
Zhimin Liao
Jiayi Li
Huaiyi Dong
Kun Xia
Sanping Zhou
Wei Tang
Hua Gang
VGen, LRM
175
0
0
19 Sep 2025
WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance
Chenxi Song
Yanming Yang
Tong Zhao
Ruibo Li
Chi Zhang
VGen
261
4
0
18 Sep 2025
Wan-Animate: Unified Character Animation and Replacement with Holistic Replication
Gang Cheng
X. Gao
Li Hu
Siqi Hu
Mingyang Huang
...
Peng Zhang
Xindi Zhang
Zhe Zhang
Jingren Zhou
Lian Zhuo
VGen
234
13
0
17 Sep 2025
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
Y. Zhou
Yifan Wang
Jianjun Zhou
Wenzheng Chang
Haoyu Guo
...
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Kaipeng Zhang
Tong He
VGen
272
5
0
15 Sep 2025
HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments
J. Karras
Yingwei Li
Yasamin Jafarian
Ira Kemelmacher-Shlizerman
129
0
0
15 Sep 2025
Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders
Dohun Lee
Hyeonho Jeong
Jiwook Kim
Duygu Ceylan
J. C. Ye
VGen
129
0
0
11 Sep 2025
LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations
Payal Varshney
Adriano Lucieri
Christoph Balada
Sheraz Ahmed
Andreas Dengel
VGen
204
0
0
10 Sep 2025
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Jenna Kang
Maria Silva
Patsorn Sangkloy
Kenneth Chen
Niall Williams
Qi Sun
EGVM, VGen
144
0
0
10 Sep 2025