1812.01717
Towards Accurate Generative Models of Video: A New Metric & Challenges
3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
Papers citing
"Towards Accurate Generative Models of Video: A New Metric & Challenges"
50 / 715 papers shown
DeRA: Decoupled Representation Alignment for Video Tokenization
Pengbo Guo
Junke Wang
Zhen Xing
Chengxu Liu
Daoguo Dong
Xueming Qian
Zuxuan Wu
AI4TS
77
0
0
04 Dec 2025
Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Yuchen Deng
Xiuyang Wu
Hai-Tao Zheng
Jie Wang
Feidiao Yang
Yuxing Han
VGen
216
0
0
03 Dec 2025
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Lanxiang Hu
Abhilash Shankarampeta
Yixin Huang
Zilin Dai
Haoyang Yu
Yujie Zhao
Haoqiang Kang
Daniel Zhao
Tajana Rosing
Hao Zhang
VGen
LRM
216
0
0
02 Dec 2025
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Xavier Thomas
Youngsun Lim
Ananya Srinivasan
Audrey Zheng
Deepti Ghadiyaram
EGVM
VGen
316
0
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffM
VGen
3DPC
259
0
0
01 Dec 2025
SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Zisu Li
Hengye Lyu
Jiaxin Shi
Yufeng Zeng
Mingming Fan
Hanwang Zhang
Chen Liang
VGen
172
0
0
01 Dec 2025
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi
Pragati Jaiswal
T. Habtegebrial
Christen Millerdurai
Shaoxiang Wang
A. Pagani
Didier Stricker
DiffM
VGen
126
0
0
30 Nov 2025
Image Generation as a Visual Planner for Robotic Manipulation
Ye Pang
VGen
78
0
0
29 Nov 2025
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Lingdong Wang
Guan-Ming Su
D. Kothandaraman
Tsung-Wei Huang
Mohammad Hajiesmaili
R. Sitaraman
DiffM
VGen
168
0
0
29 Nov 2025
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
Hongfei Zhang
Kanghao Chen
Zixin Zhang
Harold Haodong Chen
Yuanhuiyi Lyu
Yuqi Zhang
Shuai Yang
Kun Zhou
Yingcong Chen
DiffM
VGen
158
1
0
28 Nov 2025
InstanceV: Instance-Level Video Generation
Yuheng Chen
Teng Hu
Jiangning Zhang
Zhucun Xue
Ran Yi
Lizhuang Ma
DiffM
VGen
120
0
0
28 Nov 2025
Captain Safari: A World Engine
Yu-Cheng Chou
X. Wang
Yitong Li
Jiahao Wang
Hanting Liu
Cihang Xie
Alan Yuille
Junfei Xiao
VGen
172
0
0
28 Nov 2025
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
S. Shi
Jing Xu
Zhihang Li
Chunli Peng
Xiaoda Yang
Lijing Lu
Kai Hu
Jiangning Zhang
DiffM
112
0
0
28 Nov 2025
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
Bo Chen
Tao Liu
Qi Chen
Xie Chen
Zilong Zheng
VGen
88
0
0
27 Nov 2025
WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation
Quanjian Song
Yiren Song
Kelly Peng
Yuan Gao
Mike Zheng Shou
DiffM
VGen
92
0
0
27 Nov 2025
Fusion of classical and quantum kernels enables accurate and robust two-sample tests
Yu Terada
Yugo Ogio
Ken Arai
Hiroyuki Tezuka
Yu Tanaka
132
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffM
VGen
192
0
0
26 Nov 2025
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Changlin Li
Jiawei Zhang
Shuhao Liu
Sihao Lin
Z. Shi
Zhihui Li
Xiaojun Chang
DiffM
VGen
258
0
0
26 Nov 2025
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Chao Wang
Chengan Che
Xinyue Chen
Sophia Tsoka
Luis C. Garcia-Peraza-Herrera
227
0
0
25 Nov 2025
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
Duolikun Danier
Ge Gao
Steven McDonagh
Changjian Li
Hakan Bilen
Oisin Mac Aodha
DiffM
VGen
134
0
0
24 Nov 2025
Eevee: Towards Close-up High-resolution Video-based Virtual Try-on
Jianhao Zeng
Y. Bai
Ruidong Chen
Xuanpu Zhang
Lei-huan Sun
Dongyang Jin
Ryan Xu
Nannan Zhang
Dan Song
Xiangxiang Chu
190
0
0
24 Nov 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
116
0
0
23 Nov 2025
Native 3D Editing with Full Attention
Weiwei Cai
Shuangkang Fang
Weicai Ye
Xin Dong
Y. Yang
Xuanyang Zhang
Wei Cheng
Yanpei Cao
Gang Yu
Tao Chen
DiffM
127
0
0
21 Nov 2025
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Yujiang Pu
Zhanbo Huang
Vishnu Boddeti
Yu Kong
DiffM
VGen
108
0
0
21 Nov 2025
H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation
Yijie Zhu
Rui Shao
Ziyang Liu
Jie He
Jizhihui Liu
Jiuru Wang
Zitong Yu
210
1
0
21 Nov 2025
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
Enhui Ma
Lijun Zhou
Tao Tang
Jiahuan Zhang
Junpeng Jiang
...
Xianpeng Lang
Haiyang Sun
Xia Zhou
Di Lin
Kaicheng Yu
241
0
0
17 Nov 2025
Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Taiyi Su
Jian Zhu
Yaxuan Li
Chong Ma
Zitai Huang
Yichen Zhu
Hanli Wang
VGen
250
0
0
17 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
182
1
0
10 Nov 2025
ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search
Zhenjie Liu
Jianzhang Lu
Renjie Lu
Cong Liang
S. Wang
DiffM
VGen
297
0
0
10 Nov 2025
Driving scenario generation and evaluation using a structured layer representation and foundational models
Arthur Hubert
Gamal Elghazaly
R. Frank
88
0
0
03 Nov 2025
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models
Panwang Pan
Chenguo Lin
Jingjing Zhao
Chenxin Li
Yuchen Lin
...
Honglei Yan
Kairun Wen
Yunlong Lin
Yixuan Yuan
Yadong Mu
3DGS
VGen
145
1
0
01 Nov 2025
DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model
Yucheng Xing
Jinxing Yin
Xiaodong Liu
VGen
160
0
0
31 Oct 2025
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
Baolu Li
Y. Zhang
Qinghe Wang
Liqian Ma
Xiaoyu Shi
...
Pengfei Wan
Zhenfei Yin
Yunzhi Zhuge
Huchuan Lu
Xu Jia
VGen
224
2
0
29 Oct 2025
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Junyoung Seo
Rodrigo Mira
A. Haliassos
Stella Bounareli
Honglie Chen
Linh Tran
Seungryong Kim
Zoe Landgraf
Jie Shen
VGen
145
1
0
27 Oct 2025
DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection
Kangran Zhao
Yupeng Chen
Xiaoyu Zhang
Yize Chen
Weinan Guan
...
Chengzhe Sun
Soumyya Kanti Datta
Qingshan Liu
Siwei Lyu
Baoyuan Wu
120
1
0
26 Oct 2025
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen
Ziyu Jiang
Mingfu Liang
Bingbing Zhuang
Jong-Chyi Su
Sparsh Garg
Ying Wu
Manmohan Chandraker
VGen
150
0
0
23 Oct 2025
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
Zhida Zhao
Talas Fu
Yifan Wang
Lijun Wang
Huchuan Lu
VGen
214
1
0
22 Oct 2025
OmniNWM: Omniscient Driving Navigation World Models
Bohan Li
Zhuang Ma
Dalong Du
Baorui Peng
Zhujin Liang
...
Chao Ma
Yueming Jin
Hao Zhao
Wenjun Zeng
Xin Jin
VGen
311
3
0
21 Oct 2025
UltraGen: High-Resolution Video Generation with Hierarchical Attention
Teng Hu
Jiangning Zhang
Zihan Su
Ran Yi
DiffM
VGen
206
5
0
21 Oct 2025
Demystifying Transition Matching: When and Why It Can Beat Flow Matching
Jaihoon Kim
Rajarshi Saha
Minhyuk Sung
Youngsuk Park
125
0
0
20 Oct 2025
A Comprehensive Survey on World Models for Embodied AI
Xinqing Li
Xin He
Le Zhang
Yun-Hai Liu
Xiaoli Li
Yun-Hai Liu
VGen
LM&Ro
SyDa
248
3
0
19 Oct 2025
ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
Meiqi Wu
Jiashu Zhu
Xiaokun Feng
C. L. Philip Chen
Chen Zhu
Bingze Song
Fangyuan Mao
Jiahong Wu
Xiangxiang Chu
Kaiqi Huang
VGen
EGVM
VLM
354
1
0
16 Oct 2025
CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas
Zian Li
Muhan Zhang
DiffM
VGen
146
0
0
15 Oct 2025
LayerSync: Self-aligning Intermediate Layers
Yasaman Haghighi
B. V. Delft
Mariam Hassan
Alexandre Alahi
115
0
0
14 Oct 2025
Time-Correlated Video Bridge Matching
Viacheslav Vasilev
Arseny Ivanov
Nikita Gushchin
Maria Kovaleva
Alexander Korotin
DiffM
92
1
0
14 Oct 2025
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Haomin Wang
Jinhui Yin
Qi Wei
Wenguang Zeng
Lixin Gu
...
Yanwen Guo
Wenhai Wang
Kai Chen
Yu Qiao
Hongjie Zhang
VLM
185
2
0
13 Oct 2025
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
Jinxuan Li
Chaolei Tan
Haoxuan Chen
Jianxin Ma
Jian-Fang Hu
Wei-Shi Zheng
Jianhuang Lai
VLM
141
1
0
12 Oct 2025
DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis
Peiyin Chen
Zhuowei Yang
Hui Feng
Sheng Jiang
Rui Yan
DiffM
VGen
92
0
0
12 Oct 2025
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Yanjiang Guo
Lucy Xiaoyang Shi
Jianyu Chen
Chelsea Finn
VGen
154
15
0
11 Oct 2025
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
Donglin Huang
Yongyuan Li
Tianhang Liu
Junming Huang
Xiaoda Yang
Chi-Yin Wang
Weiwei Xu
VGen
150
1
0
11 Oct 2025