ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
DeRA: Decoupled Representation Alignment for Video Tokenization
DeRA: Decoupled Representation Alignment for Video Tokenization
Pengbo Guo
Junke Wang
Zhen Xing
Chengxu Liu
Daoguo Dong
Xueming Qian
Zuxuan Wu
AI4TS
77
0
0
04 Dec 2025
Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Yuchen Deng
Xiuyang Wu
Hai-Tao Zheng
Jie Wang
Feidiao Yang
Yuxing Han
VGen
216
0
0
03 Dec 2025
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Lanxiang Hu
Abhilash Shankarampeta
Yixin Huang
Zilin Dai
Haoyang Yu
Yujie Zhao
Haoqiang Kang
Daniel Zhao
Tajana Rosing
Hao Zhang
VGenLRM
216
0
0
02 Dec 2025
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Xavier Thomas
Youngsun Lim
Ananya Srinivasan
Audrey Zheng
Deepti Ghadiyaram
EGVMVGen
316
0
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffMVGen3DPC
259
0
0
01 Dec 2025
SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Zisu Li
Hengye Lyu
Jiaxin Shi
Yufeng Zeng
Mingming Fan
Hanwang Zhang
Chen Liang
VGen
172
0
0
01 Dec 2025
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi
Pragati Jaiswal
T. Habtegebrial
Christen Millerdurai
Shaoxiang Wang
A. Pagani
Didier Stricker
DiffMVGen
126
0
0
30 Nov 2025
Image Generation as a Visual Planner for Robotic Manipulation
Image Generation as a Visual Planner for Robotic Manipulation
Ye Pang
VGen
78
0
0
29 Nov 2025
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Lingdong Wang
Guan-Ming Su
D. Kothandaraman
Tsung-Wei Huang
Mohammad Hajiesmaili
R. Sitaraman
DiffMVGen
168
0
0
29 Nov 2025
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
Hongfei Zhang
Kanghao Chen
Zixin Zhang
Harold Haodong Chen
Yuanhuiyi Lyu
Yuqi Zhang
Shuai Yang
Kun Zhou
Yingcong Chen
DiffMVGen
158
1
0
28 Nov 2025
InstanceV: Instance-Level Video Generation
InstanceV: Instance-Level Video Generation
Yuheng Chen
Teng Hu
Jiangning Zhang
Zhucun Xue
Ran Yi
Lizhuang Ma
DiffMVGen
120
0
0
28 Nov 2025
Captain Safari: A World Engine
Captain Safari: A World Engine
Yu-Cheng Chou
X. Wang
Yitong Li
Jiahao Wang
Hanting Liu
Cihang Xie
Alan Yuille
Junfei Xiao
VGen
172
0
0
28 Nov 2025
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
S. Shi
Jing Xu
Zhihang Li
Chunli Peng
Xiaoda Yang
Lijing Lu
Kai Hu
Jiangning Zhang
DiffM
112
0
0
28 Nov 2025
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
Bo Chen
Tao Liu
Qi Chen
Xie Chen
Zilong Zheng
VGen
88
0
0
27 Nov 2025
WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation
WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation
Quanjian Song
Yiren Song
Kelly Peng
Yuan Gao
Mike Zheng Shou
DiffMVGen
92
0
0
27 Nov 2025
Fusion of classical and quantum kernels enables accurate and robust two-sample tests
Fusion of classical and quantum kernels enables accurate and robust two-sample tests
Yu Terada
Yugo Ogio
Ken Arai
Hiroyuki Tezuka
Yu Tanaka
132
0
0
26 Nov 2025
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation
Y. Li
Heyu Si
Federico Landi
Pilar Oplustil Gallegos
Ioannis Koutsoumpas
...
Ruiju Fu
Qi Guo
Xin Jin
Shunyu Liu
Mingli Song
DiffMVGen
192
0
0
26 Nov 2025
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Changlin Li
Jiawei Zhang
Shuhao Liu
Sihao Lin
Z. Shi
Zhihui Li
Xiaojun Chang
DiffMVGen
258
0
0
26 Nov 2025
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
Chao Wang
Chengan Che
Xinyue Chen
Sophia Tsoka
Luis C. Garcia-Peraza-Herrera
227
0
0
25 Nov 2025
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
Duolikun Danier
Ge Gao
Steven McDonagh
Changjian Li
Hakan Bilen
Oisin Mac Aodha
DiffMVGen
134
0
0
24 Nov 2025
Eevee: Towards Close-up High-resolution Video-based Virtual Try-on
Eevee: Towards Close-up High-resolution Video-based Virtual Try-on
Jianhao Zeng
Y. Bai
Ruidong Chen
Xuanpu Zhang
Lei-huan Sun
Dongyang Jin
Ryan Xu
Nannan Zhang
Dan Song
Xiangxiang Chu
190
0
0
24 Nov 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
116
0
0
23 Nov 2025
Native 3D Editing with Full Attention
Native 3D Editing with Full Attention
Weiwei Cai
Shuangkang Fang
Weicai Ye
Xin Dong
Y. Yang
Xuanyang Zhang
Wei Cheng
Yanpei Cao
Gang Yu
Tao Chen
DiffM
127
0
0
21 Nov 2025
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Yujiang Pu
Zhanbo Huang
Vishnu Boddeti
Yu Kong
DiffMVGen
108
0
0
21 Nov 2025
H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation
H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation
Yijie Zhu
Rui Shao
Ziyang Liu
Jie He
Jizhihui Liu
Jiuru Wang
Zitong Yu
210
1
0
21 Nov 2025
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
Enhui Ma
Lijun Zhou
Tao Tang
Jiahuan Zhang
Junpeng Jiang
...
Xianpeng Lang
Haiyang Sun
Xia Zhou
Di Lin
Kaicheng Yu
241
0
0
17 Nov 2025
Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Taiyi Su
Jian Zhu
Yaxuan Li
Chong Ma
Zitai Huang
Yichen Zhu
Hanli Wang
VGen
250
0
0
17 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
182
1
0
10 Nov 2025
ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search
ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search
Zhenjie Liu
Jianzhang Lu
Renjie Lu
Cong Liang
S. Wang
DiffMVGen
297
0
0
10 Nov 2025
Driving scenario generation and evaluation using a structured layer representation and foundational models
Driving scenario generation and evaluation using a structured layer representation and foundational models
Arthur Hubert
Gamal Elghazaly
R. Frank
88
0
0
03 Nov 2025
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models
Panwang Pan
Chenguo Lin
Jingjing Zhao
Chenxin Li
Yuchen Lin
...
Honglei Yan
Kairun Wen
Yunlong Lin
Yixuan Yuan
Yadong Mu
3DGSVGen
145
1
0
01 Nov 2025
DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model
DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model
Yucheng Xing
Jinxing Yin
Xiaodong Liu
VGen
160
0
0
31 Oct 2025
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
Baolu Li
Y. Zhang
Qinghe Wang
Liqian Ma
Xiaoyu Shi
...
Pengfei Wan
Zhenfei Yin
Yunzhi Zhuge
Huchuan Lu
Xu Jia
VGen
224
2
0
29 Oct 2025
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Junyoung Seo
Rodrigo Mira
A. Haliassos
Stella Bounareli
Honglie Chen
Linh Tran
Seungryong Kim
Zoe Landgraf
Jie Shen
VGen
145
1
0
27 Oct 2025
DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection
DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection
Kangran Zhao
Yupeng Chen
Xiaoyu Zhang
Yize Chen
Weinan Guan
...
Chengzhe Sun
Soumyya Kanti Datta
Qingshan Liu
Siwei Lyu
Baoyuan Wu
120
1
0
26 Oct 2025
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen
Ziyu Jiang
Mingfu Liang
Bingbing Zhuang
Jong-Chyi Su
Sparsh Garg
Ying Wu
Manmohan Chandraker
VGen
150
0
0
23 Oct 2025
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
Zhida Zhao
Talas Fu
Yifan Wang
Lijun Wang
Huchuan Lu
VGen
214
1
0
22 Oct 2025
OmniNWM: Omniscient Driving Navigation World Models
OmniNWM: Omniscient Driving Navigation World Models
Bohan Li
Zhuang Ma
Dalong Du
Baorui Peng
Zhujin Liang
...
Chao Ma
Yueming Jin
Hao Zhao
Wenjun Zeng
Xin Jin
VGen
311
3
0
21 Oct 2025
UltraGen: High-Resolution Video Generation with Hierarchical Attention
UltraGen: High-Resolution Video Generation with Hierarchical Attention
Teng Hu
Jiangning Zhang
Zihan Su
Ran Yi
DiffMVGen
206
5
0
21 Oct 2025
Demystifying Transition Matching: When and Why It Can Beat Flow Matching
Demystifying Transition Matching: When and Why It Can Beat Flow Matching
Jaihoon Kim
Rajarshi Saha
Minhyuk Sung
Youngsuk Park
125
0
0
20 Oct 2025
A Comprehensive Survey on World Models for Embodied AI
A Comprehensive Survey on World Models for Embodied AI
Xinqing Li
Xin He
Le Zhang
Yun-Hai Liu
Xiaoli Li
Yun-Hai Liu
VGenLM&RoSyDa
248
3
0
19 Oct 2025
ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
Meiqi Wu
Jiashu Zhu
Xiaokun Feng
C. L. Philip Chen
Chen Zhu
Bingze Song
Fangyuan Mao
Jiahong Wu
Xiangxiang Chu
Kaiqi Huang
VGenEGVMVLM
354
1
0
16 Oct 2025
CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas
CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas
Zian Li
Muhan Zhang
DiffMVGen
146
0
0
15 Oct 2025
LayerSync: Self-aligning Intermediate Layers
LayerSync: Self-aligning Intermediate Layers
Yasaman Haghighi
B. V. Delft
Mariam Hassan
Alexandre Alahi
115
0
0
14 Oct 2025
Time-Correlated Video Bridge Matching
Time-Correlated Video Bridge Matching
Viacheslav Vasilev
Arseny Ivanov
Nikita Gushchin
Maria Kovaleva
Alexander Korotin
DiffM
92
1
0
14 Oct 2025
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Haomin Wang
Jinhui Yin
Qi Wei
Wenguang Zeng
Lixin Gu
...
Yanwen Guo
Wenhai Wang
Kai Chen
Yu Qiao
Hongjie Zhang
VLM
185
2
0
13 Oct 2025
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
Jinxuan Li
Chaolei Tan
Haoxuan Chen
Jianxin Ma
Jian-Fang Hu
Wei-Shi Zheng
Jianhuang Lai
VLM
141
1
0
12 Oct 2025
DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis
DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis
Peiyin Chen
Zhuowei Yang
Hui Feng
Sheng Jiang
Rui Yan
DiffMVGen
92
0
0
12 Oct 2025
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Yanjiang Guo
Lucy Xiaoyang Shi
Jianyu Chen
Chelsea Finn
VGen
154
15
0
11 Oct 2025
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
Donglin Huang
Yongyuan Li
Tianhang Liu
Junming Huang
Xiaoda Yang
Chi-Yin Wang
Weiwei Xu
VGen
150
1
0
11 Oct 2025
1234...131415
Next