arXiv:1812.01717

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018

Thomas Unterthiner

Sjoerd van Steenkiste

Raphaël Marinier

Marcin Michalski

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown

DeRA: Decoupled Representation Alignment for Video Tokenization

82

0

0

04 Dec 2025

Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation

220

0

0

03 Dec 2025

Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench

Abhilash Shankarampeta

221

1

0

02 Dec 2025

Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos

Ananya Srinivasan

Deepti Ghadiyaram

317

0

0

01 Dec 2025

Generative Video Motion Editing with 3D Point Tracks

DiffM VGen 3DPC

262

0

0

01 Dec 2025

SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation

177

0

0

01 Dec 2025

TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model

Alireza Javanmardi

Pragati Jaiswal

T. Habtegebrial

Christen Millerdurai

Didier Stricker

134

0

0

30 Nov 2025

Image Generation as a Visual Planner for Robotic Manipulation

87

0

0

29 Nov 2025

Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

D. Kothandaraman

Tsung-Wei Huang

Mohammad Hajiesmaili

183

0

0

29 Nov 2025

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Harold Haodong Chen

175

1

0

28 Nov 2025

InstanceV: Instance-Level Video Generation

Jiangning Zhang

120

0

0

28 Nov 2025

Captain Safari: A World Engine

172

0

0

28 Nov 2025

One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer

Jiangning Zhang

119

0

0

28 Nov 2025

IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer

89

0

0

27 Nov 2025

WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation

Mike Zheng Shou

95

0

0

27 Nov 2025

Fusion of classical and quantum kernels enables accurate and robust two-sample tests

Hiroyuki Tezuka

132

0

0

26 Nov 2025

3MDiT: Unified Tri-Modal Diffusion Transformer for Text-Driven Synchronized Audio-Video Generation

Pilar Oplustil Gallegos

Ioannis Koutsoumpas

...

192

0

0

26 Nov 2025

Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning

263

0

0

26 Nov 2025

Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations

Luis C. Garcia-Peraza-Herrera

235

0

0

25 Nov 2025

View-Consistent Diffusion Representations for 3D-Consistent Video Generation

Duolikun Danier

Steven McDonagh

Oisin Mac Aodha

135

0

0

24 Nov 2025

Eevee: Towards Close-up High-resolution Video-based Virtual Try-on

195

0

0

24 Nov 2025

Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization

Sina Mokhtarzadeh Azar

Enrico Pallotta

Gianpiero Francesca

121

0

0

23 Nov 2025

Native 3D Editing with Full Attention

Shuangkang Fang

127

0

0

21 Nov 2025

Show Me: Unifying Instructional Image and Video Generation with Diffusion Models

118

0

0

21 Nov 2025

H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation

215

1

0

21 Nov 2025

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

...

249

0

0

17 Nov 2025

Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos

255

0

0

17 Nov 2025

DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Kostas Daniilidis

182

1

0

10 Nov 2025

ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise Search

299

0

0

10 Nov 2025

Driving scenario generation and evaluation using a structured layer representation and foundational models

Gamal Elghazaly

88

0

0

03 Nov 2025

Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models

...

145

1

0

01 Nov 2025

DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model

164

0

0

31 Oct 2025

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

...

232

4

0

29 Oct 2025

Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

Stella Bounareli

152

1

0

27 Oct 2025

DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection

...

Soumyya Kanti Datta

133

1

0

26 Oct 2025

AutoScape: Geometry-Consistent Long-Horizon Scene Generation

Bingbing Zhuang

Manmohan Chandraker

154

0

0

23 Oct 2025

From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction

219

1

0

22 Oct 2025

OmniNWM: Omniscient Driving Navigation World Models

...

314

3

0

21 Oct 2025

UltraGen: High-Resolution Video Generation with Hierarchical Attention

Jiangning Zhang

206

5

0

21 Oct 2025

Demystifying Transition Matching: When and Why It Can Beat Flow Matching

125

0

0

20 Oct 2025

A Comprehensive Survey on World Models for Embodied AI

VGen LM&Ro SyDa

252

5

0

19 Oct 2025

ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

C. L. Philip Chen

360

1

0

16 Oct 2025

CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas

150

0

0

15 Oct 2025

LayerSync: Self-aligning Intermediate Layers

Yasaman Haghighi

Alexandre Alahi

115

1

0

14 Oct 2025

Time-Correlated Video Bridge Matching

Viacheslav Vasilev

Nikita Gushchin

Alexander Korotin

98

1

0

14 Oct 2025

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

...

186

2

0

13 Oct 2025

Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey

151

1

0

12 Oct 2025

DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis

99

0

0

12 Oct 2025

Ctrl-World: A Controllable Generative World Model for Robot Manipulation

Lucy Xiaoyang Shi

168

15

0

11 Oct 2025

VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework

150

1

0

11 Oct 2025
