v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018

Thomas Unterthiner

Sjoerd van Steenkiste

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown

DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion

226

02 Jun 2025

Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

254

02 Jun 2025

Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction

1.1K

30 May 2025

DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds

201

30 May 2025

Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization

539

29 May 2025

MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

250

29 May 2025

Toward Memory-Aided World Models: Benchmarking via Spatial Consistency

235

29 May 2025

GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control

353

28 May 2025

PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms

301

28 May 2025

Assessing the Use of Face Swapping Methods as Face Anonymizers in VideosInternational Conference on Digital Signal Processing (DSP), 2025

Mustafa İzzet Muştu

Hazım Kemal Ekenel

PICV CVBM

407

27 May 2025

Unified Text-Image-to-Video Generation: A Training-Free Approach to Flexible Visual Conditioning

274

27 May 2025

Frame In-N-Out: Unbounded Controllable Image-to-Video Generation

377

27 May 2025

AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models

322

26 May 2025

ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos

Xiaodong Wang

Peixi Peng

VGen

1.3K

24 May 2025

SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain

190

23 May 2025

Temporal Differential Fields for 4D Motion Modeling via Image-to-Video SynthesisInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

567

22 May 2025

Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose InteractionComputer Vision and Pattern Recognition (CVPR), 2025

256

22 May 2025

Consistent World Models via Foresight Diffusion

229

22 May 2025

Vid2World: Crafting Video Diffusion Models to Interactive World Models

331

20 May 2025

FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal GuidanceComputer Vision and Pattern Recognition (CVPR), 2025

372

19 May 2025

Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking

400

19 May 2025

Building spatial world models from sparse transitional episodic memories

237

19 May 2025

Video-GPT via Next Clip Diffusion

620

18 May 2025

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

...

452

17 May 2025

MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation

803

15 May 2025

FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation

288

15 May 2025

Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation ModelInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

529

12 May 2025

ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images

239

10 May 2025

A Unit Enhancement and Guidance Framework for Audio-Driven Avatar Video Generation

577

06 May 2025

Learning 3D Persistent Embodied World Models

380

05 May 2025

Direct Motion Models for Assessing Generated Videos

...

Sjoerd van Steenkiste

EGVM DiffM VGen

487

30 Apr 2025

ReVision: Refining Video Diffusion with Explicit 3D Motion Modeling

510

30 Apr 2025

MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance

519

30 Apr 2025

DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer

268

28 Apr 2025

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular VideosComputer Vision and Pattern Recognition (CVPR), 2025

313

27 Apr 2025

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

247

25 Apr 2025

ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance

326

23 Apr 2025

Solving New Tasks by Adapting Internet Video KnowledgeInternational Conference on Learning Representations (ICLR), 2025

235

21 Apr 2025

FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models

346

20 Apr 2025

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric VideosInternational Conference on Learning Representations (ICLR), 2025

277

16 Apr 2025

VideoPanda: Video Panoramic Diffusion with Multi-view Attention

407

15 Apr 2025

Taming Consistency Distillation for Accelerated Human Image Animation

332

15 Apr 2025

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

...

318

15 Apr 2025

Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting

616

15 Apr 2025

On Equivariance and Fast Sampling in Video Diffusion Models Trained with Warped Noise

Chao Liu

Arash Vahdat

DiffM VGen

384

14 Apr 2025

H-MoRe: Learning Human-centric Motion Representation for Action AnalysisComputer Vision and Pattern Recognition (CVPR), 2025

285

14 Apr 2025

Aligning Anime Video Generation with Human Feedback

387

14 Apr 2025

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

281

13 Apr 2025

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

316

11 Apr 2025

TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video GenerationComputer Vision and Pattern Recognition (CVPR), 2025

287

11 Apr 2025