
Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018

Thomas Unterthiner

Sjoerd van Steenkiste

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown

Personalized Generation In Large Model Era: A Survey

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

551

04 Mar 2025

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

International Conference on Learning Representations (ICLR), 2025

357

02 Mar 2025

Learning to Animate Images from A Few Videos to Portray Delicate Human Actions

1.1K

01 Mar 2025

Unified Video Action Model

685

28 Feb 2025

WorldModelBench: Judging Video Generation Models As World Models

...

237

28 Feb 2025

Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos

400

28 Feb 2025

Mobius: Text to Seamless Looping Video Generation via Latent Shift

171

27 Feb 2025

C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

248

27 Feb 2025

Glad: A Streaming Scene Generator for Autonomous Driving

International Conference on Learning Representations (ICLR), 2025

291

26 Feb 2025

X-Dancer: Expressive Music to Human Dance Video Generation

322

24 Feb 2025

MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation

693

18 Feb 2025

MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction

Computer Vision and Pattern Recognition (CVPR), 2025

303

17 Feb 2025

SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

333

17 Feb 2025

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

405

12 Feb 2025

Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance

417

10 Feb 2025

Pre-Trained Video Generative Models as World Simulators

376

10 Feb 2025

History-Guided Video Diffusion

554

10 Feb 2025

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

1.2K

09 Feb 2025

MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation

...

663

03 Feb 2025

Improving Tropical Cyclone Forecasting With Video Diffusion Models

Zhibo Ren

Pritthijit Nath

Pancham Shukla

422

27 Jan 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation

Computer Vision and Pattern Recognition (CVPR), 2025

...

387

21 Jan 2025

BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations

Computer Vision and Pattern Recognition (CVPR), 2025

207

13 Jan 2025

MEt3R: Measuring Multi-View Consistency in Generated Images

Computer Vision and Pattern Recognition (CVPR), 2025

257

10 Jan 2025

VideoAuteur: Towards Long Narrative Video Generation

393

10 Jan 2025

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

Computer Vision and Pattern Recognition (CVPR), 2025

190

06 Jan 2025

Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation

296

02 Jan 2025

AKiRa: Augmentation Kit on Rays for optical video generation

Computer Vision and Pattern Recognition (CVPR), 2024

419

31 Dec 2024

DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes

AAAI Conference on Artificial Intelligence (AAAI), 2024

392

31 Dec 2024

Grid Diffusion Models for Text-to-Video Generation

Computer Vision and Pattern Recognition (CVPR), 2024

Taegyeong Lee

Soyeong Kwon

Taehwan Kim

313

31 Dec 2024

DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers

Yuntao Chen

Yuqi Wang

Rundong Wang

1.0K

24 Dec 2024

VidTwin: Video VAE with Decoupled Structure and Dynamics

Computer Vision and Pattern Recognition (CVPR), 2024

369

23 Dec 2024

Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy

AAAI Conference on Artificial Intelligence (AAAI), 2024

468

20 Dec 2024

Parallelized Autoregressive Visual Generation

Computer Vision and Pattern Recognition (CVPR), 2024

647

19 Dec 2024

AniDoc: Animation Creation Made Easier

Computer Vision and Pattern Recognition (CVPR), 2024

485

18 Dec 2024

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

Computer Vision and Pattern Recognition (CVPR), 2024

324

18 Dec 2024

SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video Generation

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024

435

18 Dec 2024

Can video generation replace cinematographers? Research on the cinematic language of generated video

...

388

16 Dec 2024

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Computer Vision and Pattern Recognition (CVPR), 2024

...

322

15 Dec 2024

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

...

874

15 Dec 2024

Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism

357

13 Dec 2024

OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation

171

12 Dec 2024

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

357

12 Dec 2024

Efficient Continuous Video Flow Model for Video Prediction

Gaurav Shrivastava

Abhinav Shrivastava

VGen

239

07 Dec 2024

DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models

297

05 Dec 2024

The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control

288

04 Dec 2024

Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention

349

04 Dec 2024

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation

334

02 Dec 2024

HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving

360

02 Dec 2024

Playable Game Generation

292

01 Dec 2024

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Computer Vision and Pattern Recognition (CVPR), 2024

545

01 Dec 2024