Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018

Thomas Unterthiner

Sjoerd van Steenkiste

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown

VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework

150

11 Oct 2025

iMoWM: Taming Interactive Multi-Modal World Model for Robotic Manipulation

100

10 Oct 2025

CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

194

09 Oct 2025

Real-Time Motion-Controllable Autoregressive Video Diffusion

221

09 Oct 2025

FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control

244

09 Oct 2025

An approach for systematic decomposition of complex llm tasks

147

09 Oct 2025

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

09 Oct 2025

AVO: Amortized Value Optimization for Contact Mode Switching in Multi-Finger Manipulation

Adam Hung

Fan Yang

Abhinav Kumar

Sergio Aguilera Marinovic

Soshi Iba

Rana Soltani Zarrin

Dmitry Berenson

112

08 Oct 2025

MATRIX: Mask Track Alignment for Interaction-aware Video Generation

106

08 Oct 2025

Split Conformal Classification with Unsupervised Calibration

Santiago Mazuelas

209

08 Oct 2025

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation

129

08 Oct 2025

Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime

131

07 Oct 2025

Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models

...

101

07 Oct 2025

ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model

06 Oct 2025

Bridging Text and Video Generation: A Survey

264

06 Oct 2025

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

256

03 Oct 2025

Streaming Drag-Oriented Interactive Video Manipulation: Drag Anything, Anytime!

234

03 Oct 2025

Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks

Bruno Corcuera

Carlos Eiras-Franco

Brais Cancela

108

02 Oct 2025

Arbitrary Generative Video Interpolation

143

01 Oct 2025

EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

...

129

01 Oct 2025

Stable Cinemetrics: Structured Taxonomy and Evaluation for Professional Video Generation

137

30 Sep 2025

UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark

29 Sep 2025

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

...

174

29 Sep 2025

FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation

200

29 Sep 2025

Fidelity-Aware Data Composition for Robust Robot Generalization

136

29 Sep 2025

NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis

120

29 Sep 2025

PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos

29 Sep 2025

Reinforcement Learning with Inverse Rewards for World Model Post-training

159

28 Sep 2025

WoW: Towards a World omniscient World model Through Embodied Interaction

...

160

26 Sep 2025

StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing

193

26 Sep 2025

Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers

1.4K

26 Sep 2025

Physically Plausible Multi-System Trajectory Generation and Symmetry Discovery

Jiayin Liu

Yulong Yang

Vineet Bansal

Christine Allen-Blanchette

115

26 Sep 2025

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation

...

140

26 Sep 2025

Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs

...

141

26 Sep 2025

What Happens Next? Anticipating Future Motion by Generating Point Trajectories

112

25 Sep 2025

MotionFlow: Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation

25 Sep 2025

CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion

114

24 Sep 2025

World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation

105

23 Sep 2025

Echo-Path: Pathology-Conditioned Echo Video Generation

Kabir Hamzah Muhammad

21 Sep 2025

$\mathtt{M^3VIR}$: A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation

161

21 Sep 2025

Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation

...

263

20 Sep 2025

Neural Atlas Graphs for Dynamic Scene Decomposition and Editing

Jan Philipp Schneider

197

19 Sep 2025

SAMPO: Scale-wise Autoregression with Motion PrOmpt for generative world models

175

19 Sep 2025

WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance

261

18 Sep 2025

Wan-Animate: Unified Character Animation and Replacement with Holistic Replication

...

234

17 Sep 2025

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

...

272

15 Sep 2025

HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments

J. Karras

Yingwei Li

Yasamin Jafarian

Ira Kemelmacher-Shlizerman

129

15 Sep 2025

Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders

129

11 Sep 2025

LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations

204

10 Sep 2025

GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts

144

10 Sep 2025