Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
    DiffM, VGen

Papers citing "Video Diffusion Models"

50 / 1,538 papers shown
PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference
Yang Yang
Severi Rissanen
Paul E. Chang
Nasrulloh Loka
Daolang Huang
Arno Solin
Markus Heinonen
Luigi Acerbi
140
0
0
15 Oct 2025
Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance
Jincheng Zhong
Boyuan Jiang
Xin Tao
Pengfei Wan
Kun Gai
Mingsheng Long
DiffM
104
0
0
14 Oct 2025
VIDMP3: Video Editing by Representing Motion with Pose and Position Priors
Sandeep Mishra
Oindrila Saha
A. Bovik
DiffM, VGen
122
0
0
14 Oct 2025
BIGFix: Bidirectional Image Generation with Token Fixing
Victor Besnier
David Hurych
Andrei Bursuc
Eduardo Valle
VGen
136
0
0
14 Oct 2025
LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference
Jianhao Yuan
Fabio Pizzati
Francesco Pinto
Lars Kunze
Ivan Laptev
Paul Newman
Philip Torr
D. Martini
DiffM, VGen
163
1
0
13 Oct 2025
Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
Jiayuan Sheng
Hanyang Zhao
Haoxian Chen
David Yao
Wenpin Tang
139
0
0
12 Oct 2025
Multi-Scale Diffusion Transformer for Jointly Simulating User Mobility and Mobile Traffic Pattern
Ziyi Liu
Qingyue Long
Zhiwen Xue
Huandong Wang
Yong Li
76
0
0
11 Oct 2025
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
Wuyang Li
W. Pan
Po-Chien Luan
Yang Gao
Alexandre Alahi
DiffM, VGen
146
7
0
10 Oct 2025
TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
Leigang Qu
Ziyang Wang
Na Zheng
Wenjie Wang
Liqiang Nie
Tat-Seng Chua
166
1
0
09 Oct 2025
An approach for systematic decomposition of complex LLM tasks
Tianle Zhou
Jiakai Xu
G. Liu
Jiaxiang Liu
Haonan Wang
Eugene Wu
147
0
0
09 Oct 2025
A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking
Gal Fadlon
Idan Arbiv
Nimrod Berman
Omri Azencot
DiffM, MedIm
157
2
0
08 Oct 2025
Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications
IEEE Access, 2025
Kento Kawaharazuka
Jihoon Oh
Jun Yamada
Ingmar Posner
Yuke Zhu
LM&Ro
259
24
0
08 Oct 2025
Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models
Jiahao Wang
Zhenpei Yang
Yijing Bai
Yingwei Li
Yuliang Zou
...
Zehao Zhu
Jyh-Jing Hwang
Dragomir Anguelov
Mingxing Tan
C. Jiang
VGen
101
0
0
07 Oct 2025
Riddled basin geometry sets fundamental limits to predictability and reproducibility in deep learning
Andrew Ly
Pulin Gong
AI4CE
180
0
0
07 Oct 2025
Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
Danush Kumar Venkatesh
Adam Schmidt
Muhammad Abdullah Jamal
Omid Mohareri
VGen, MedIm
142
0
0
07 Oct 2025
Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion
Satoshi Hayakawa
Yuhta Takida
Masaaki Imaizumi
Hiromi Wakaki
Yuki Mitsufuji
DiffM
335
0
0
06 Oct 2025
Learning Robust Diffusion Models from Imprecise Supervision
Dong-Dong Wu
Jiacheng Cui
Wei Wang
Zhiqiang She
Masashi Sugiyama
DiffM
336
0
0
03 Oct 2025
What Drives Compositional Generalization in Visual Generative Models?
Karim Farid
Rajat Sahay
Yumna Ali Alnaggar
Simon Schrodi
Volker Fischer
Cordelia Schmid
Thomas Brox
CoGe
313
0
0
03 Oct 2025
Learning to Generate Rigid Body Interactions with Video Diffusion Models
David Romero
Ariana Bermúdez
Hao Li
Fabio Pizzati
Ivan Laptev
DiffM, VGen
444
0
0
02 Oct 2025
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Justin Cui
Jie Wu
Ming Li
Tao Yang
Xiaojie Li
Rui Wang
Andrew Bai
Yuanhao Ban
Cho-Jui Hsieh
DiffM, VGen
225
27
0
02 Oct 2025
LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration
Alessio Spagnoletti
Andrés Almansa
Marcelo Pereyra
DiffM, VGen
171
0
0
01 Oct 2025
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators
Hengtao Li
Pengxiang Ding
Runze Suo
Yihao Wang
Zirui Ge
...
Kexian Yu
Mingyang Sun
Hongyin Zhang
Donglin Wang
Weihua Su
140
6
0
01 Oct 2025
Code2Video: A Code-centric Paradigm for Educational Video Generation
Yanzhe Chen
Kevin Qinghong Lin
Mike Zheng Shou
VGen
138
0
0
01 Oct 2025
Diffusion Alignment as Variational Expectation-Maximization
Jaewoo Lee
Minsu Kim
S. Choi
Inhyuck Song
Sujin Yun
Hyeongyu Kang
Woocheol Shin
Taeyoung Yun
Kiyoung Om
Jinkyoo Park
107
0
0
01 Oct 2025
PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection
Tuan Nguyen
Naseem Khan
Khang Tran
Nhathai Phan
Issa M. Khalil
170
0
0
30 Sep 2025
Contrastive Diffusion Guidance for Spatial Inverse Problems
Sattwik Basu
Chaitanya Amballa
Zhongweiyang Xu
Jorge Vančo Sampedro
Srihari Nelakuditi
Romit Roy Choudhury
88
0
0
30 Sep 2025
3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation
International Conference on 3D Vision (3DV), 2025
Balamurugan Thambiraja
Malte Prinzler
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM, VGen
152
1
0
30 Sep 2025
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
Guanxi Lu
Hao Mark Chen
Yuto Karashima
Zhican Wang
Daichi Fujiki
Hongxiang Fan
AI4CE
106
4
0
30 Sep 2025
VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing
Abdelilah Aitrouga
Youssef Hmamouche
Amal El Fallah Seghrouchni
VGen
214
0
0
30 Sep 2025
Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis
Kaizhen Zhu
Mokai Pan
Zhechuan Yu
Jingya Wang
Jingyi Yu
Ye-ling Shi
DiffM
201
2
0
29 Sep 2025
UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark
Ailing Zhang
Lina Lei
Dehong Kong
Zhixin Wang
Jiaqi Xu
Fenglong Song
Chun-Le Guo
Chang Liu
Fan Li
Jie Chen
VGen
89
3
0
29 Sep 2025
Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility
Yutong Hao
Chen Chen
Ajmal Saeed Mian
Chang Xu
Daochang Liu
DiffM, VGen
140
3
0
29 Sep 2025
UniVid: The Open-Source Unified Video Model
Jiabin Luo
Junhui Lin
Zeyu Zhang
Biao Wu
Meng Fang
Ling-Hao Chen
Hao Tang
VGen
276
7
0
29 Sep 2025
Diff-3DCap: Shape Captioning with Diffusion Models
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Zhenyu Shu
Jiawei Wen
Shiyang Li
Shiqing Xin
Ligang Liu
DiffM
123
0
0
28 Sep 2025
Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution
Jinpei Guo
Yifei Ji
Z. Chen
Yufei Wang
Sizhuo Ma
Yong Guo
Yulun Zhang
Jian Wang
188
0
0
28 Sep 2025
Autoregressive Video Generation beyond Next Frames Prediction
Sucheng Ren
C. L. Philip Chen
Zhenbang Wang
Liangchen Song
Xiangxin Zhu
Alan Yuille
Y. Yang
Jiasen Lu
VGen
166
2
0
28 Sep 2025
CREPE: Controlling Diffusion with Replica Exchange
Jiajun He
Paul Jeha
Peter Potaptchik
Leo Zhang
José Miguel Hernández-Lobato
Yuanqi Du
Saifuddin Syed
Francisco Vargas
DiffM
97
0
0
27 Sep 2025
ARSS: Taming Decoder-only Autoregressive Visual Generation for View Synthesis From Single View
Wenbin Teng
Gonglin Chen
Haiwei Chen
Yajie Zhao
DiffM, VGen
154
0
0
27 Sep 2025
d$^2$Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching
Yuchu Jiang
Yue Cai
Xiangzhong Luo
Jiale Fu
Jiarui Wang
Chonghan Liu
Xu Yang
93
6
0
27 Sep 2025
JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation
Guillem Capellera
Luis Ferraz
Antonio Rubio
Alexandre Alahi
Antonio Agudo
124
0
0
26 Sep 2025
High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling
Chao Huang
Susan Liang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
131
0
0
26 Sep 2025
Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers
Jibin Song
Mingi Kwon
Jaeseok Jeong
Youngjung Uh
DiffM, VGen
1.4K
0
0
26 Sep 2025
LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE
Yu Shang
Lei Jin
Yiding Ma
Xin Zhang
Chen Gao
Wei Wu
Yong Li
DiffM, VGen
148
1
0
26 Sep 2025
X-Streamer: Unified Human World Modeling with Audiovisual Interaction
You Xie
Tianpei Gu
Zenan Li
Chenxu Zhang
Guoxian Song
Xiaochen Zhao
C. Liang
Jianwen Jiang
Hongyi Xu
Linjie Luo
VGen
181
3
0
25 Sep 2025
DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models
Yinuo Ren
Wenhao Gao
Lexing Ying
Grant M. Rotskoff
Jiequn Han
184
3
0
25 Sep 2025
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
Yu Yuan
Xijun Wang
Tharindu Wickremasinghe
Zeeshan Nadir
Bole Ma
Stanley H. Chan
DiffM, VGen, PINN
1.5K
8
0
25 Sep 2025
PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models
Mingze Yuan
Pengfei Jin
Na Li
Shijie Zhao
AI4CE
139
0
0
24 Sep 2025
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
Chen Wang
Chuhao Chen
Yiming Huang
Zhiyang Dou
Yuan Liu
Jiatao Gu
Lingjie Liu
DiffM, VGen, PINN
619
9
0
24 Sep 2025
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Pin-Yen Chiu
I-Sheng Fang
Jun-Cheng Chen
DiffM
120
0
0
23 Sep 2025
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective
S. Yu
Yuxin Chen
Hao Ju
Lianjie Jia
Fuxi Zhang
...
Lin Song
Lijun Wang
Yanwei Li
Y. Shan
Huchuan Lu
LRM
319
9
0
23 Sep 2025