Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM VGen

Papers citing "Video Diffusion Models"

50 / 1,542 papers shown
Training-Free Multi-Style Fusion Through Reference-Based Adaptive Modulation
Xu Liu
Yibo Lu
Xinxian Wang
Xinyu Wu
DiffM
147
4
0
23 Sep 2025
OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment
Teng Xiao
Zuchao Li
Lefei Zhang
187
1
0
23 Sep 2025
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Pin-Yen Chiu
I-Sheng Fang
Jun-Cheng Chen
DiffM
135
0
0
23 Sep 2025
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective
S. Yu
Yuxin Chen
Hao Ju
Lianjie Jia
Fuxi Zhang
...
Lin Song
Lijun Wang
Yanwei Li
Y. Shan
Huchuan Lu
LRM
324
12
0
23 Sep 2025
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
Jinshu Chen
Xinghui Li
Xu Bai
Tianxiang Ma
Pengze Zhang
...
Gen Li
Lijie Liu
Songtao Zhao
Bingchuan Li
Qian He
DiffM VGen
173
2
0
22 Sep 2025
DiffQ: Unified Parameter Initialization for Variational Quantum Algorithms via Diffusion Models
Chi Zhang
Mengxin Zheng
Qian Lou
Fan Chen
DiffM
96
0
0
22 Sep 2025
VidCLearn: A Continual Learning Approach for Text-to-Video Generation
Luca Zanchetta
Lorenzo Papa
Luca Maiano
Irene Amerini
DiffM VGen
131
0
0
21 Sep 2025
$\mathtt{M^3VIR}$: A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation
Y. Li
Lebin Zhou
Nam Ling
Zhenghao Chen
Wei Wang
Wei Jiang
VGen
177
0
0
21 Sep 2025
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
Yue Ma
Zexuan Yan
Hongyu Liu
H. Wang
Heng Pan
...
H. Shum
Zhifeng Li
Wei Liu
Linfeng Zhang
Qifeng Chen
VGen
272
13
0
20 Sep 2025
SAMPO: Scale-wise Autoregression with Motion PrOmpt for generative world models
Sen Wang
Jingyi Tian
Le Wang
Zhimin Liao
Jiayi Li
Huaiyi Dong
Kun Xia
Sanping Zhou
Wei Tang
Hua Gang
VGen LRM
185
0
0
19 Sep 2025
OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data
Björn Möller
Zhengyang Li
Malte Stelzer
Thomas Graave
Fabian Bettels
Muaaz Ataya
Tim Fingscheidt
VGen
185
0
0
18 Sep 2025
Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Fangjinhua Wang
Qingshan Xu
Yew-Soon Ong
Marc Pollefeys
206
2
0
18 Sep 2025
BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching
Hanshuai Cui
Zhiqing Tang
Zhifei Xu
Zhi Yao
Wenyi Zeng
Weijia Jia
VGen
312
1
0
17 Sep 2025
Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Multi-Step Inference Degradation
Z. Chen
Zihao Guo
Peng Wang
ThankGod Itua Egbe
Yan Lyu
Chenghao Qian
105
0
0
16 Sep 2025
TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving
Jiawei Wang
Haowei Sun
Xintao Yan
Shuo Feng
Jun Gao
Henry X. Liu
184
4
0
16 Sep 2025
Data-Efficient Ensemble Weather Forecasting with Diffusion Models
Kevin Valencia
Ziyang Liu
Justin Cui
DiffM
199
0
0
14 Sep 2025
Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation
Yi-Ruei Liu
You-Zhe Xie
Yu-Hsiang Hsu
I-Sheng Fang
Yu-Lun Liu
Jun-Cheng Chen
VGen 3DGS
193
0
0
13 Sep 2025
Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining
Yasar Utku Alçalar
Junno Yun
Mehmet Akçakaya
DiffM MedIm
143
2
0
11 Sep 2025
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis
Yikang Ding
Jiwen Liu
Wenyuan Zhang
Z. Wang
Wentao Hu
...
Xiaohan Li
Ming Chen
Xiaoqiang Liu
Yu-Shen Liu
Pengfei Wan
VGen
193
7
0
11 Sep 2025
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Jenna Kang
Maria Silva
Patsorn Sangkloy
Kenneth Chen
Niall Williams
Qi Sun
EGVM VGen
155
0
0
10 Sep 2025
Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities
IEEE Open Journal of Vehicular Technology (JOVT), 2025
Rajendramayavan Sathyam
Yueqi Li
VLM LRM
180
3
0
10 Sep 2025
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
Duomin Wang
W. Zuo
Aojie Li
L. Chen
Xinyao Liao
Deyu Zhou
Zixin Yin
Xili Dai
Daxin Jiang
Gang Yu
DiffM VGen
145
16
0
07 Sep 2025
STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs
Han Liang
Jiahui Zhou
Zicheng Zhou
Xiaoxi Zhang
Xu Chen
DiffM
173
1
0
05 Sep 2025
Fitting Image Diffusion Models on Video Datasets
Juhun Lee
Simon S. Woo
VGen
104
0
0
04 Sep 2025
Scale-Adaptive Generative Flows for Multiscale Scientific Data
Yifan Chen
Eric Vanden-Eijnden
160
0
0
03 Sep 2025
Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling
Srinivas Anumasa
Barath Chandran.C
Tingting Chen
Dianbo Liu
DiffM
95
0
0
02 Sep 2025
Look Beyond: Two-Stage Scene View Generation via Panorama and Video Diffusion
Xueyang Kang
Zhengkang Xiang
Zezheng Zhang
Kourosh Khoshelham
DiffM VGen
127
0
0
31 Aug 2025
Visually Grounded Narratives: Reducing Cognitive Burden in Researcher-Participant Interaction
Runtong Wu
Jiayao Song
Fei Teng
Xianhao Ren
Yuyan Gao
Kailun Yang
144
0
0
30 Aug 2025
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Qiao Sun
Liujia Yang
Wei Tang
Wei Huang
Kaixin Xu
...
Tong He
Yilun Chen
Xili Dai
Nanyang Ye
Qinying Gu
VGen LM&Ro
416
1
0
28 Aug 2025
ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion
Nima Kondori
Hanwen Liang
H. Vaseli
Bingyu Xie
C. Luong
Purang Abolmaesumi
T. Tsang
Renjie Liao
MedIm
126
0
0
25 Aug 2025
On the Edge of Memorization in Diffusion Models
Sam Buchanan
Druv Pai
Yi-An Ma
Valentin De Bortoli
TDI
279
3
0
25 Aug 2025
Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation
Chun-Peng Chang
Chen-Yu Wang
Julian Schmidt
Holger Caesar
A. Pagani
VGen
265
1
0
22 Aug 2025
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang
Zhenyu Liao
Jingfeng Wu
Difan Zou
DiffM
189
1
0
22 Aug 2025
Scaling Group Inference for Diverse and High-Quality Generation
Gaurav Parmar
Or Patashnik
Daniil Ostashev
Kuan-Chieh Wang
Kfir Aberman
Srinivasa Narasimhan
Jun-Yan Zhu
181
2
0
21 Aug 2025
Data Auctions for Retrieval Augmented Generation
Minbiao Han
Seyed A. Esmaeili
Michael Albert
Haifeng Xu
188
0
0
21 Aug 2025
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation
Haonan Qiu
Ning Yu
Ziqi Huang
P. Debevec
Ziwei Liu
VGen
193
3
0
21 Aug 2025
MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Guile Wu
David Huang
Dongfeng Bai
Bingbing Liu
VGen
133
0
0
20 Aug 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
174
4
0
20 Aug 2025
InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Shaoshu Yang
Zhe Kong
Feng Gao
Meng Cheng
Xiangyu Liu
...
Zhuoliang Kang
Tong Lu
Xunliang Cai
Ran He
Xiaoming Wei
VGen
129
13
0
19 Aug 2025
4DNeX: Feed-Forward 4D Generative Modeling Made Easy
Zhaoxi Chen
Tianqi Liu
Long Zhuo
Jiawei Ren
Zeng Tao
He Zhu
Fangzhou Hong
Liang Pan
Ziwei Liu
VGen
147
11
0
18 Aug 2025
Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models
Jianshu Zeng
Yuxuan Liu
Yutong Feng
Chenxuan Miao
Zixiang Gao
Jiwang Qu
Jianzhang Zhang
Bin Wang
Kun Yuan
VGen
190
5
0
18 Aug 2025
EgoTwin: Dreaming Body and View in First Person
Jingqiao Xiu
Fangzhou Hong
Yicong Li
Mengze Li
Wentao Wang
Sirui Han
Liang Pan
Ziwei Liu
DiffM VGen
160
4
0
18 Aug 2025
CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis
Jiayi Wang
Hadrien Reynaud
Franciskus Xaverius Erick
Bernhard Kainz
DiffM MedIm VGen
109
0
0
18 Aug 2025
GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis
Sirshapan Mitra
Yogesh S Rawat
DiffM
165
0
0
18 Aug 2025
Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models
Xun Su
Jianming Huang
Yang Yusen
Zhongxi Fang
Hiroyuki Kasai
DiffM
167
1
0
17 Aug 2025
Projected Coupled Diffusion for Test-Time Constrained Joint Generation
Hao Luan
Yi Xian Goh
See-Kiong Ng
Chun Kai Ling
DiffM
216
1
0
14 Aug 2025
Diffusion is a code repair operator and generator
Mukul Singh
Gust Verbruggen
Vu Le
Sumit Gulwani
DiffM
81
0
0
14 Aug 2025
Integrating Reinforcement Learning with Visual Generative Models: Foundations and Advances
Yuanzhi Liang
Yijie Fang
Rui Li
Ziqi Ni
Ruijie Su
Chi Zhang
Xuelong Li
EGVM
334
2
0
14 Aug 2025
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Lingen Li
Guangzhi Wang
Zhaoyang Zhang
Yaowei Li
Xiaoyu Li
Qi Dou
Jinwei Gu
Tianfan Xue
Mingyu Ding
VGen
177
2
0
14 Aug 2025
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
Kelin Yu
Sheng Zhang
Harshit Soora
Furong Huang
Heng Huang
Erfaun Noorani
Ruohan Gao
VGen
99
4
0
14 Aug 2025