ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.03458
  4. Cited By
Video Diffusion Models
v1v2 (latest)

Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
    DiffMVGen
ArXiv (abs)PDFHTMLHuggingFace (5 upvotes)

Papers citing "Video Diffusion Models"

50 / 1,556 papers shown
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
Shiao Wang
Xinyu Wang
Bo Jiang
Lin Zhu
G. Li
Longji Xu
Yonghong Tian
Jin Tang
748
1
0
08 Apr 2025
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Mengchao Wang
Qiang Wang
Fan Jiang
Yaqi Fan
Yunpeng Zhang
Yonggang Qi
Kun Zhao
Mu Xu
DiffMVGen
286
56
0
07 Apr 2025
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Xuyang Guo
Zekai Huang
Jiayan Huo
Yingyu Liang
Zhenmei Shi
Zhao Song
Jiahao Zhang
ALMVGen
660
14
0
05 Apr 2025
Multi-identity Human Image Animation with Structural Video Diffusion
Multi-identity Human Image Animation with Structural Video Diffusion
Zhenzhi Wang
Yongqian Li
Yanhong Zeng
Yuwei Guo
Dahua Lin
Tianfan Xue
Bo Dai
VGen
331
7
0
05 Apr 2025
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
Chuning Zhu
Raymond Yu
S. Feng
Benjamin Burchfiel
Paarth Shah
Abhishek Gupta
VGen
557
68
0
03 Apr 2025
MG-Gen: Single Image to Motion Graphics Generation
MG-Gen: Single Image to Motion Graphics Generation
Takahiro Shirakawa
Tomoyuki Suzuki
Takuto Narumoto
Daichi Haraguchi
VGen
695
0
0
03 Apr 2025
Autonomous Human-Robot Interaction via Operator Imitation
Autonomous Human-Robot Interaction via Operator Imitation
Sammy Christen
David Müller
Agon Serifi
Ruben Grandia
Georg Wiedebach
Michael A. Hopkins
Espen Knoop
Moritz Bächer
LM&Ro
314
2
0
03 Apr 2025
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and HarmonizationComputer Vision and Pattern Recognition (CVPR), 2025
Jiadong Wang
Jingyuan Liu
Xin Sun
Krishna Kumar Singh
Zhixin Shu
...
Nanxuan Zhao
Tuanfeng Y. Wang
Simon Chen
Ulrich Neumann
Jae Shin Yoon
359
5
0
03 Apr 2025
OmniCam: Unified Multimodal Video Generation via Camera Control
OmniCam: Unified Multimodal Video Generation via Camera Control
Xiaoda Yang
Jiayang Xu
Kaixuan Luan
Xinyu Zhan
Hongshun Qiu
...
Shuai Yang
Li Zhang
Checheng Yu
Cewu Lu
Lixin Yang
DiffMVGen
318
7
0
03 Apr 2025
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression
Random Conditioning with Distillation for Data-Efficient Diffusion Model CompressionComputer Vision and Pattern Recognition (CVPR), 2025
Dohyun Kim
S. Park
Geonhee Han
Seung Wook Kim
Paul Hongsuck Seo
DiffM
379
1
0
02 Apr 2025
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE SolutionsInternational Conference on Learning Representations (ICLR), 2025
Jinyoung Choi
Junoh Kang
Bohyung Han
213
3
0
02 Apr 2025
Hyperbolic Diffusion Recommender Model
Hyperbolic Diffusion Recommender ModelThe Web Conference (WWW), 2025
Meng Yuan
Yutian Xiao
Wei Chen
Chu Zhao
Deqing Wang
Fuzhen Zhuang
372
12
0
02 Apr 2025
FreSca: Scaling in Frequency Space Enhances Diffusion Models
FreSca: Scaling in Frequency Space Enhances Diffusion Models
Chao Huang
Susan Liang
Yunlong Tang
Li Ma
Yapeng Tian
Chenliang Xu
Chenliang Xu
DiffM
284
1
0
02 Apr 2025
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion ModelInternational Conference on Learning Representations (ICLR), 2025
Jincheng Zhong
Xiangcheng Zhang
Chao Guo
Mingsheng Long
306
4
0
02 Apr 2025
Can Test-Time Scaling Improve World Foundation Model?
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong
Hanqing Zhu
Peihao Wang
Bangya Liu
Dejia Xu
Kevin Wang
David Z. Pan
Yan Wang
Zhiwen Fan
Ziyi Wang
423
7
0
31 Mar 2025
MoCha: Towards Movie-Grade Talking Character Synthesis
MoCha: Towards Movie-Grade Talking Character Synthesis
Cong Wei
Bo Sun
Haoyu Ma
Ji Hou
F. Xu
...
Kunpeng Li
Tingbo Hou
Animesh Sinha
Peter Vajda
Lei Ma
VGen
837
24
0
30 Mar 2025
SketchVideo: Sketch-based Video Generation and Editing
SketchVideo: Sketch-based Video Generation and EditingComputer Vision and Pattern Recognition (CVPR), 2025
Feng-Lin Liu
Hongbo Fu
Xintao Wang
Weicai Ye
Pengfei Wan
Di Zhang
Lin Gao
DiffMVGen
387
11
0
30 Mar 2025
Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models
Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics ModelsIEEE International Conference on Robotics and Automation (ICRA), 2025
Haonan Chen
Jiaming Xu
Lily Sheng
Tianchen Ji
Shuijing Liu
Yunzhu Li
Katherine Driggs-Campbell
429
9
0
30 Mar 2025
CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving
CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving
Yishen Ji
Ziyue Zhu
Zhenxin Zhu
Kaixin Xiong
Jiaying Ying
Zhiqi Li
Lijun Zhou
Haiyang Sun
Bing Wang
Tong Lu
VGen
315
7
0
28 Mar 2025
EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation
EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation
Hadrien Reynaud
Alberto Gomez
Paul Leeson
Qingjie Meng
Bernhard Kainz
MedIm
247
6
0
28 Mar 2025
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
Mono2Stereo: A Benchmark and Empirical Study for Stereo ConversionComputer Vision and Pattern Recognition (CVPR), 2025
S. Yu
Yuxin Chen
Chen Ma
Zeke Xie
Yifan Wang
Lijun Wang
Mingyu Ding
Huchuan Lu
258
3
0
28 Mar 2025
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
SyncSDE: A Probabilistic Framework for Diffusion SynchronizationComputer Vision and Pattern Recognition (CVPR), 2025
Hyunjun Lee
Hyunsoo Lee
Sookwan Han
DiffM
514
1
0
27 Mar 2025
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Chi-Pin Huang
Yen-Siang Wu
Hung-Kai Chung
Kai-Po Chang
Fu-En Yang
Yu-Jie Wang
DiffMVGen
366
11
0
27 Mar 2025
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
Jinwei Li
Huan-ang Gao
Wenyi Li
Haohan Chi
Chenyu Liu
...
Yao Yao
Jingwei Zhao
Hongyang Li
Yikai Wang
Hao Zhao
390
3
0
26 Mar 2025
Guiding Human-Object Interactions with Rich Geometry and Relations
Guiding Human-Object Interactions with Rich Geometry and RelationsComputer Vision and Pattern Recognition (CVPR), 2025
Mengqing Xue
Yifei Liu
Ling Guo
Shaoli Huang
Changxing Ding
300
9
0
26 Mar 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
Jiale Cheng
Ruiliang Lyu
Xiaohan Zhang
Xiao-Chang Liu
Jiazheng Xu
...
Zhuoyi Yang
Yuxiao Dong
Jie Tang
Han Wang
Minlie Huang
VGen
345
16
0
26 Mar 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
592
6
0
26 Mar 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
T. Liu
Longxiang Zhang
Zhaoxi Chen
Guangcong Wang
Shoukang Hu
Liao Shen
Huiqiang Sun
Z. Cao
Wei Li
Ziwei Liu
VGen3DGS
551
20
0
26 Mar 2025
Debiasing Kernel-Based Generative Models
Debiasing Kernel-Based Generative Models
Tian Qin
Wei-Min Huang
402
0
0
26 Mar 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Yufei Cai
Hu Han
Yuxiang Wei
Shiguang Shan
Xilin Chen
DiffMVGen
308
2
0
25 Mar 2025
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Fernando Julio Cendra
Kai Han
VLM
493
1
0
25 Mar 2025
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
Qiusheng Huang
Xiaohui Zhong
Xu Fan
Lei Chen
Hao Li
AI4TSAI4CE
359
2
0
25 Mar 2025
Long-Context Autoregressive Video Modeling with Next-Frame Prediction
Long-Context Autoregressive Video Modeling with Next-Frame Prediction
Yuchao Gu
Weijia Mao
Mike Zheng Shou
VGen
611
84
0
25 Mar 2025
Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Zhiyao Ren
Yibing Zhan
B. Yu
Dacheng Tao
DiffM
384
2
0
25 Mar 2025
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Haiyu Zhang
Xinyuan Chen
Yaohui Wang
Xihui Liu
Yunhong Wang
Yu Qiao
VGen
331
9
0
25 Mar 2025
Target-Aware Video Diffusion Models
Target-Aware Video Diffusion Models
Taeksoo Kim
Hanbyul Joo
DiffMVGen
562
5
0
24 Mar 2025
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
Qiang Qu
Ming Li
Xiaoming Chen
Tongliang Liu
DiffMVGen
398
3
0
24 Mar 2025
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment AnimationBritish Machine Vision Conference (BMVC), 2025
R. Vidaurre
Elena Garces
Dan Casas
DiffMAI4CE
332
1
0
24 Mar 2025
LongDiff: Training-Free Long Video Generation in One Go
LongDiff: Training-Free Long Video Generation in One GoComputer Vision and Pattern Recognition (CVPR), 2025
Zhuoling Li
Hossein Rahmani
Qiuhong Ke
Jing Liu
DiffMVGenVLM
333
6
0
23 Mar 2025
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
Xuewei Chen
Zhimin Chen
Yiren Song
VGen
459
20
0
23 Mar 2025
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia
David Bourgin
Krishna Kumar Singh
Yuheng Li
Yan Kang
Zhan Xu
N. Jha
Yixiao Liu
DiffMVGen
453
1
0
21 Mar 2025
Enabling Versatile Controls for Video Diffusion Models
Enabling Versatile Controls for Video Diffusion Models
Xu Zhang
Hao Zhou
Haoming Qin
Xiaobin Lu
Jiaxing Yan
Guanzhong Wang
Zeyu Chen
Yi Liu
DiffMVGen
316
4
0
21 Mar 2025
Bezier Distillation
Bezier Distillation
Ling Feng
SK Yang
171
0
0
20 Mar 2025
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
Leyang Wang
Joice Lin
DiffM
300
0
0
20 Mar 2025
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
Hyojun Go
Byeongjun Park
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
3DGSVGen
455
11
0
20 Mar 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Junjun He
Zongyuan Ge
Imran Razzak
DiffMVGen
962
11
0
20 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Jingdong Sun
Zuxuan Wu
VGen
503
28
0
20 Mar 2025
Text-Driven Diffusion Model for Sign Language Production
Text-Driven Diffusion Model for Sign Language Production
J. He
Xu Wang
Ruobei Zhang
Shengeng Tang
Yijiao Wang
Lechao Cheng
DiffM
385
5
0
20 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGSVGen
670
35
0
20 Mar 2025
Temporal Regularization Makes Your Video Generator Stronger
Temporal Regularization Makes Your Video Generator Stronger
Harold Haodong Chen
Haojian Huang
Xianfeng Wu
Yexin Liu
Yajing Bai
Wen-Jie Shu
Harry Yang
Ser-Nam Lim
VGen
411
9
0
19 Mar 2025
Previous
123...8910...303132
Next
Page 9 of 32
Pageof 32