NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023
22 March 2023
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
Minheng Ni
Zhengyuan Yang
Linjie Li
Shuguang Liu
Fan Yang
Jianlong Fu
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
    VGen

Papers citing "NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation"

50 / 77 papers shown
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Qinghe Wang
Xiaoyu Shi
Baolu Li
Weikang Bian
Quande Liu
Huchuan Lu
Xintao Wang
Pengfei Wan
Kun Gai
Xu Jia
VGen
204
2
0
02 Dec 2025
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi
Pragati Jaiswal
T. Habtegebrial
Christen Millerdurai
Shaoxiang Wang
A. Pagani
Didier Stricker
DiffM, VGen
134
0
0
30 Nov 2025
Flow and Depth Assisted Video Prediction with Latent Transformer
Eliyas Suleyman
Paul Henderson
Eksan Firkat
Nicolas Pugeault
149
0
0
20 Nov 2025
TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction
Yukuo Ma
Cong Liu
Junke Wang
J. Liu
Haibin Huang
Zuxuan Wu
C. Zhang
Xuelong Li
VGen
114
0
0
16 Nov 2025
A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates
Wei-Cheng Lee
Francesco Orabona
123
17
0
14 Nov 2025
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Junyoung Seo
Rodrigo Mira
A. Haliassos
Stella Bounareli
Honglie Chen
Linh Tran
Seungryong Kim
Zoe Landgraf
Jie Shen
VGen
152
1
0
27 Oct 2025
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation
Weinan Jia
Yuning Lu
Mengqi Huang
Hualiang Wang
Binyuan Huang
Nan Chen
Mu Liu
Jidong Jiang
Zhendong Mao
VGen, VLM
116
3
0
21 Oct 2025
Terra: Explorable Native 3D World Model with Point Latents
Yuanhui Huang
Weiliang Chen
Wenzhao Zheng
Xin Tao
Pengfei Wan
Jie Zhou
Jiwen Lu
VGen
126
1
0
16 Oct 2025
Arbitrary Generative Video Interpolation
Guozhen Zhang
Haiguang Wang
C. Wang
Yuan Zhou
Qinglin Lu
Limin Wang
VGen
148
0
0
01 Oct 2025
LongLive: Real-time Interactive Long Video Generation
Shuai Yang
Wei Huang
Ruihang Chu
Yicheng Xiao
Yuyang Zhao
...
Enze Xie
Yihao Chen
Yao Lu
Song Han
Yukang Chen
DiffM, VGen, VLM
241
30
0
26 Sep 2025
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective
S. Yu
Yuxin Chen
Hao Ju
Lianjie Jia
Fuxi Zhang
...
Lin Song
Lijun Wang
Yanwei Li
Y. Shan
Huchuan Lu
LRM
319
9
0
23 Sep 2025
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
Zhiheng Liu
XueQing Deng
Shoufa Chen
Angtian Wang
Qiushan Guo
Mingfei Han
Zeyue Xue
M. Chen
Ping Luo
Linjie Yang
DiffM, VGen
168
4
0
21 Aug 2025
PersonaVlog: Personalized Multimodal Vlog Generation with Multi-Agent Collaboration and Iterative Self-Correction
Xiaolu Hou
Bing Ma
Jiaxiang Cheng
Xuhua Ren
Kai Yu
Wenyue Li
Tianxiang Zheng
Qinglin Lu
DiffM, VGen
129
0
0
19 Aug 2025
Matrix-game 2.0: An open-source real-time and streaming interactive world model
Xianglong He
Chunli Peng
Zexiang Liu
Boyang Wang
Yifan Zhang
...
Wei Li
Xuchen Song
Wenshu Fan
Eric Li
Yahui Zhou
VGen
310
26
0
18 Aug 2025
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
Qian Wang
Z. Huang
Ruoxi Jia
P. Debevec
Ning Yu
DiffM, VGen
307
1
0
11 Aug 2025
StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation
S. Tu
Yueming Pan
Y. Huang
Xintong Han
Zhen Xing
Jingdong Sun
Chong Luo
Zuxuan Wu
Yu-Gang Jiang
VGen
164
15
0
11 Aug 2025
Enhancing Scene Transition Awareness in Video Generation via Post-Training
Hanwen Shen
Jiajie Lu
Yupeng Cao
Xiaonan Yang
VGen
153
0
0
24 Jul 2025
NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation
X. Feng
H. Yu
M. Wu
Shuyan Hu
J. Chen
C. Zhu
J. Wu
X. Chu
K. Huang
DiffM, EGVM, VGen
549
6
0
15 Jul 2025
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution
Liangbin Xie
Yu Li
Shian Du
Menghan Xia
Xintao Wang
Fanghua Yu
Ziyan Chen
Pengfei Wan
Jiantao Zhou
Chao Dong
DiffM, VGen, SupR
405
1
0
24 Jun 2025
STAGE: A Stream-Centric Generative World Model for Long-Horizon Driving-Scene Simulation
Jiamin Wang
Yichen Yao
Xiang Feng
Hang Wu
Yaming Wang
Qingqiu Huang
Y. Ma
Xinge Zhu
VGen
317
3
0
16 Jun 2025
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation
Tianyu Huang
Wangguandong Zheng
Tengfei Wang
Yuhao Liu
Zhenwei Wang
...
Jie Jiang
Hui Li
Rynson W. H. Lau
W. Zuo
Chunchao Guo
VGen
326
27
0
04 Jun 2025
Physics-Guided Motion Loss for Video Generation Model
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
DiffM, VGen
167
0
0
02 Jun 2025
A Survey of Generative Categories and Techniques in Multimodal Generative Models
Longzhen Han
Awes Mubarak
Almas Baimagambetov
Nikolaos Polatidis
Thar Baker
LRM
399
0
0
29 May 2025
ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos
Xiaodong Wang
Peixi Peng
VGen
1.3K
1
0
24 May 2025
Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts
Taewon Kang
Ming C. Lin
DiffM, VGen
389
1
0
22 May 2025
EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation
Hanle Zheng
Xujie Han
Zegang Peng
Shangbin Zhang
Guangxun Du
Zhuo Zou
Xiang Wang
Jibin Wu
Hao Guo
Lei Deng
DiffM, VGen
260
1
0
13 May 2025
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Computer Vision and Pattern Recognition (CVPR), 2025
Jiangtong Tan
Hu Yu
Jie Huang
Jie Xiao
Feng Zhao
329
5
0
02 May 2025
Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
Lvmin Zhang
S. Cai
Muyang Li
Gordon Wetzstein
Maneesh Agrawala
DiffM, VGen
534
43
0
17 Apr 2025
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi
Jiadong Wang
Yuanzhi Liang
Xi Qiu
Yuchi Huo
Ruiqi Wang
Fangqiu Yi
Xuzhao Li
DiffM, VGen
601
10
0
15 Apr 2025
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
Xingrui Wang
Jiang-Long Liu
Liang Luo
Xiaodong Yu
Jialian Wu
Xingwu Sun
Yusheng Su
Yaoyao Liu
Zicheng Liu
Emad Barsoum
DiffM, VGen
281
4
0
13 Apr 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Zongyuan Ge
Imran Razzak
DiffM, VGen
827
9
0
20 Mar 2025
EQ-TAA: Equivariant Traffic Accident Anticipation via Diffusion-Based Accident Video Synthesis
Jianwu Fang
Lei-lei Li
Zhedong Zheng
Hongkai Yu
Jianru Xue
Zhengguo Li
Tat-Seng Chua
241
0
0
16 Mar 2025
Text2Story: Advancing Video Storytelling with Text Guidance
Taewon Kang
D. Kothandaraman
Ming C. Lin
DiffM, VGen
416
3
0
08 Mar 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Computer Vision and Pattern Recognition (CVPR), 2025
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
Maja Pantic
DiffM, VGen
357
8
0
03 Mar 2025
ASurvey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
Junlin Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVM, VGen, AI4TS
274
0
0
25 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM, VGen
693
6
0
18 Feb 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Yuanyang Yin
Yaqi Zhao
Mingwu Zheng
Ke Lin
Jiarong Ou
...
Pengfei Wan
Di Zhang
Baoqun Yin
Wentao Zhang
Kun Gai
437
9
0
03 Jan 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
332
19
0
31 Dec 2024
Grid Diffusion Models for Text-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Taegyeong Lee
Soyeong Kwon
Taehwan Kim
313
19
0
31 Dec 2024
Enhancing Long Video Generation Consistency without Tuning
Xingyao Li
Fengzhuo Zhang
Jiachun Pan
Yunlong Hou
Vincent Y. F. Tan
Zhuoran Yang
DiffM, VGen
325
0
0
23 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen, DiffM
882
7
0
14 Dec 2024
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Computer Vision and Pattern Recognition (CVPR), 2024
Xiaozhong Ji
Xiaobin Hu
Zhihong Xu
Junwei Zhu
Chuming Lin
...
Donghao Luo
Yi Chen
Qin Lin
Qinglin Lu
Chengjie Wang
VGen
411
47
0
25 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guoquan Zheng
EGVM
533
11
0
25 Nov 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM, VGen
445
13
0
22 Nov 2024
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation
International Conference on Learning Representations (ICLR), 2024
Zongyi Li
Shujie Hu
Shujie Liu
Long Zhou
Jeongsoo Choi
Lingwei Meng
Xun Guo
Jiajian Li
H. Ling
Furu Wei
VGen, DiffM
426
26
0
27 Oct 2024
EVA: An Embodied World Model for Future Video Anticipation
Yatian Wang
Hengyuan Zhang
Chun-Kai Fan
Xingqun Qi
Rongyu Zhang
...
Chi-Min Chan
Wei Xue
Wenhan Luo
Shanghang Zhang
Wenhan Luo
VGen
235
18
0
20 Oct 2024
Progressive Autoregressive Video Diffusion Models
Desai Xie
Zhan Xu
Yicong Hong
Hao Tan
Difan Liu
Feng Liu
Arie E. Kaufman
Yang Zhou
DiffM, VGen
314
39
0
10 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
375
68
0
03 Oct 2024
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
ACM Transactions on Graphics (TOG), 2024
Zhitong Huang
Mohan Zhang
Jing Liao
DiffM, VGen
305
24
0
19 Sep 2024
DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving
Yongjie Fu
Anmol Jain
Xuan Di
Xu Chen
Chengbo Zang
VGen
218
10
0
29 Aug 2024