DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

16 August 2023
Shengming Yin
Chenfei Wu
Jian Liang
Jie Shi
Houqiang Li
Gong Ming
Nan Duan
    VGen
arXiv: 2308.08089 | HuggingFace: 22 upvotes

Papers citing "DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory"

50 / 147 papers shown
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
Yiming Wang
Qihang Zhang
S. Cai
Tong Wu
Jan Ackermann
Zhengfei Kuang
Yang Zheng
Frano Rajič
Siyu Tang
Gordon Wetzstein
DiffM, VGen
252
3
0
04 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffM, VGen, 3DPC
357
4
0
01 Dec 2025
DisMo: Disentangled Motion Representations for Open-World Motion Transfer
Thomas Ressler-Antal
Frank Fundel
Malek Ben Alaya
S. A. Baumann
Felix Krause
Ming Gui
Bjorn Ommer
DiffM, VGen
147
0
0
28 Nov 2025
Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance
Haoxuan Wang
Jiachen Tao
Junyi Wu
Gaowen Liu
Ramana Rao Kompella
Yan Yan
VGen
229
0
0
25 Nov 2025
MotionV2V: Editing Motion in a Video
R. Burgert
Charles Herrmann
Forrester Cole
Michael S. Ryoo
Neal Wadhwa
Andrey Voynov
Nataniel Ruiz
DiffM, VGen
312
5
0
25 Nov 2025
Point-to-Point: Sparse Motion Guidance for Controllable Video Editing
Yeji Song
Jaehyun Lee
Mijin Koo
Junhoo Lee
Nojun Kwak
DiffM, VGen
131
0
0
23 Nov 2025
Generative Augmented Reality: Paradigms, Technologies, and Future Applications
Chen Liang
Jiawen Zheng
Yufeng Zeng
Yi Tan
Hengye Lyu
Yuhui Zheng
Zisu Li
Yueting Weng
Jiaxin Shi
Hanwang Zhang
214
1
0
20 Nov 2025
Generative Photographic Control for Scene-Consistent Video Cinematic Editing
Huiqiang Sun
Liao Shen
Zhan Peng
Kun Wang
Size Wu
...
Z. Huang
Xingyu Zeng
Zhiguo Cao
Wei Li
Chen Change Loy
DiffM, VGen
234
0
0
17 Nov 2025
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models
Panwang Pan
Chenguo Lin
Jingjing Zhao
Chenxin Li
Yuchen Lin
...
Honglei Yan
Kairun Wen
Yunlong Lin
Yixuan Yuan
Yadong Mu
3DGS, VGen
223
2
0
01 Nov 2025
World-in-World: World Models in a Closed-Loop World
Jiahan Zhang
Muqing Jiang
Nanru Dai
Taiming Lu
Arda Uzunoglu
...
Rama Chellappa
Tianmin Shu
Alan Yuille
Yilun Du
Jieneng Chen
VGen, VLM
322
13
0
20 Oct 2025
Generalized Dynamics Generation towards Scannable Physical World Model
Yichen Li
Zhiyi Li
Brandon Feng
Dinghuai Zhang
Antonio Torralba
3DGS, AI4CE
174
0
0
16 Oct 2025
STANCE: Motion Coherent Video Generation Via Sparse-to-Dense Anchored Encoding
Zhifei Chen
Tianshuo Xu
Leyi Wu
Luozhou Wang
Dongyu Yan
Zihan You
Wenting Luo
Guo Zhang
Yingcong Chen
DiffM, VGen
244
2
0
16 Oct 2025
What If: Understanding Motion Through Sparse Interactions
S. A. Baumann
Nick Stracke
Timy Phan
Bjorn Ommer
188
1
0
14 Oct 2025
Real-Time Motion-Controllable Autoregressive Video Diffusion
Kesen Zhao
Jiaxin Shi
B. Zhu
Junbao Zhou
Xiaolong Shen
Yuan Zhou
Qianru Sun
Hanwang Zhang
VGen
278
7
0
09 Oct 2025
An approach for systematic decomposition of complex LLM tasks
Tianle Zhou
Jiakai Xu
G. Liu
Jiaxiang Liu
Haonan Wang
Eugene Wu
237
0
0
09 Oct 2025
FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control
Zhiyuan Zhang
Can Wang
Dongdong Chen
Jing Liao
VGen
290
2
0
09 Oct 2025
MultiCOIN: Multi-Modal COntrollable Video INbetweening
Maham Tanveer
Yang Zhou
Simon Niklaus
Ali Mahdavi-Amiri
Hao Zhang
Krishna Kumar Singh
Nanxuan Zhao
DiffM, VGen
229
2
0
09 Oct 2025
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction
Jin Cao
Hongrui Wu
Ziyong Feng
Hujun Bao
Xiaowei Zhou
Sida Peng
VGen
196
1
0
02 Oct 2025
ASTRA: Let Arbitrary Subjects Transform in Video Editing
Fei Shen
Weihao Xu
Rui Yan
Dong Zhang
Xiangbo Shu
Jinhui Tang
Maocheng Zhao
VOS, VGen
184
6
0
01 Oct 2025
Drag4D: Align Your Motion with Text-Driven 3D Scene Generation
Minjun Kang
Inkyu Shin
Taeyeop Lee
In So Kweon
Kuk-Jin Yoon
177
0
0
26 Sep 2025
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
Yu Yuan
Xijun Wang
Tharindu Wickremasinghe
Zeeshan Nadir
Bole Ma
Stanley H. Chan
DiffM, VGen, PINN
1.6K
19
0
25 Sep 2025
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
Jiahao Wang
Yufeng Yuan
Rujie Zheng
Youtian Lin
Jian Gao
...
Xiaoxiao Long
Hao Zhu
Z. Zhang
X. Cao
Yao Yao
VGen
445
26
0
11 Sep 2025
Zo3T: Zero-Shot 3D-Aware Trajectory-Guided Image-to-Video Generation via Test-Time Training
Ruicheng Zhang
Jun Zhou
Zunnan Xu
Zihao Liu
Jiehui Huang
M. Zhang
Yu Sun
Xiu Li
VGen
418
4
0
08 Sep 2025
O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
Yihao Chen
Junjie Wang
Lin Liu
Ruihang Chu
Xiaopeng Zhang
Qi Tian
Yujiu Yang
DiffM, VGen
200
6
0
01 Sep 2025
Precise Action-to-Video Generation Through Visual Action Prompts
Yuang Wang
Chao Wen
Haoyu Guo
Sida Peng
Minghan Qin
Hujun Bao
Xiaowei Zhou
Ruizhen Hu
VGen
177
8
0
18 Aug 2025
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
Jingyun Liang
Jingkai Zhou
Shikai Li
Chenjie Cao
Lei Sun
Yichen Qian
Weihua Chen
Fan Wang
DiffM, VGen
203
6
0
12 Aug 2025
LayerT2V: A Unified Multi-Layer Video Generation Framework
Kangrui Cen
Baixuan Zhao
Yi Xin
Siqi Luo
Guoquan Zheng
Xiaohong Liu
Lei Zhang
Xiaohong Liu
DiffM, VGen
195
0
0
06 Aug 2025
QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots
Sheng Wu
Fei Teng
Hao Shi
Qi Jiang
Kai Luo
Kaiwei Wang
Kailun Yang
VGen
322
4
0
04 Aug 2025
TransFlow: Motion Knowledge Transfer from Video Diffusion Models to Video Salient Object Detection
Suhwan Cho
Minhyeok Lee
Jungho Lee
Sunghun Yang
Sangyoun Lee
154
0
0
26 Jul 2025
T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation
Yubin Chen
Xuyang Guo
Zhenmei Shi
Zhao Song
Jiahao Zhang
VGen
803
12
0
24 Jul 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Yuping He
Yifei Huang
Guo Chen
Lidong Lu
Baoqi Pei
Jilan Xu
Tong Lu
Yoichi Sato
EgoV
567
4
0
06 Jun 2025
EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh
Tao Hu
Haoyang Peng
Xiao Liu
Yuewen Ma
VGen, MDE
210
17
0
05 Jun 2025
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
Xiao Fu
Xintao Wang
Xian Liu
Jianhong Bai
R. Xu
Pengfei Wan
Di Zhang
Dahua Lin
VGen
376
22
0
02 Jun 2025
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control
Anthony Chen
Wenzhao Zheng
Yida Wang
Xueyang Zhang
Kun Zhan
Fu Liu
Kurt Keutzer
Shanghang Zhang
404
14
0
28 May 2025
ATI: Any Trajectory Instruction for Controllable Video Generation
Angtian Wang
Haibin Huang
Yizhi Wang
Yiding Yang
Chongyang Ma
DiffM, VGen
430
21
0
28 May 2025
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
Boyang Wang
Xuweiyi Chen
Matheus Gadelha
Zezhou Cheng
DiffM, VGen
452
5
0
27 May 2025
EF-VI: Enhancing End-Frame Injection for Video Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Mingyu Ding
Lichao Sun
VGen
399
0
0
27 May 2025
MotionPro: A Precise Motion Controller for Image-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Zhongwei Zhang
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Wu Liu
Ting Yao
Tao Mei
DiffM, VGen
437
18
0
26 May 2025
Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals
Nate Gillman
Charles Herrmann
Michael Freeman
Daksh Aggarwal
Evan Luo
Deqing Sun
Chen Sun
VGen, AI4CE
519
27
0
26 May 2025
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
Zizhang Li
Hong-Xing Yu
Wei Liu
Yin Yang
Charles Herrmann
Gordon Wetzstein
Jiajun Wu
VGen
329
23
0
23 May 2025
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
Computer Vision and Pattern Recognition (CVPR), 2025
Dian Shao
Mingfei Shi
Shengda Xu
Haodong Chen
Yongle Huang
Binglu Wang
3DH
485
12
0
19 May 2025
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
Ahmet Berke Gokmen
Yigit Ekin
Bahri Batuhan Bilecen
Aysegül Dündar
942
5
0
19 May 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
Xianghao Kong
Qiaosong Qi
Yuanbin Wang
Anyi Rao
Biaolong Chen
Aixi Zhang
DiffM, VGen
327
4
0
10 May 2025
On Equivariance and Fast Sampling in Video Diffusion Models Trained with Warped Noise
Chao Liu
Arash Vahdat
DiffM, VGen
464
5
0
14 Apr 2025
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Ruineng Li
Daitao Xing
Huiming Sun
Yuanzhou Ha
Jinglin Shen
C. Ho
DiffM, VGen
336
6
0
11 Apr 2025
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
Jialu Li
Shoubin Yu
Han Lin
Jaemin Cho
Jaehong Yoon
Joey Tianyi Zhou
DiffM, VGen
434
8
0
11 Apr 2025
PanoDreamer: Consistent Text to 360-Degree Scene Generation
Zhexiao Xiong
Z. Chen
Zhong Li
Yi Tian Xu
Nathan Jacobs
3DGS, VGen
347
1
0
07 Apr 2025
Multi-identity Human Image Animation with Structural Video Diffusion
Zhenzhi Wang
Yongqian Li
Yanhong Zeng
Yuwei Guo
Dahua Lin
Tianfan Xue
Bo Dai
VGen
340
8
0
05 Apr 2025
3D Scene Understanding Through Local Random Access Sequence Modeling
Wanhee Lee
Klemen Kotar
R. Venkatesh
Jared Watrous
Honglin Chen
Khai Loong Aw
Daniel L. K. Yamins
3DV
318
3
0
04 Apr 2025
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
Computer Vision and Pattern Recognition (CVPR), 2025
Jiayi Gao
Zijin Yin
Changcheng Hua
Yuxin Peng
Kongming Liang
Zhanyu Ma
Jiaxin Guo
Yang Liu
VGen, DiffM
398
9
0
03 Apr 2025