2304.01186
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos
3 April 2023
Yue Ma
Yin-Yin He
Xiaodong Cun
Xintao Wang
Siran Chen
Ying Shan
Xiu Li
Qifeng Chen
DiffM
VGen
Papers citing
"Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
23 / 23 papers shown
Title
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
Zeqi Gu
Difan Liu
Timothy Langlois
Matthew Fisher
Abe Davis
DiffM
3DH
57
0
0
19 Mar 2025
EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
Zexuan Yan
Yue Ma
Chang Zou
Wenteng Chen
Qifeng Chen
Linfeng Zhang
49
0
0
13 Mar 2025
Get In Video: Add Anything You Want to the Video
Shaobin Zhuang
Zhipeng Huang
Binxin Yang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Chong Sun
Zheng-Jun Zha
Chen Li
Y. Wang
DiffM
VGen
43
0
0
08 Mar 2025
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Y. Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen
DiffM
58
1
0
30 Sep 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffM
VGen
87
3
0
19 Aug 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Zeyue Tian
Zhaoyang Liu
Ruibin Yuan
Jiahao Pan
Xiaoqiang Huang
Xu Tan
Qifeng Chen
Y. Guo
VGen
94
16
0
06 Jun 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
DiffM
VGen
20
86
0
02 May 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Xiu Li
DiffM
89
22
0
22 Apr 2024
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan
Jinfa Huang
Yujun Shi
Yongqi Xu
Ruijie Zhu
Bin Lin
Xinhua Cheng
Li-xin Yuan
Jiebo Luo
VGen
70
31
0
07 Apr 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
44
24
0
14 Mar 2024
Context-aware Talking Face Video Generation
Meidai Xuanyuan
Yuwang Wang
Honglei Guo
Qionghai Dai
DiffM
19
0
0
28 Feb 2024
3D Human Pose Analysis via Diffusion Synthesis
Haorui Ji
Hongdong Li
DiffM
42
3
0
17 Jan 2024
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
16
8
0
19 Dec 2023
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
Zhiyuan Ma
Guoli Jia
Bowen Zhou
DiffM
36
8
0
13 Dec 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
11
103
0
12 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
17
73
0
09 Oct 2023
Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects
Chunming He
Kai Li
Yachao Zhang
Yulun Zhang
Z. Guo
Xiu Li
Martin Danelljan
F. I. F. Richard Yu
AAML
25
44
0
06 Aug 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
23
87
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
18
30
0
29 May 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
68
75
0
13 Apr 2023
FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation
Ronghui Li
Junfan Zhao
Yachao Zhang
Mingyang Su
Zeping Ren
Han Zhang
Yansong Tang
Xiuhua Li
DiffM
15
49
0
07 Dec 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
235
556
0
29 May 2022
Motion Representations for Articulated Animation
Aliaksandr Siarohin
Oliver J. Woodford
Jian Ren
Menglei Chai
Sergey Tulyakov
OCL
100
252
0
22 Apr 2021