ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.04761
  4. Cited By
Video-P2P: Video Editing with Cross-attention Control

Video-P2P: Video Editing with Cross-attention Control

Computer Vision and Pattern Recognition (CVPR), 2023
8 March 2023
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
    DiffMVGen
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Video-P2P: Video Editing with Cross-attention Control"

50 / 213 papers shown
Title
Understanding Attention Mechanism in Video Diffusion Models
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
287
2
0
16 Apr 2025
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Diljeet Jagpal
Xi Chen
Vinay P. Namboodiri
DiffMVGen
135
0
0
09 Apr 2025
RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism
RAGME: Retrieval Augmented Video Generation for Enhanced Motion RealismInternational Conference on Multimedia Retrieval (ICMR), 2025
E. Peruzzo
Dejia Xu
Xingqian Xu
Humphrey Shi
Andrii Zadaianchuk
DiffMVGen
279
2
0
09 Apr 2025
FreeInv: Free Lunch for Improving DDIM Inversion
FreeInv: Free Lunch for Improving DDIM Inversion
Yuxiang Bao
Huijie Liu
Xun Gao
Huan Fu
Guoliang Kang
162
2
0
29 Mar 2025
Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations
Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations
Tharun Anand
Siva Sankar
Pravin Nair
AAML
207
2
0
28 Mar 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Longji Xu
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVMVGen
351
19
0
27 Mar 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Dian Zheng
Ziqi Huang
Hongbo Liu
Kai Zou
Yinan He
...
Jingwen He
Wei-Shi Zheng
Botian Shi
Yu Qiao
Ziwei Liu
EGVMVGen
291
76
0
27 Mar 2025
Video-T1: Test-Time Scaling for Video Generation
Video-T1: Test-Time Scaling for Video Generation
Fan Liu
Hanyang Wang
Yimo Cai
Kaiyan Zhang
Xiaohang Zhan
Yueqi Duan
DiffMVGen
402
15
0
24 Mar 2025
Target-Aware Video Diffusion Models
Target-Aware Video Diffusion Models
Taeksoo Kim
Hanbyul Joo
DiffMVGen
390
3
0
24 Mar 2025
InstructVEdit: A Holistic Approach for Instructional Video Editing
InstructVEdit: A Holistic Approach for Instructional Video Editing
Chi Zhang
C. Feng
Feng Yan
Qiming Zhang
Mingjin Zhang
Yujie Zhong
Jing Zhang
Lin Ma
DiffMVGen
192
3
0
22 Mar 2025
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion ModelComputer Vision and Pattern Recognition (CVPR), 2025
Yingying Fan
Quanwei Yang
Kaisiyuan Wang
Hang Zhou
Yingying Li
Haocheng Feng
Errui Ding
Y. Wu
Jiadong Wang
DiffM
321
6
0
21 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
Jing Zhang
Lu Qi
Xuelong Li
Yunhai Tong
231
4
0
21 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and EditingComputer Vision and Pattern Recognition (CVPR), 2025
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffMVGen
309
9
0
18 Mar 2025
EQ-TAA: Equivariant Traffic Accident Anticipation via Diffusion-Based Accident Video Synthesis
EQ-TAA: Equivariant Traffic Accident Anticipation via Diffusion-Based Accident Video Synthesis
Jianwu Fang
Lei-lei Li
Zhedong Zheng
Hongkai Yu
Jianru Xue
Zhengguo Li
Tat-Seng Chua
181
0
0
16 Mar 2025
RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing
Tianrui Pan
Lin Liu
Jie Liu
Xinsong Zhang
J. Tang
Gangshan Wu
Q. Tian
DiffMVGen
261
0
0
14 Mar 2025
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
H. Iqbal
Nazmul Karim
Umar Khalid
Azib Farooq
Z. Zhong
Jing Hua
Chen Chen
DiffM3DGSVGen
362
0
0
14 Mar 2025
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Jianhong Bai
Menghan Xia
Xiao Fu
Xintao Wang
Lianrui Mu
...
Zuozhu Liu
Haoji Hu
Xiang Bai
Pengfei Wan
Di Zhang
DiffMVGen
389
88
0
14 Mar 2025
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image
Qi Zhao
Zhan Ma
Pan Zhou
VGen
355
2
0
13 Mar 2025
VACE: All-in-One Video Creation and Editing
Zeyinzi Jiang
Zhen Han
Chaojie Mao
Junxuan Zhang
Yulin Pan
Yu Liu
DiffMVGen
342
141
0
10 Mar 2025
Synchronized Video-to-Audio Generation via Mel Quantization-Continuum DecompositionComputer Vision and Pattern Recognition (CVPR), 2025
Juncheng Wang
Chao Xu
Cheng Yu
Lei Shang
Zhe Hu
Shujun Wang
Liefeng Bo
DiffMVGen
222
2
0
10 Mar 2025
Get In Video: Add Anything You Want to the Video
Shaobin Zhuang
Zhipeng Huang
Binxin Yang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Chong Sun
Zheng-Jun Zha
Chen Li
Yijiao Wang
DiffMVGen
283
9
0
08 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
252
9
0
06 Mar 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
VideoGrain: Modulating Space-Time Attention for Multi-grained Video EditingInternational Conference on Learning Representations (ICLR), 2025
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffMVGen
280
26
0
24 Feb 2025
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
Shuheng Zhang
Wenshu Fan
Hongbo Zhou
Jun Peng
Weihao Ye
Xiaoshuai Sun
Rongrong Ji
VGen
248
3
0
08 Feb 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
284
0
0
13 Jan 2025
Edit as You See: Image-guided Video Editing via Masked Motion Modeling
Edit as You See: Image-guided Video Editing via Masked Motion Modeling
Zhi-Lin Huang
Zichen Liu
Chujun Qin
Zihan Wang
Dong Zhou
Dong Li
E. Barsoum
DiffMVGen
172
0
0
08 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
Xianrui Li
Tao Zhang
Zilong Huang
Shilin Xu
...
Yunhai Tong
Lu Qi
Jiashi Feng
Ming-Hsuan Yang
Ming-Hsuan Yang
VLM
534
68
0
07 Jan 2025
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
Yuanpeng Tu
Hao Luo
Xi Chen
S. Ji
Xiang Bai
Hengshuang Zhao
DiffMVGen
466
29
0
02 Jan 2025
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation
Haoyu Zheng
Wenqiao Zhang
Zheqi Lv
Yu Zhong
Yang Dai
...
Yongliang Shen
Juncheng Billy Li
Dongping Zhang
Siliang Tang
Yueting Zhuang
DiffMVGen
228
1
0
31 Dec 2024
Generative Video Propagation
Generative Video PropagationComputer Vision and Pattern Recognition (CVPR), 2024
Shaoteng Liu
Tianyu Wang
Jiadong Wang
Qing Liu
Zhifei Zhang
...
Rui Wang
Bei Yu
Zhe Lin
Seunggeun Kim
Jiaya Jia
DiffMVGen
235
16
0
27 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
Chentao Song
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGenDiffM
310
7
0
04 Dec 2024
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D
  Diffusion
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D DiffusionComputer Vision and Pattern Recognition (CVPR), 2024
Kai He
Chin-Hsuan Wu
Igor Gilitschenski
DiffM3DGS
291
5
0
02 Dec 2024
SPAgent: Adaptive Task Decomposition and Model Selection for General
  Video Generation and Editing
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
Rong-Cheng Tu
Wenhao Sun
Zhao Jin
Jingyi Liao
Jiaxing Huang
Dacheng Tao
VGenDiffM
325
12
0
28 Nov 2024
VideoDirector: Precise Video Editing via Text-to-Video Models
VideoDirector: Precise Video Editing via Text-to-Video ModelsComputer Vision and Pattern Recognition (CVPR), 2024
Yukun Wang
Longguang Wang
Zhiyuan Ma
Qibin Hu
Kai Xu
Yulan Guo
VGenDiffM
421
14
0
26 Nov 2024
UVCG: Leveraging Temporal Consistency for Universal Video Protection
UVCG: Leveraging Temporal Consistency for Universal Video Protection
KaiZhou Li
Jindong Gu
Xinchun Yu
Junjie Cao
Yansong Tang
Jinqiang Cui
AAML
209
1
0
25 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Yu Xie
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffMVGen
210
18
0
17 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
354
3
0
12 Nov 2024
Taming Rectified Flow for Inversion and Editing
Taming Rectified Flow for Inversion and Editing
Jiangshan Wang
Junfu Pu
Chen Ma
Jiayi Guo
Yue Ma
Nisha Huang
Yuxin Chen
Xiu Li
Mingyu Ding
432
99
0
07 Nov 2024
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion
  Models
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2024
Giannis Daras
Weili Nie
Karsten Kreis
A. Dimakis
Morteza Mardani
Nikola B. Kovachki
Arash Vahdat
DiffM
319
15
0
21 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
323
7
0
14 Oct 2024
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free WayComputer Vision and Pattern Recognition (CVPR), 2024
Jiazi Bu
Pengyang Ling
Pan Zhang
Tong Wu
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
DiffMVGen
163
0
0
08 Oct 2024
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot
  Video Editing
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video EditingAAAI Conference on Artificial Intelligence (AAAI), 2024
Lingling Cai
Kang Zhao
Hangjie Yuan
Yingya Zhang
Shiwei Zhang
Kejie Huang
VGen
124
2
0
30 Sep 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
DNI: Dilutional Noise Initialization for Diffusion Video EditingEuropean Conference on Computer Vision (ECCV), 2024
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
214
8
0
19 Sep 2024
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing ModelsAAAI Conference on Artificial Intelligence (AAAI), 2024
Yupeng Chen
Penglin Chen
Xiaoyu Zhang
Yixian Huang
Qian Xie
DiffM
324
4
0
15 Sep 2024
Blended Latent Diffusion under Attention Control for Real-World Video
  Editing
Blended Latent Diffusion under Attention Control for Real-World Video Editing
Deyin Liu
Lin Yuanbo Wu
Xianghua Xie
DiffM
147
3
0
05 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive
  Content Generation
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haobo Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wen Liu
DiffM
192
37
0
02 Sep 2024
Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing
Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing
Yangyang Xu
Wenqi Shao
Yong Du
Haiming Zhu
Yang Zhou
Ping Luo
Shengfeng He
DiffM
223
2
0
23 Aug 2024
E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video
  Editing Quality Assessment
E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun
Xiaoyu Liang
S. Fan
Wenxu Gao
Wei-Nan Gao
DiffM
229
6
0
21 Aug 2024
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion
  Consistency
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion ConsistencyEuropean Conference on Computer Vision (ECCV), 2024
Xiaojing Zhong
Xinyi Huang
Xiaofeng Yang
Guosheng Lin
Qingyao Wu
DiffM
174
10
0
14 Aug 2024
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Manuel Kansy
Jacek Naruniec
Christopher Schroers
Markus Gross
Romann M. Weber
DiffMVGen
358
6
0
01 Aug 2024
Previous
12345
Next