Video-P2P: Video Editing with Cross-attention Control

Computer Vision and Pattern Recognition (CVPR), 2023
8 March 2023
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM, VGen

Papers citing "Video-P2P: Video Editing with Cross-attention Control"

50 / 209 papers shown
Title
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Qingyan Bai
Qiuyu Wang
Hao Ouyang
Yue Yu
Hanlin Wang
...
Yanhong Zeng
Zichen Liu
Yinghao Xu
Yujun Shen
Qifeng Chen
VGen
121
7
0
18 Dec 2025
Video Generation Models Are Good Latent Reward Models
Xiaoyue Mi
W. Yu
Jiesong Lian
Shibo Jie
Ruizhe Zhong
...
Z. Zhou
Zhiyong Xu
Yuan Zhou
Qinglin Lu
Fan Tang
EGVM, VGen
121
0
0
26 Nov 2025
MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
Farnoosh Koleini
Hongfei Xue
Ahmed Helmy
Pu Wang
119
0
0
24 Nov 2025
Point-to-Point: Sparse Motion Guidance for Controllable Video Editing
Yeji Song
Jaehyun Lee
Mijin Koo
Junhoo Lee
Nojun Kwak
DiffM, VGen
64
0
0
23 Nov 2025
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization
Daniel Gilo
Or Litany
129
0
0
18 Nov 2025
Generative Photographic Control for Scene-Consistent Video Cinematic Editing
Huiqiang Sun
Liao Shen
Zhan Peng
Kun Wang
Size Wu
...
Z. Huang
Xingyu Zeng
Zhiguo Cao
Wei Li
Chen Change Loy
DiffM, VGen
134
0
0
17 Nov 2025
Redundancy-optimized Multi-head Attention Networks for Multi-View Multi-Label Feature Selection
Yuzhou Liu
Jiarui Liu
Wanfu Gao
24
0
0
16 Nov 2025
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist
Z. Liang
D. Zhang
Huichi Zhou
Rui Huang
Bobo Li
...
Shengqiong Wu
X. Wang
Jiebo Luo
Lizi Liao
Hao Fei
VGen
149
0
0
11 Nov 2025
FAME: Fairness-aware Attention-modulated Video Editing
Zhangkai Wu
Xuhui Fan
Zhongyuan Xie
Kaize Shi
Zhidong Li
Longbing Cao
VGen
112
1
0
27 Oct 2025
In-Context Learning with Unpaired Clips for Instruction-based Video Editing
Xinyao Liao
Xianfang Zeng
Ziye Song
Zhoujie Fu
Gang Yu
Guosheng Lin
99
3
0
16 Oct 2025
VIDMP3: Video Editing by Representing Motion with Pose and Position Priors
Sandeep Mishra
Oindrila Saha
A. Bovik
DiffM, VGen
82
0
0
14 Oct 2025
EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection
Huaizhi Qu
Ruichen Zhang
Shuqing Luo
Luchao Qi
Zhihao Zhang
Xiaoming Liu
Roni Sengupta
Tianlong Chen
DiffM, VGen
88
0
0
11 Oct 2025
Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians
Jin-Chuan Shi
Chengye Su
Jiajun Wang
Ariel Shamir
Miao Wang
DiffM, 3DGS, VGen
76
0
0
10 Oct 2025
UniVideo: Unified Understanding, Generation, and Editing for Videos
Cong Wei
Quande Liu
Zixuan Ye
Qiulin Wang
Xintao Wang
Pengfei Wan
Kun Gai
Wenhu Chen
VGen
188
8
0
09 Oct 2025
Streaming Drag-Oriented Interactive Video Manipulation: Drag Anything, Anytime!
Junbao Zhou
Yuan Zhou
Kesen Zhao
Qingshan Xu
B. Zhu
Richang Hong
Hanwang Zhang
DiffM, VGen
179
1
0
03 Oct 2025
FreeViS: Training-free Video Stylization with Inconsistent References
Jiacong Xu
Yiqun Mei
Ke Zhang
Vishal M. Patel
DiffM, VGen
176
2
0
02 Oct 2025
IMAGEdit: Let Any Subject Transform
Fei Shen
Weihao Xu
Rui Yan
Dong Zhang
Xiangbo Shu
Jinhui Tang
VGen
88
0
0
01 Oct 2025
Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation
Mingyu Kang
Yong Suk Choi
DiffM
143
0
0
30 Sep 2025
VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing
Abdelilah Aitrouga
Youssef Hmamouche
Amal El Fallah Seghrouchni
VGen
113
0
0
30 Sep 2025
VMDiff: Visual Mixing Diffusion for Limitless Cross-Object Synthesis
Zeren Xiong
Yue Yu
Zedong Zhang
Shuo Chen
J. Yang
Jun Yu Li
DiffM
119
0
0
28 Sep 2025
Object-AVEdit: An Object-level Audio-Visual Editing Model
Y. Fu
Ruiyang Si
Hongfa Wang
Dongzhan Zhou
J. Sun
Ping Luo
Di Hu
Hongyuan Zhang
Xuelong Li
DiffM, VGen, KELM
134
6
0
27 Sep 2025
UniTransfer: Video Concept Transfer via Progressive Spatial and Timestep Decomposition
Guojun Lei
Rong Zhang
Chi-Yin Wang
Tianhang Liu
Hong Li
Zhiyuan Ma
W. Xu
VGen
122
0
0
25 Sep 2025
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Pin-Yen Chiu
I-Sheng Fang
Jun-Cheng Chen
DiffM
72
0
0
23 Sep 2025
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation
Yue Ma
Zexuan Yan
Hongyu Liu
H. Wang
Heng Pan
...
H. Shum
Zhifeng Li
Wei Liu
Linfeng Zhang
Qifeng Chen
VGen
159
9
0
20 Sep 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
126
2
0
20 Aug 2025
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Canyu Zhao
Xiaoman Li
Tianjian Feng
Zhiyue Zhao
Hao Chen
Chunhua Shen
DiffM, VGen
134
2
0
20 Aug 2025
Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing
Feng-Lin Liu
Shi-Yang Li
Yan-Pei Cao
Hongbo Fu
Lin Gao
DiffM, VGen
108
0
0
19 Aug 2025
Story2Board: A Training-Free Approach for Expressive Storyboard Generation
David Dinkevich
Matan Levy
Omri Avrahami
Dvir Samuel
Dani Lischinski
DiffM
55
0
0
13 Aug 2025
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Zixin Yin
Xili Dai
Ling Chen
Deyu Zhou
Jianan Wang
Duomin Wang
Gang Yu
Lionel M. Ni
Lei Zhang
H. Shum
DiffM
100
1
0
12 Aug 2025
Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis
Kunyu Feng
Yue Ma
X. Zhang
Boshi Liu
Yikuang Yuluo
...
Hongyu Liu
Zhiyuan Qin
Shanhui Mo
Qifeng Chen
Zeyu Wang
3DV, SyDa, VGen
109
7
0
07 Aug 2025
VideoGuard: Protecting Video Content from Unauthorized Editing
Junjie Cao
KaiZhou Li
Xinchun Yu
Hongxiang Li
Xiaoping Zhang
DiffM, VGen
95
0
0
05 Aug 2025
RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution
Weisong Zhao
Jingkai Zhou
Xiangyu Zhu
Weihua Chen
Xiao-Yu Zhang
Zhen Lei
Fan Wang
VGen
97
0
0
25 Jul 2025
CarGait: Cross-Attention based Re-ranking for Gait recognition
Gavriel Habib
Noa Barzilay
O. Shimshi
Rami Ben-Ari
N. Darshan
CVBM
235
1
0
01 Jul 2025
iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer
Zhelun Shen
Chenming Wu
Junsheng Zhou
Chen Zhao
Kaisiyuan Wang
Hang Zhou
Yingying Li
Haocheng Feng
Wei He
Jingdong Wang
DiffM
194
0
0
15 Jun 2025
LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning
Chenjian Gao
Lihe Ding
Xin Cai
Zhanpeng Huang
Zibin Wang
Tianfan Xue
DiffM, VGen
394
7
0
11 Jun 2025
Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
Qilin Yin
Wei Lu
Xiangyang Luo
Xiaochun Cao
139
1
0
10 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffM, VGen
197
4
0
09 Jun 2025
TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation
M. Kim
Dongjin Kim
Seokju Yun
Jaegul Choo
DiffM, VGen
133
1
0
08 Jun 2025
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
Computer Vision and Pattern Recognition (CVPR), 2025
Yixuan Zhu
Haolin Wang
Shilin Ma
Wenliang Zhao
Yansong Tang
Lei Chen
Jie Zhou
DiffM, VGen
380
1
0
06 Jun 2025
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li
Yanming Yang
Chenxi Song
Chi Zhang
DiffM, VGen
229
5
0
05 Jun 2025
Interactive Video Generation via Domain Adaptation
Ishaan Rawal
Suryansh Kumar
DiffM, VGen
124
0
0
30 May 2025
FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing
Jeongsol Kim
Yeobin Hong
Jong Chul Ye
J. C. Ye
260
5
0
29 May 2025
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffM, VGen
253
0
0
11 May 2025
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Computer Vision and Pattern Recognition (CVPR), 2025
Jiangtong Tan
Hu Yu
Jie Huang
Jie Xiao
Feng Zhao
280
5
0
02 May 2025
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
Minkyu Choi
S P Sharan
Harsh Goel
Sahil Shah
Sandeep Chinchali
DiffM, VGen
366
3
0
24 Apr 2025
Visual Prompting for One-shot Controllable Video Editing without Inversion
Computer Vision and Pattern Recognition (CVPR), 2025
Zitao Gao
Yuxi Zhou
Duo Peng
Joo-Hwee Lim
Zhigang Tu
De Wen Soh
Lin Geng Foo
DiffM
313
3
0
19 Apr 2025
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
287
2
0
16 Apr 2025
RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism
International Conference on Multimedia Retrieval (ICMR), 2025
E. Peruzzo
Dejia Xu
Xingqian Xu
Humphrey Shi
Andrii Zadaianchuk
DiffM, VGen
251
2
0
09 Apr 2025
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Diljeet Jagpal
Xi Chen
Vinay P. Namboodiri
DiffM, VGen
123
0
0
09 Apr 2025
FreeInv: Free Lunch for Improving DDIM Inversion
Yuxiang Bao
Huijie Liu
Xun Gao
Huan Fu
Guoliang Kang
134
2
0
29 Mar 2025