ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.10656
  4. Cited By
VidToMe: Video Token Merging for Zero-Shot Video Editing

VidToMe: Video Token Merging for Zero-Shot Video Editing

17 December 2023
Xirui Li
Chao Ma
Xiaokang Yang
Ming-Hsuan Yang
    DiffM
    VGen
ArXivPDFHTML

Papers citing "VidToMe: Video Token Merging for Zero-Shot Video Editing"

34 / 34 papers shown
Title
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffM
VGen
24
0
0
11 May 2025
Towards Generalized and Training-Free Text-Guided Semantic Manipulation
Towards Generalized and Training-Free Text-Guided Semantic Manipulation
Yu Hong
Xiao Cai
Pengpeng Zeng
Shuai Zhang
Jingkuan Song
Lianli Gao
H. Shen
DiffM
31
0
0
24 Apr 2025
Physical Reservoir Computing in Hook-Shaped Rover Wheel Spokes for Real-Time Terrain Identification
Physical Reservoir Computing in Hook-Shaped Rover Wheel Spokes for Real-Time Terrain Identification
Xiao Jin
Zihan Wang
Zhenhua Yu
Changrak Choi
Kalind Carpenter
T. Nanayakkara
23
0
0
17 Apr 2025
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Zhihang Yuan
Rui Xie
Yuzhang Shang
H. Zhang
Siyuan Wang
Shengen Yan
Guohao Dai
Yu Wang
DiffM
VGen
42
0
0
16 Apr 2025
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling
Jaskirat Singh
Junshen Kevin Chen
Jonas Kohler
Michael Cohen
DiffM
VGen
35
0
0
08 Apr 2025
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Yingying Fan
Quanwei Yang
Kaisiyuan Wang
Hang Zhou
Yingying Li
Haocheng Feng
Errui Ding
Y. Wu
J. Wang
DiffM
42
0
0
21 Mar 2025
FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Minghan Li
C. Xie
Y. Wu
Lei Zhang
M. Wang
DiffM
VGen
52
0
0
17 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
90
0
0
06 Mar 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
50
0
0
13 Jan 2025
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Zhao Jin
Dacheng Tao
VGen
97
1
0
16 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
He Zhang
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen
DiffM
76
0
0
04 Dec 2024
SPAgent: Adaptive Task Decomposition and Model Selection for General
  Video Generation and Editing
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
Rong-Cheng Tu
Wenhao Sun
Zhao Jin
Jingyi Liao
Jiaxing Huang
Dacheng Tao
VGen
DiffM
92
3
0
28 Nov 2024
VideoDirector: Precise Video Editing via Text-to-Video Models
VideoDirector: Precise Video Editing via Text-to-Video Models
Yukun Wang
Longguang Wang
Zhiyuan Ma
Qibin Hu
Kai Xu
Yulan Guo
VGen
DiffM
86
0
0
26 Nov 2024
Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge
Qinglong Cao
Ding Wang
Xirui Li
Yuntian Chen
Chao Ma
Xiaokang Yang
DiffM
VGen
113
2
0
18 Nov 2024
VeGaS: Video Gaussian Splatting
Weronika Smolak-Dyżewska
Dawid Malarz
Kornel Howil
Jan Kaczmarczyk
Marcin Mazur
P. Spurek
56
1
0
17 Nov 2024
Video Token Merging for Long-form Video Understanding
Video Token Merging for Long-form Video Understanding
Seon-Ho Lee
Jue Wang
Zhikang Zhang
D. Fan
Xinyu Li
35
5
0
31 Oct 2024
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
Wenhao Chai
Enxin Song
Y. Du
Chenlin Meng
Vashisht Madhavan
Omer Bar-Tal
Jeng-Neng Hwang
Saining Xie
Christopher D. Manning
3DV
77
25
0
04 Oct 2024
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous
  Driving
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving
Yunsheng Ma
Amr Abdelraouf
Rohit Gupta
Ziran Wang
Kyungtae Han
21
3
0
16 Sep 2024
Multi-sentence Video Grounding for Long Video Generation
Multi-sentence Video Grounding for Long Video Generation
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Wenwu Zhu
DiffM
32
0
0
18 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
54
5
0
01 Jul 2024
Diffusion Model-Based Video Editing: A Survey
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
55
22
0
26 Jun 2024
Splatter a Video: Video Gaussian Representation for Versatile Processing
Splatter a Video: Video Gaussian Representation for Versatile Processing
Yang-tian Sun
Yi-Hua Huang
Lin Ma
Xiaoyang Lyu
Yan-Pei Cao
Xiaojuan Qi
3DGS
30
5
0
19 Jun 2024
NaRCan: Natural Refined Canonical Image with Integration of Diffusion
  Prior for Video Editing
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing
Ting-Hsuan Chen
Jiewen Chan
Hau-Shiang Shiu
Shih-Han Yen
Chang-Han Yeh
Yu-Lun Liu
VGen
DiffM
40
3
0
10 Jun 2024
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video
  Motion Editing
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
Yi Zuo
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Shuyuan Yang
Yuwei Guo
VGen
DiffM
29
1
0
07 May 2024
Leveraging Temporal Contextualization for Video Action Recognition
Leveraging Temporal Contextualization for Video Action Recognition
Minji Kim
Dongyoon Han
Taekyung Kim
Bohyung Han
43
2
0
15 Apr 2024
Animate Your Motion: Turning Still Images into Dynamic Videos
Animate Your Motion: Turning Still Images into Dynamic Videos
Mingxiao Li
Bo Wan
Marie-Francine Moens
Tinne Tuytelaars
VGen
DiffM
30
4
0
15 Mar 2024
Video Editing via Factorized Diffusion Distillation
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
33
12
0
14 Mar 2024
Object-Centric Diffusion for Efficient Video Editing
Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya
Adil Karjauv
Davide Abati
Fatih Porikli
Yuki M. Asano
A. Habibian
VGen
27
12
0
11 Jan 2024
EVE: Efficient zero-shot text-based Video Editing with Depth Map
  Guidance and Temporal Consistency Constraints
EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Yutao Chen
Xingning Dong
Tian Gan
Chunluan Zhou
Ming Yang
Qingpei Guo
DiffM
25
5
0
21 Aug 2023
Diffusion Models in Vision: A Survey
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffM
VLM
MedIm
188
1,133
0
10 Sep 2022
Denoising Diffusion Restoration Models
Denoising Diffusion Restoration Models
Bahjat Kawar
Michael Elad
Stefano Ermon
Jiaming Song
DiffM
204
774
0
27 Jan 2022
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr
Martin Danelljan
Andrés Romero
F. I. F. Richard Yu
Radu Timofte
Luc Van Gool
DiffM
211
1,353
0
24 Jan 2022
Palette: Image-to-Image Diffusion Models
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
325
1,584
0
10 Nov 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
232
75,445
0
18 May 2015
1