2109.02974
Cited By
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
7 September 2021
R. Liu
Hanming Deng
Yangyi Huang
Xiaoyu Shi
Lewei Lu
Wenxiu Sun
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
Papers citing
"FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting"
50 / 68 papers shown
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
34
0
0
04 Apr 2025
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
Tai D. Nguyen
Matthew C. Stamm
55
0
0
26 Mar 2025
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
Ruijie Lu
Yixin Chen
Yu Liu
Jiaxiang Tang
Junfeng Ni
Diwen Wan
Gang Zeng
Siyuan Huang
DiffM
VGen
51
3
0
15 Mar 2025
VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models
Chaohao Xie
Kai Han
Kwan-Yee K. Wong
VGen
DiffM
184
0
0
21 Jan 2025
Context-Aware Input Orchestration for Video Inpainting
Hoyoung Kim
Azimbek Khudoyberdiev
Seonghwan Jeong
Jihoon Ryoo
83
0
0
25 Nov 2024
Generative Omnimatte: Learning to Decompose Video into Layers
Yao-Chih Lee
Erika Lu
Sarah Rumbley
Michal Geyer
Jia-Bin Huang
Tali Dekel
Forrester Cole
DiffM
VGen
105
5
0
25 Nov 2024
Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization
Hongtao Wu
Yijun Yang
Angelica I Aviles-Rivero
Jingjing Ren
Sixiang Chen
Haoyu Chen
Lei Zhu
42
0
0
10 Oct 2024
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning
Jian Shi
Zhenyu Li
Peter Wonka
MDE
33
2
0
30 Sep 2024
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Sijie Zhao
Wenbo Hu
Xiaodong Cun
Yong Zhang
Xiaoyu Li
Zhe Kong
Xiangjun Gao
Muyao Niu
Ying Shan
VGen
DiffM
MDE
54
9
0
11 Sep 2024
Video Diffusion Models are Strong Video Inpainter
Minhyeok Lee
Suhwan Cho
Chajin Shin
Jungho Lee
Sunghun Yang
Sangyoun Lee
VGen
DiffM
47
7
0
21 Aug 2024
Video Inpainting Localization with Contrastive Learning
Zijie Lou
Gang Cao
Man Lin
50
1
0
25 Jun 2024
Trusted Video Inpainting Localization via Deep Attentive Noise Learning
Zijie Lou
Gang Cao
Man Lin
AAML
55
3
0
19 Jun 2024
Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring
Huicong Zhang
Haozhe Xie
H. Yao
38
7
0
11 Jun 2024
Semantically Consistent Video Inpainting with Conditional Diffusion Models
Dylan Green
William Harvey
Saeid Naderiparizi
Matthew Niedoba
Yunpeng Liu
...
Vasileios Lioutas
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank D. Wood
DiffM
36
1
0
30 Apr 2024
Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Zhong Ji
Yimu Su
Yan Zhang
Jiacheng Hou
Yanwei Pang
Jungong Han
38
2
0
24 Apr 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Zhiheng Liu
Ouyang Hao
Qiuyu Wang
Ka Leong Cheng
Jie Xiao
Kai Zhu
Nan Xue
Yu Liu
Yujun Shen
Yang Cao
DiffM
3DGS
45
20
0
17 Apr 2024
Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection
Ying Zhang
Yuezun Li
Bo Peng
Jiaran Zhou
Huiyu Zhou
Junyu Dong
48
0
0
17 Apr 2024
Towards Online Real-Time Memory-based Video Inpainting Transformers
Guillaume Thiry
Hao Tang
Radu Timofte
Luc Van Gool
ViT
21
0
0
24 Mar 2024
Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques
Shreyank N. Gowda
Yash Thakre
Shashank Narayana Gowda
Xiaobo Jin
32
0
0
31 Jan 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu
Xiangtai Li
Chenyang Si
Shangchen Zhou
Jingkang Yang
...
Yining Li
Kai Chen
Yunhai Tong
Ziwei Liu
Chen Change Loy
VGen
DiffM
MLLM
41
17
0
18 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
43
35
0
07 Jan 2024
DeepDR: Deep Structure-Aware RGB-D Inpainting for Diminished Reality
Christina Schwarz-Gsaxner
Shohei Mori
Dieter Schmalstieg
Jan Egger
Gerhard Paar
Werner Bailer
Denis Kalkofen
21
5
0
01 Dec 2023
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
DiffM
30
12
0
26 Nov 2023
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
Manyuan Zhang
Guanglu Song
Yu Liu
Hongsheng Li
16
14
0
24 Oct 2023
Improving Drumming Robot Via Attention Transformer Network
Yang Yi
Zonghan Li
31
0
0
04 Oct 2023
Dual-Augmented Transformer Network for Weakly Supervised Semantic Segmentation
Jingliang Deng
Zonghan Li
ViT
26
0
0
30 Sep 2023
Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method
Tianyi Liu
Kejun Wu
Yi Wang
Wenyang Liu
Kim-Hui Yap
Lap-Pui Chau
26
4
0
25 Sep 2023
ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou
Chongyi Li
Kelvin C. K. Chan
Chen Change Loy
ViT
35
90
0
07 Sep 2023
Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery
J. Park
Daniel Sungho Jung
Gyeongsik Moon
Kyoung Mu Lee
27
6
0
05 Sep 2023
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection
Yazhou Xing
Amrita Mazumdar
Anjul Patney
Chao Liu
Hongxu Yin
Qifeng Chen
Jan Kautz
I. Frosio
49
1
0
29 Aug 2023
UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization
Rui Zhang
Hongxia Wang
Ming-han Du
Hanqing Liu
Yangqiaoyu Zhou
Q. Zeng
31
21
0
28 Aug 2023
Deficiency-Aware Masked Transformer for Video Inpainting
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
26
9
0
17 Jul 2023
FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Yijin Li
Hongwei Qin
Jifeng Dai
Xiaogang Wang
Hongsheng Li
33
4
0
08 Jun 2023
Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos
Matthew Chang
Aditya Prakash
Saurabh Gupta
DiffM
35
17
0
25 May 2023
GRACE: Loss-Resilient Real-Time Video through Neural Codecs
Yihua Cheng
Ziyi Zhang
Han-Chiang Li
Anton Arapin
Yue Zhang
...
Xu Zhang
Francis Y. Yan
Amrita Mazumdar
Nick Feamster
Junchen Jiang
27
17
0
21 May 2023
Learning Global-aware Kernel for Image Harmonization
Xintian Shen
Jiangning Zhang
Jun Chen
Shipeng Bai
Yue Han
Yabiao Wang
Chengjie Wang
Yong-Jin Liu
28
7
0
19 May 2023
TransFlow: Transformer as Flow Learner
Yawen Lu
Qifan Wang
Siqi Ma
Tong Geng
Victor Y. Chen
Huaijin Chen
Dongfang Liu
ViT
35
45
0
23 Apr 2023
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
Xiaoyu Shi
Zhaoyang Huang
Weikang Bian
Dasong Li
Manyuan Zhang
Ka Chun Cheung
Simon See
Hongwei Qin
Jifeng Dai
Hongsheng Li
92
73
0
15 Mar 2023
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation
Roy Miles
M. K. Yucel
Bruno Manganelli
Albert Saà-Garriga
VOS
38
24
0
14 Mar 2023
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation
Xiaoyu Shi
Zhaoyang Huang
Dasong Li
Manyuan Zhang
Ka Chun Cheung
Simon See
Hongwei Qin
Jifeng Dai
Hongsheng Li
27
82
0
02 Mar 2023
One-Shot Video Inpainting
Sangjin Lee
Suhwan Cho
Sangyoun Lee
19
1
0
28 Feb 2023
MorphGANFormer: Transformer-based Face Morphing and De-Morphing
Naifeng Zhang
Xudong Liu
Xuzhao Li
Guo-Jun Qi
CVBM
18
5
0
18 Feb 2023
Transformer-based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey
S. Dubey
Satish Kumar Singh
ViT
41
33
0
17 Feb 2023
PhysFormer++: Facial Video-based Physiological Measurement with SlowFast Temporal Difference Transformer
Zitong Yu
Yuming Shen
Jingang Shi
Hengshuang Zhao
Yawen Cui
Jiehua Zhang
Philip H. S. Torr
Guoying Zhao
ViT
MedIm
29
80
0
07 Feb 2023
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting
Kaiwen Zhang
Jialun Peng
Jingjing Fu
Dong Liu
ViT
27
8
0
24 Jan 2023
VideoFACT: Detecting Video Forgeries Using Attention, Scene Context, and Forensic Traces
T. D. Nguyen
Shengbang Fang
Matthew C. Stamm
44
11
0
28 Nov 2022
Beyond the Field-of-View: Enhancing Scene Visibility and Perception with Clip-Recurrent Transformer
Haowen Shi
Qi Jiang
Kailun Yang
Xiaoyue Yin
Ze Wang
Kaiwei Wang
ViT
43
5
0
21 Nov 2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting
Zexu Pan
Wupeng Wang
Marvin Borsdorf
Haizhou Li
11
10
0
31 Oct 2022
DeViT: Deformed Vision Transformers in Video Inpainting
Jiayin Cai
Changlin Li
Xin Tao
Chun Yuan
Yu-Wing Tai
ViT
30
12
0
28 Sep 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
Meng-Hao Guo
Chenggang Lu
Qibin Hou
Zheng Liu
Ming-Ming Cheng
Shiyong Hu
SSeg
ViT
VLM
23
608
0
18 Sep 2022