Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2003.12039
Cited By
v1
v2
v3 (latest)
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
European Conference on Computer Vision (ECCV), 2020
26 March 2020
Zachary Teed
Gaowen Liu
MDE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3586★)
Papers citing
"RAFT: Recurrent All-Pairs Field Transforms for Optical Flow"
50 / 1,790 papers shown
WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field
Qi Zhu
Jingyi Zhang
Naishan Zheng
Wei Yu
Jinghao Zhang
Deyi Ji
Feng Zhao
123
0
0
05 Dec 2025
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Tianyu Yuan
Yuanbo Yang
Lin Chen
Yao Yao
Zhuzhong Qian
VGen
297
0
0
04 Dec 2025
IE2Video: Adapting Pretrained Diffusion Models for Event-Based Video Reconstruction
D. Torbunov
Onur Okuducu
Yi Huang
Odera Dim
Rebecca Coles
Yonggang Cui
Yihui Ren
DiffM
VGen
173
0
0
04 Dec 2025
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring
Geunhyuk Youk
Jihyong Oh
Munchurl Kim
133
0
0
04 Dec 2025
Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Yuchen Deng
Xiuyang Wu
Hai-Tao Zheng
Jie Wang
Feidiao Yang
Yuxing Han
VGen
257
0
0
03 Dec 2025
Bayes-DIC Net: Estimating Digital Image Correlation Uncertainty with Bayesian Neural Networks
Biao Chen
Zhenhua Lei
Yahui Zhang
Tongzhi Niu
DiffM
BDL
213
0
0
03 Dec 2025
Unique Lives, Shared World: Learning from Single-Life Videos
Tengda Han
Sayna Ebrahimi
Dilara Gokay
Li Yang Ku
M. Ovsjanikov
...
Daniel Zoran
Viorica Patraucean
João Carreira
Andrew Zisserman
Dima Damen
235
0
0
03 Dec 2025
Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding
Haoran Zhou
Gim Hee Lee
3DGS
VGen
3DV
256
0
0
03 Dec 2025
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Lanxiang Hu
Abhilash Shankarampeta
Yixin Huang
Zilin Dai
Haoyang Yu
Yujie Zhao
Haoqiang Kang
Daniel Zhao
Tajana Rosing
Hao Zhang
VGen
LRM
260
1
0
02 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffM
VGen
3DPC
333
2
0
01 Dec 2025
Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies
Bailiang Jian
J. Pan
Rohit Jena
Morteza Ghahremani
Hongwei Bran Li
Daniel Rueckert
Christian Wachinger
Benedikt Wiestler
OOD
236
2
0
01 Dec 2025
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee
Hyungjun Doh
Seunggeun Chi
Runlin Duan
Sangpil Kim
K. Ramani
DiffM
3DGS
VGen
194
0
0
30 Nov 2025
EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Xiaoshan Wu
Yifei Yu
Xiaoyang Lyu
Yihua Huang
Bo Wang
Baoheng Zhang
Zhongrui Wang
Xiaojuan Qi
3DGS
142
1
0
30 Nov 2025
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Minh-Quan Le
Yuanzhi Zhu
Vicky Kalogeiton
Dimitris Samaras
EGVM
VGen
131
3
0
29 Nov 2025
Captain Safari: A World Engine with Pose-Aligned 3D Memory
Yu-Cheng Chou
X. Wang
Yitong Li
Jiahao Wang
Hanting Liu
Cihang Xie
Alan Yuille
Junfei Xiao
VGen
245
0
0
28 Nov 2025
Cascaded Robust Rectification for Arbitrary Document Images
Chaoyun Wang
Quanxin Huang
I-Chao Shen
Takeo Igarashi
Nanning Zheng
Caigui Jiang
182
0
0
28 Nov 2025
Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model
J. Tang
J. Liu
Jiaqi Li
Longhuang Wu
Haoyu Yang
...
Siruis Gong
Xiang Yuan
Shuai Shao
Qinglin Lu
Qinglin Lu
VGen
165
13
0
28 Nov 2025
MARVO: Marine-Adaptive Radiance-aware Visual Odometry
Sacchin Sundar
Atman Kikani
Aaliya Alam
Sumukh Shrote
A. Nayeemulla Khan
A. Shahina
MDE
418
0
0
28 Nov 2025
Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction
Boyao Zhou
Shunyuan Zheng
Zhanfeng Liao
Zihan Ma
Hanzhang Tu
Boning Liu
Y. Liu
3DGS
216
0
0
27 Nov 2025
Prompt-based Consistent Video Colorization
International Conference on Image Analysis and Processing (ICIAP), 2025
Silvia Dani
Tiberio Uricchio
Lorenzo Seidenari
DiffM
VGen
130
0
0
27 Nov 2025
MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training
Haotian Xue
Qi-An Chen
Zhonghao Wang
Xun Huang
Eli Shechtman
Jinrong Xie
Yongxin Chen
DiffM
VGen
588
1
0
26 Nov 2025
Video Generation Models Are Good Latent Reward Models
Xiaoyue Mi
W. Yu
Jiesong Lian
Shibo Jie
Ruizhe Zhong
...
Z. Zhou
Zhiyong Xu
Yuan Zhou
Qinglin Lu
Fan Tang
EGVM
VGen
440
5
0
26 Nov 2025
ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction
Yuanzhe Li
Steffen Müller
ViT
210
0
0
25 Nov 2025
MotionV2V: Editing Motion in a Video
R. Burgert
Charles Herrmann
Forrester Cole
Michael S. Ryoo
Neal Wadhwa
Andrey Voynov
Nataniel Ruiz
DiffM
VGen
295
2
0
25 Nov 2025
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Zhoujie Fu
Xianfang Zeng
Jinghong Lan
Xinyao Liao
Cheng Chen
...
Wei Cheng
Shiyu Liu
Y. Chen
Gang Yu
Guosheng Lin
DiffM
VGen
410
1
0
25 Nov 2025
Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation
Ruojun Xu
Yu Kai
Xuhua Ren
Jiaxiang Cheng
Bing Ma
Tianxiang Zheng
Qinhlin Lu
EGVM
211
1
0
24 Nov 2025
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
Duolikun Danier
Ge Gao
Steven McDonagh
Changjian Li
Hakan Bilen
Oisin Mac Aodha
DiffM
VGen
173
1
0
24 Nov 2025
FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement
Wenshuo Gao
Junyi Fan
Jiangyue Zeng
Shuai Yang
106
0
0
23 Nov 2025
RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data
Wenchao Ma
Dario Kneubuehler
Maurice Chu
Ian Sachs
Haomiao Jiang
Sharon X. Huang
3DH
326
0
0
23 Nov 2025
C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction
Kuan Wei Huang
Brandon Li
Bharath Hariharan
Noah Snavely
3DPC
3DV
416
1
0
23 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffM
VGen
202
1
0
23 Nov 2025
Point-to-Point: Sparse Motion Guidance for Controllable Video Editing
Yeji Song
Jaehyun Lee
Mijin Koo
Junhoo Lee
Nojun Kwak
DiffM
VGen
125
0
0
23 Nov 2025
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett-Luce Ranking
Chengan Che
Chao Wang
Xinyue Chen
Sophia Tsoka
Luis C. Garcia-Peraza-Herrera
SSL
AI4TS
260
0
0
21 Nov 2025
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Yujiang Pu
Zhanbo Huang
Vishnu Boddeti
Yu Kong
DiffM
VGen
143
0
0
21 Nov 2025
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu
Jiexi Lyu
Fulei Sun
R. G. Yang
Zhiqiang Ma
Wei Li
173
2
0
20 Nov 2025
EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering
Pierrick Bournez
Luca Savant Aira
T. Ehret
Gabriele Facciolo
3DGS
136
0
0
20 Nov 2025
Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography
Shourov Joarder
Tushar Talukder Showrav
Md. Kamrul Hasan
71
0
0
19 Nov 2025
CPSL: Representing Volumetric Video via Content-Promoted Scene Layers
Kaiyuan Hu
Yili Jin
Junhua Liu
Xize Duan
Hong Kang
Xue Liu
116
0
0
18 Nov 2025
Free-Form Scene Editor: Enabling Multi-Round Object Manipulation like in a 3D Engine
Xincheng Shuai
Zhenyuan Qin
Henghui Ding
Dacheng Tao
DiffM
196
1
0
17 Nov 2025
Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
Junlong Li
Huaiyuan Xu
Sijie Cheng
Kejun Wu
Kim-Hui Yap
Lap-Pui Chau
Yi Wang
EgoV
346
0
0
17 Nov 2025
DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry
Cheng Liao
113
0
0
16 Nov 2025
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
Tushar Anand
Advik Sinha
Abhijit Das
Mamba
156
0
0
16 Nov 2025
RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving
Ruiqi Cheng
Huijun Di
Jian Li
Feng Liu
Wei Liang
179
0
0
15 Nov 2025
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
Seyedehanita Madani
Vishal M. Patel
219
0
0
11 Nov 2025
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
314
4
0
11 Nov 2025
Non-Aligned Reference Image Quality Assessment for Novel View Synthesis
Abhijay Ghildyal
Rajesh Sureddi
Nabajeet Barman
Saman Zadtootaghaj
Alan C. Bovik
188
0
0
11 Nov 2025
FlowFeat: Pixel-Dense Embedding of Motion Profiles
Nikita Araslanov
Anna Sonnweber
Daniel Cremers
MDE
407
1
0
10 Nov 2025
StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Tianrui Feng
Z. Li
Shuo Yang
Haocheng Xi
Muyang Li
...
S. Han
Maneesh Agrawala
Kurt Keutzer
Akio Kodaira
Chenfeng Xu
VGen
208
9
0
10 Nov 2025
TiS-TSL: Image-Label Supervised Surgical Video Stereo Matching via Time-Switchable Teacher-Student Learning
Rui Wang
Ying Zhou
Hao Wang
Wenwei Zhang
Qiang Li
Zhiwei Wang
405
0
0
10 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
203
1
0
10 Nov 2025
1
2
3
4
...
34
35
36
Next
Page 1 of 36
Page
of 36
Go