2003.12039
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
European Conference on Computer Vision (ECCV), 2020
26 March 2020
Zachary Teed
Jia Deng
MDE
HuggingFace (1 upvote)
Github (3586★)
Papers citing "RAFT: Recurrent All-Pairs Field Transforms for Optical Flow"
50 / 1,785 papers shown
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Tianyu Yuan
Yuanbo Yang
Lin Chen
Yao Yao
Zhuzhong Qian
DiffM
VGen
236
0
0
04 Dec 2025
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring
Geunhyuk Youk
Jihyong Oh
Munchurl Kim
62
0
0
04 Dec 2025
Unique Lives, Shared World: Learning from Single-Life Videos
Tengda Han
Sayna Ebrahimi
Dilara Gokay
Li Yang Ku
M. Ovsjanikov
...
Daniel Zoran
Viorica Patraucean
João Carreira
Andrew Zisserman
Dima Damen
161
0
0
03 Dec 2025
Beyond Boundary Frames: Audio-Visual Semantic Guidance for Context-Aware Video Interpolation
Yuchen Deng
Xiuyang Wu
Hai-Tao Zheng
Jie Wang
Feidiao Yang
Yuxing Han
VGen
220
0
0
03 Dec 2025
Bayes-DIC Net: Estimating Digital Image Correlation Uncertainty with Bayesian Neural Networks
Biao Chen
Zhenhua Lei
Yahui Zhang
Tongzhi Niu
DiffM
BDL
179
0
0
03 Dec 2025
Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding
Haoran Zhou
Gim Hee Lee
3DGS
VGen
3DV
238
0
0
03 Dec 2025
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench
Lanxiang Hu
Abhilash Shankarampeta
Yixin Huang
Zilin Dai
Haoyang Yu
Yujie Zhao
Haoqiang Kang
Daniel Zhao
Tajana Rosing
Hao Zhang
VGen
LRM
225
1
0
02 Dec 2025
Disentangling Progress in Medical Image Registration: Beyond Trend-Driven Architectures towards Domain-Specific Strategies
Bailiang Jian
J. Pan
Rohit Jena
Morteza Ghahremani
Hongwei Bran Li
Daniel Rueckert
Christian Wachinger
Benedikt Wiestler
OOD
195
1
0
01 Dec 2025
Generative Video Motion Editing with 3D Point Tracks
Yao-Chih Lee
Zhoutong Zhang
Jiahui Huang
Jui-Hsien Wang
Joon-Young Lee
Jia-Bin Huang
Eli Shechtman
Zhengqi Li
DiffM
VGen
3DPC
262
0
0
01 Dec 2025
Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer
Dong In Lee
Hyungjun Doh
Seunggeun Chi
Runlin Duan
Sangpil Kim
K. Ramani
DiffM
3DGS
VGen
145
0
0
30 Nov 2025
EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Xiaoshan Wu
Yifei Yu
Xiaoyang Lyu
Yihua Huang
Bo Wang
Baoheng Zhang
Zhongrui Wang
Xiaojuan Qi
3DGS
68
0
0
30 Nov 2025
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Minh-Quan Le
Yuanzhi Zhu
Vicky Kalogeiton
Dimitris Samaras
EGVM
VGen
91
1
0
29 Nov 2025
Captain Safari: A World Engine
Yu-Cheng Chou
X. Wang
Yitong Li
Jiahao Wang
Hanting Liu
Cihang Xie
Alan Yuille
Junfei Xiao
VGen
175
0
0
28 Nov 2025
Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model
J. Tang
J. Liu
Jiaqi Li
Longhuang Wu
Haoyu Yang
Penghao Zhao
Siruis Gong
Xiang Yuan
Shuai Shao
Qinglin Lu
VGen
121
1
0
28 Nov 2025
Cascaded Robust Rectification for Arbitrary Document Images
Chaoyun Wang
Quanxin Huang
I-Chao Shen
Takeo Igarashi
Nanning Zheng
Caigui Jiang
137
0
0
28 Nov 2025
MARVO: Marine-Adaptive Radiance-aware Visual Odometry
Sacchin Sundar
Atman Kikani
Aaliya Alam
Sumukh Shrote
A. Nayeemulla Khan
A. Shahina
MDE
378
0
0
28 Nov 2025
Prompt-based Consistent Video Colorization
Silvia Dani
Tiberio Uricchio
Lorenzo Seidenari
DiffM
VGen
107
0
0
27 Nov 2025
Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction
Boyao Zhou
Shunyuan Zheng
Zhanfeng Liao
Zihan Ma
Hanzhang Tu
Boning Liu
Y. Liu
3DGS
187
0
0
27 Nov 2025
Video Generation Models Are Good Latent Reward Models
Xiaoyue Mi
W. Yu
Jiesong Lian
Shibo Jie
Ruizhe Zhong
...
Z. Zhou
Zhiyong Xu
Yuan Zhou
Qinglin Lu
Fan Tang
EGVM
VGen
352
0
0
26 Nov 2025
MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training
Haotian Xue
Qi-An Chen
Zhonghao Wang
Xun Huang
Eli Shechtman
Jinrong Xie
Yongxin Chen
DiffM
VGen
530
0
0
26 Nov 2025
MotionV2V: Editing Motion in a Video
R. Burgert
Charles Herrmann
Forrester Cole
Michael S. Ryoo
Neal Wadhwa
Andrey Voynov
Nataniel Ruiz
DiffM
VGen
243
0
0
25 Nov 2025
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Zhoujie Fu
Xianfang Zeng
Jinghong Lan
Xinyao Liao
Cheng Chen
...
Wei Cheng
Shiyu Liu
Y. Chen
Gang Yu
Guosheng Lin
DiffM
VGen
354
1
0
25 Nov 2025
ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction
Yuanzhe Li
Steffen Müller
ViT
173
0
0
25 Nov 2025
Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation
Ruojun Xu
Yu Kai
Xuhua Ren
Jiaxiang Cheng
Bing Ma
Tianxiang Zheng
Qinglin Lu
EGVM
160
0
0
24 Nov 2025
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
Duolikun Danier
Ge Gao
Steven McDonagh
Changjian Li
Hakan Bilen
Oisin Mac Aodha
DiffM
VGen
136
0
0
24 Nov 2025
Point-to-Point: Sparse Motion Guidance for Controllable Video Editing
Yeji Song
Jaehyun Lee
Mijin Koo
Junhoo Lee
Nojun Kwak
DiffM
VGen
97
0
0
23 Nov 2025
RigAnyFace: Scaling Neural Facial Mesh Auto-Rigging with Unlabeled Data
Wenchao Ma
Dario Kneubuehler
Maurice Chu
Ian Sachs
Haomiao Jiang
Sharon X. Huang
3DH
293
0
0
23 Nov 2025
FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement
Wenshuo Gao
Junyi Fan
Jiangyue Zeng
Shuai Yang
73
0
0
23 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffM
VGen
152
0
0
23 Nov 2025
C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction
Kuan Wei Huang
Brandon Li
Bharath Hariharan
Noah Snavely
3DPC
3DV
374
0
0
23 Nov 2025
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett-Luce Ranking
Chengan Che
Chao Wang
Xinyue Chen
Sophia Tsoka
Luis C. Garcia-Peraza-Herrera
AI4TS
205
0
0
21 Nov 2025
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Yujiang Pu
Zhanbo Huang
Vishnu Boddeti
Yu Kong
DiffM
VGen
119
0
0
21 Nov 2025
LAOF: Robust Latent Action Learning with Optical Flow Constraints
Xizhou Bu
Jiexi Lyu
Fulei Sun
R. G. Yang
Zhiqiang Ma
Wei Li
108
0
0
20 Nov 2025
EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering
Pierrick Bournez
Luca Savant Aira
T. Ehret
Gabriele Facciolo
3DGS
113
0
0
20 Nov 2025
Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography
Shourov Joarder
Tushar Talukder Showrav
Md. Kamrul Hasan
44
0
0
19 Nov 2025
CPSL: Representing Volumetric Video via Content-Promoted Scene Layers
Kaiyuan Hu
Yili Jin
Junhua Liu
Xize Duan
Hong Kang
Xue Liu
68
0
0
18 Nov 2025
Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
Junlong Li
Huaiyuan Xu
Sijie Cheng
Kejun Wu
Kim-Hui Yap
Lap-Pui Chau
Yi Wang
EgoV
235
0
0
17 Nov 2025
Free-Form Scene Editor: Enabling Multi-Round Object Manipulation like in a 3D Engine
Xincheng Shuai
Zhenyuan Qin
Henghui Ding
Dacheng Tao
DiffM
167
0
0
17 Nov 2025
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
Tushar Anand
Advik Sinha
Abhijit Das
Mamba
132
0
0
16 Nov 2025
DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry
Cheng Liao
78
0
0
16 Nov 2025
RadarMP: Motion Perception for 4D mmWave Radar in Autonomous Driving
Ruiqi Cheng
Huijun Di
Jian Li
Feng Liu
Wei Liang
154
0
0
15 Nov 2025
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
Seyedehanita Madani
Vishal M. Patel
188
0
0
11 Nov 2025
Non-Aligned Reference Image Quality Assessment for Novel View Synthesis
Abhijay Ghildyal
Rajesh Sureddi
Nabajeet Barman
Saman Zadtootaghaj
Alan C. Bovik
145
0
0
11 Nov 2025
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
239
2
0
11 Nov 2025
TiS-TSL: Image-Label Supervised Surgical Video Stereo Matching via Time-Switchable Teacher-Student Learning
Rui Wang
Ying Zhou
Hao Wang
Wenwei Zhang
Qiang Li
Zhiwei Wang
343
0
0
10 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
182
1
0
10 Nov 2025
FlowFeat: Pixel-Dense Embedding of Motion Profiles
Nikita Araslanov
Anna Sonnweber
Daniel Cremers
MDE
362
1
0
10 Nov 2025
StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Tianrui Feng
Z. Li
Shuo Yang
Haocheng Xi
Muyang Li
...
S. Han
Maneesh Agrawala
Kurt Keutzer
Akio Kodaira
Chenfeng Xu
VGen
148
3
0
10 Nov 2025
Tracking and Understanding Object Transformations
Yihong Sun
Xinyu Yang
Jennifer J. Sun
Bharath Hariharan
172
0
0
06 Nov 2025
Estimation of Segmental Longitudinal Strain in Transesophageal Echocardiography by Deep Learning
IEEE Journal of Biomedical and Health Informatics (JBHI), 2025
Anders Austlid Taskén
Thierry Judge
Erik Andreas Rye Berg
Jinyang Yu
Bjørnar Grenne
...
Svend Aakhus
Pierre-Marc Jodoin
Nicolas Duchateau
Olivier Bernard
G. Kiss
80
0
0
04 Nov 2025