2003.12039
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
European Conference on Computer Vision (ECCV), 2020
26 March 2020
Zachary Teed
Jia Deng
MDE
HuggingFace (1 upvote)
Github (3586★)
Papers citing "RAFT: Recurrent All-Pairs Field Transforms for Optical Flow"
50 / 1,785 papers shown
JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation
Fangda Chen
Shanshan Zhao
Chuanfu Xu
Long Lan
VGen
408
3
0
31 Mar 2025
Point Tracking in Surgery--The 2024 Surgical Tattoos in Infrared (STIR) Challenge
Adam Schmidt
Mert Asim Karaoglu
Soham Sinha
Mingang Jang
Ho-Gun Ha
...
Zijian Wu
A. Ladikos
S. DiMaio
Septimiu E. Salcudean
Omid Mohareri
222
3
0
31 Mar 2025
VideoGen-Eval: Agent-based System for Video Generation Evaluation
Yuhang Yang
Ke Fan
Siyang Song
Hongxiang Li
Ailing Zeng
FeiLin Han
Wei-dong Zhai
Wen Liu
Yang Cao
Zheng-jun Zha
EGVM
VGen
418
9
0
30 Mar 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
Xindi Yang
Baolu Li
Yanzhe Zhang
Zhenfei Yin
Lei Bai
...
Zhiyong Wang
Jianfei Cai
Tien-Tsin Wong
Huchuan Lu
Xu Jia
DiffM
VGen
500
0
0
30 Mar 2025
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Computer Vision and Pattern Recognition (CVPR), 2025
Felix Wimbauer
Weirong Chen
Dominik Muhle
Christian Rupprecht
Daniel Cremers
VGen
388
7
0
30 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
257
0
0
28 Mar 2025
Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision
Rulin Zhou
Wenlong He
An Wang
Qiqi Yao
Haijun Hu
Jiankun Wang
Xi Zhang
Hongliang Ren
226
0
0
28 Mar 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Dian Zheng
Ziqi Huang
Hongbo Liu
Kai Zou
Yinan He
...
Jingwen He
Wei-Shi Zheng
Botian Shi
Yu Qiao
Ziwei Liu
EGVM
VGen
339
95
0
27 Mar 2025
Multispectral Demosaicing via Dual Cameras
SaiKiran Tedla
Junyong Lee
Beixuan Yang
Mahmoud Afifi
M. Brown
313
0
0
27 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
311
7
0
27 Mar 2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Computer Vision and Pattern Recognition (CVPR), 2025
Shijie Zhou
Hui Ren
Yijia Weng
Shuwang Zhang
Zhen Wang
...
Zhiwen Fan
Suya You
Ziyi Wang
Leonidas Guibas
A. Kadambi
VGen
3DGS
370
5
0
26 Mar 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
T. Liu
Longxiang Zhang
Zhaoxi Chen
Guangcong Wang
Shoukang Hu
Liao Shen
Huiqiang Sun
Z. Cao
Wei Li
Ziwei Liu
VGen
3DGS
416
17
0
26 Mar 2025
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Tai D. Nguyen
Matthew C. Stamm
373
1
0
26 Mar 2025
Burst Image Super-Resolution with Mamba
Ozan Unal
Steven Marty
Dengxin Dai
Mamba
195
0
0
25 Mar 2025
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Computer Vision and Pattern Recognition (CVPR), 2025
Zihang Lai
Andrea Vedaldi
234
3
0
25 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
314
27
0
25 Mar 2025
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals
Stefan Stojanov
David Wendt
Seungwoo Kim
R. Venkatesh
Kevin T. Feigelis
Jiajun Wu
Daniel L. K. Yamins
SSL
259
4
0
25 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffM
VGen
508
47
0
24 Mar 2025
AMD-Hummingbird: Towards an Efficient Text-to-Video Model
Takashi Isobe
He Cui
Dong Zhou
Mengmeng Ge
D. Li
E. Barsoum
VGen
329
4
0
24 Mar 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
645
1
0
22 Mar 2025
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image
Jerred Chen
Ronald Clark
409
3
0
21 Mar 2025
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos
Yuang Feng
Shuyong Gao
Fuzhen Yan
Yicheng Song
Lingyi Hong
J. Hu
Wenqiang Zhang
VOS
272
0
0
21 Mar 2025
HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks
Computer Vision and Pattern Recognition (CVPR), 2025
Maria Pilligua
Danna Xue
Javier Vázquez-Corral
224
1
0
21 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
Jing Zhang
Lu Qi
Xuelong Li
Yunhai Tong
297
6
0
21 Mar 2025
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia
David Bourgin
Krishna Kumar Singh
Yuheng Li
Yan Kang
Zhan Xu
N. Jha
Yixiao Liu
DiffM
VGen
398
1
0
21 Mar 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Zongyuan Ge
Imran Razzak
DiffM
VGen
838
9
0
20 Mar 2025
Physically Grounded Monocular Depth via Nanophotonic Wavefront Prompting
Bingxuan Li
Jiahao Wu
Yuan Xu
Yunxiang Zhang
Zezheng Zhu
Nanfang Yu
Qi Sun
MDE
261
0
0
20 Mar 2025
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
Edgar Sucar
Zihang Lai
Eldar Insafutdinov
Andrea Vedaldi
265
21
0
20 Mar 2025
DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration
Suraj Singh
Anastasia Batsheva
Oleg Y. Rogov
Ahmed Bouridane
280
0
0
20 Mar 2025
4D Gaussian Splatting SLAM
Yanyan Li
Youxu Fang
Zunjie Zhu
Kunyi Li
Yong Ding
Federico Tombari
3DGS
403
3
0
20 Mar 2025
Temporal-Consistent Video Restoration with Pre-trained Diffusion Models
Hengkang Wang
Yang Liu
Huidong Liu
Chien Wang
Yanhui Guo
Hongdong Li
Bryan Wang
Ju Sun
DiffM
VGen
157
3
0
19 Mar 2025
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion
Computer Vision and Pattern Recognition (CVPR), 2025
Saad Lahlali
Sandra Kara
Hejer Ammar
Florian Chabot
Nicolas Granger
Hervé Le Borgne
Q. C. Pham
3DPC
283
0
0
19 Mar 2025
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight
Computer Vision and Pattern Recognition (CVPR), 2025
Cédric Vincent
Taehyoung Kim
Henri Meeß
260
2
0
19 Mar 2025
Learn Your Scales: Towards Scale-Consistent Generative Novel View Synthesis
Fereshteh Forghani
Jason J. Yu
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
Marcus A. Brubaker
DiffM
336
0
0
19 Mar 2025
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Computer Vision and Pattern Recognition (CVPR), 2025
Henrique Morimitsu
Xiaobin Zhu
Roberto M. Cesar Jr.
Xiangyang Ji
Xu-Cheng Yin
MDE
408
13
0
19 Mar 2025
Limb-Aware Virtual Try-On Network with Progressive Clothing Warping
IEEE Transactions on Multimedia (TMM), 2025
Shengping Zhang
Xiaoyu Han
Weigang Zhang
Xiangyuan Lan
Hongxun Yao
Qingming Huang
3DH
384
9
0
18 Mar 2025
Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
540
1
0
18 Mar 2025
GIFT: Generated Indoor video frames for Texture-less point tracking
Jianzheng Huang
Xianyu Mo
Ziling Liu
Jinyu Yang
Feng Zheng
DiffM
3DPC
3DV
VGen
382
0
0
17 Mar 2025
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
Hengjia Li
Lifan Jiang
Xi Xiao
Tianyang Wang
Hongwei Yi
Boxi Wu
Xiaofei He
VGen
198
12
0
16 Mar 2025
Progressive Limb-Aware Virtual Try-On
ACM Multimedia (ACM MM), 2022
Xiaoyu Han
Shengping Zhang
Qinglin Liu
Shunyuan Zheng
Chenyang Wang
3DH
405
5
0
16 Mar 2025
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning
Zihan Zhou
Changrui Dai
Aibo Song
Xiaolin Fang
VOS
394
0
0
15 Mar 2025
Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement
Yini Li
Nantheera Anantrasirichai
326
1
0
14 Mar 2025
EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation
Zengyu Wan
Wei-dong Zhai
Yang Cao
Zhengjun Zha
248
0
0
14 Mar 2025
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
H. Iqbal
Nazmul Karim
Umar Khalid
Azib Farooq
Z. Zhong
Jing Hua
Chen Chen
DiffM
3DGS
VGen
461
0
0
14 Mar 2025
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Xunzhi Zheng
Dan Xu
AI4CE
274
3
0
13 Mar 2025
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He
Ceyuan Yang
Shanchuan Lin
Yinghao Xu
Meng Wei
Liangke Gui
Qi Zhao
Gordon Wetzstein
Lu Jiang
Hongsheng Li
DiffM
VGen
391
42
0
13 Mar 2025
MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry
IEEE International Conference on Robotics and Automation (ICRA), 2024
Yuheng Qiu
Yutian Chen
Zihao Zhang
Wenshan Wang
Sebastian A. Scherer
381
8
0
13 Mar 2025
UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?
Yuanxin Liu
Rui Zhu
Shuhuai Ren
Jiacong Wang
Haoyuan Guo
Xu Sun
Lu Jiang
857
2
0
13 Mar 2025
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop
Chenyu Li
Oscar Michel
Xichen Pan
Sainan Liu
Mike Roberts
Saining Xie
VGen
229
25
0
12 Mar 2025
Depth-Assisted Network for Indiscernible Marine Object Counting with Adaptive Motion-Differentiated Feature Encoding
Chengzhi Ma
Kunqian Li
Shuaixin Liu
Han Mei
246
2
0
11 Mar 2025