Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.05783
Cited By
Unifying Flow, Stereo and Depth Estimation
10 November 2022
Haofei Xu
Jing Zhang
Jianfei Cai
Hamid Rezatofighi
F. I. F. Richard Yu
Dacheng Tao
Andreas Geiger
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unifying Flow, Stereo and Depth Estimation"
50 / 121 papers shown
Title
Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field
Jinlong Fan
Xuepu Zeng
J. Zhang
M. Gong
Yuxiang Yang
Dacheng Tao
3DGS
AI4CE
32
0
0
15 May 2025
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation
Yanbo Ding
DiffM
VGen
17
0
0
15 May 2025
Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic Images
Ziteng Liu
Dongdong He
Chenghong Zhang
Wenpeng Gao
Yili Fu
26
0
0
13 May 2025
Procedural Dataset Generation for Zero-Shot Stereo Matching
David Yan
Alexander R. E. Raistrick
Jia Deng
3DV
48
0
0
23 Apr 2025
TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting
Zhicong Wu
Hongbin Xu
Gang Xu
Ping Nie
Zhixin Yan
Jinkai Zheng
Liangqiong Qu
Ming Li
Liqiang Nie
3DGS
29
0
0
13 Apr 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
Zhong-Yu Li
Ruoyi Du
Juncheng Yan
Le Zhuo
Zhen Li
Peng Gao
Zhanyu Ma
Ming-Ming Cheng
VLM
68
2
0
10 Apr 2025
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
Lihua Liu
Jiehong Lin
Zhenxin Liu
Kui Jia
38
0
0
03 Apr 2025
Consistency-aware Self-Training for Iterative-based Stereo Matching
Jingyi Zhou
Peng Ye
H. Zhang
Jiakang Yuan
Rao Qiang
Liu YangChenXu
Wu Cailin
Feng Xu
Tao Chen
3DV
44
0
0
31 Mar 2025
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Kun Liu
Qi Liu
Xinchen Liu
Jie Li
Yongdong Zhang
Jiebo Luo
Xiaodong He
Wu Liu
VGen
35
0
0
31 Mar 2025
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Felix Wimbauer
Weirong Chen
Dominik Muhle
Christian Rupprecht
Daniel Cremers
VGen
65
0
0
30 Mar 2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres
Oliver Hahn
Charles Corbière
Simone Schaub-Meyer
Stefan Roth
Alexandre Alahi
MDE
37
0
0
30 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
39
0
0
28 Mar 2025
Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors
Weilong Yan
Ming Li
H. Li
S.
Robby T. Tan
MDE
77
0
0
26 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
57
3
0
24 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
61
0
0
20 Mar 2025
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Henrique Morimitsu
Xiaobin Zhu
Roberto M. Cesar Jr.
Xiangyang Ji
Xu-Cheng Yin
MDE
55
0
0
19 Mar 2025
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Xunzhi Zheng
Dan Xu
AI4CE
46
0
0
13 Mar 2025
Stereo Any Video: Temporally Consistent Stereo Matching
Junpeng Jing
Weixun Luo
Ye Mao
K. Mikolajczyk
46
0
0
07 Mar 2025
BANet: Bilateral Aggregation Network for Mobile Stereo Matching
Gangwei Xu
Jiaxin Liu
Xianqi Wang
JunDa Cheng
Yong Deng
Jinliang Zang
Yurui Chen
Xin-She Yang
52
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
48
0
0
05 Mar 2025
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Zhipeng Huang
Shaobin Zhuang
Canmiao Fu
Binxin Yang
Ying Zhang
Chong Sun
Zhizheng Zhang
Yali Wang
Chen Li
Zheng-Jun Zha
DiffM
69
1
0
03 Mar 2025
BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground
Yufei Wei
Sha Lu
Wangtao Lu
R. Xiong
Y. Wang
40
0
0
27 Feb 2025
L4P: Low-Level 4D Vision Perception Unified
Abhishek Badki
Hang Su
Bowen Wen
Orazio Gallo
VLM
78
1
0
18 Feb 2025
GFlow: Recovering 4D World from Monocular Video
Shizun Wang
Xingyi Yang
Qiuhong Shen
Zhenxiang Jiang
Xinchao Wang
VGen
86
17
0
03 Jan 2025
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training
Fardin Ayar
Ehsan Javanmardi
Manabu Tsukada
Mahdi Javanmardi
Mohammad Rahmati
VOS
32
0
0
31 Dec 2024
DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair
Weihang Li
Weirong Chen
Shenhan Qian
Jiajie Chen
Daniel Cremers
H. Li
3DGS
79
0
0
01 Dec 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
92
3
0
28 Nov 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
135
3
0
26 Nov 2024
BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation
Yufei Wei
Sha Lu
Fuzhang Han
R. Xiong
Yue Wang
26
1
0
15 Nov 2024
These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion
Chuang-Wei Liu
Yikang Zhang
Qijun Chen
Ioannis Pitas
Rui Fan
3DV
30
2
0
06 Nov 2024
Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli
Matthias Tangemann
Matthias Kümmerer
Matthias Bethge
VOS
36
0
0
03 Nov 2024
GameGen-X: Interactive Open-world Game Video Generation
Haoxuan Che
Xuanhua He
Quande Liu
C. Jin
Hao Chen
VGen
62
16
0
01 Nov 2024
Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe
Songyu Xu
Yicheng Hu
Jionglong Su
Daniel Elson
Baoru Huang
26
0
0
30 Oct 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
Zhiyuan Min
Yawei Luo
Jianwen Sun
Yi Yang
3DGS
36
0
0
30 Oct 2024
Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models
Nils Blank
Moritz Reuss
Marcel Rühle
Ömer Erdinç Yagmurlu
Fabian Wenzel
Oier Mees
Rudolf Lioutikov
LM&Ro
OffRL
29
4
0
23 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Yuan Zhou
Qiuyue Wang
Yuxuan Cai
Huan Yang
VGen
VLM
80
25
0
20 Oct 2024
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
50
29
0
17 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
66
16
0
14 Oct 2024
Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world
Han Ling
Yinghui Sun
Quansen Sun
Ivor Tsang
Yuhui Zheng
26
1
0
14 Oct 2024
Compressing Scene Dynamics: A Generative Approach
Shanzhi Yin
Zihan Zhang
Bolin Chen
Shiqi Wang
Yan Ye
VGen
29
0
0
13 Oct 2024
A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways
Jing Su
Yiqing Zhou
Yu Zhang
Chao Wang
Yi Wei
3DV
28
0
0
10 Oct 2024
HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
Shengji Tang
Weicai Ye
Peng Ye
Weihao Lin
Yang Zhou
Tao Chen
Wanli Ouyang
3DGS
31
7
0
08 Oct 2024
Self-Supervised Any-Point Tracking by Contrastive Random Walks
Ayush Shrivastava
Andrew Owens
28
3
0
24 Sep 2024
Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping
Jaehyung Jung
Simon Boche
Sebastián Barbas Laina
Stefan Leutenegger
98
1
0
18 Sep 2024
SOLVR: Submap Oriented LiDAR-Visual Re-Localisation
Joshua Knights
Sebastián Barbas Laina
Peyman Moghadam
Stefan Leutenegger
26
0
0
16 Sep 2024
LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow
Hongyu Wen
Erich Liang
Jia Deng
3DPC
34
5
0
09 Sep 2024
Hybrid Cost Volume for Memory-Efficient Optical Flow
Yang Zhao
Gangwei Xu
Gang Wu
31
2
0
06 Sep 2024
Disparity Estimation Using a Quad-Pixel Sensor
Zhuofeng Wu
Doehyung Lee
Zihua Liu
Kazunori Yoshizaki
Yusuke Monno
Masatoshi Okutomi
MDE
20
1
0
01 Sep 2024
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching
Gangwei Xu
Xianqi Wang
Zhaoxing Zhang
Junda Cheng
Chunyuan Liao
Xin Yang
3DV
58
9
0
01 Sep 2024
MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation
Linyan Yang
Lukas Hoyer
Mark Weber
Tobias Fischer
Dengxin Dai
Laura Leal-Taixé
Marc Pollefeys
Daniel Cremers
Luc Van Gool
MDE
32
3
0
29 Aug 2024
1
2
3
Next