Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.10869
Cited By
DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras
24 August 2021
Zachary Teed
Jia Deng
MDE
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras"
50 / 352 papers shown
Title
Large-Scale Gaussian Splatting SLAM
Zhe Xin
Chenyang Wu
Penghui Huang
Yanyong Zhang
Yinian Mao
Guoquan Huang
3DGS
37
0
0
15 May 2025
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware
Justin Yu
Letian Fu
Huang Huang
Karim El-Refai
Rares Ambrus
Richard Cheng
Muhammad Zubair Irshad
Ken Goldberg
18
0
0
14 May 2025
Learning to Drive Anywhere with Model-Based Reannotation
Noriaki Hirose
Lydia Ignatova
Kyle Stachowicz
Catherine Glossop
Sergey Levine
Dhruv Shah
24
0
0
08 May 2025
SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models
Shun Taguchi
Hideki Deguchi
Takumi Hamazaki
Hiroyuki Sakai
ReLM
LRM
42
0
0
08 May 2025
GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels
Yongxin Su
Lin Chen
Kaiting Zhang
Zhongliang Zhao
Chenfeng Hou
Ziping Yu
3DGS
22
0
0
03 May 2025
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li
Jinkun Cao
Haotian Zhang
Davis Rempe
Jan Kautz
Umar Iqbal
Ye Yuan
DiffM
VGen
51
1
0
02 May 2025
AnimateAnywhere: Rouse the Background in Human Image Animation
Xiaoyu Liu
Mingshuai Yao
Y. Zhang
Xianhui Lin
Peiran Ren
X. Li
Ming-Yu Liu
W. Zuo
3DH
DiffM
65
0
0
28 Apr 2025
Vysics: Object Reconstruction Under Occlusion by Fusing Vision and Contact-Rich Physics
Bibit Bianchini
Minghan Zhu
Mengti Sun
Bowen Jiang
Camillo J. Taylor
Michael Posa
24
0
0
25 Apr 2025
Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization
Guangyang Zeng
Yuan Shen
Ziyang Hong
Yuze Hong
Viorela Ila
Guodong Shi
Junfeng Wu
28
0
0
24 Apr 2025
Dynamic Camera Poses and Where to Find Them
C. Rockwell
Joseph Tung
Tsung-Yi Lin
Ming-Yu Liu
David Fouhey
Chen-Hsuan Lin
37
0
0
24 Apr 2025
ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration
Andrea Conti
Matteo Poggi
Valerio Cambareri
Martin R. Oswald
S. Mattoccia
3DGS
MDE
47
0
0
23 Apr 2025
PRaDA: Projective Radial Distortion Averaging
Daniil Sinitsyn
Linus Harenstam-Nielsen
Daniel Cremers
17
0
0
23 Apr 2025
SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos
Yuxin Yao
Yan Zhang
Zhening Huang
Joan Lasenby
3DGS
19
0
0
22 Apr 2025
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
Bowei Zhang
Lei Ke
Adam W. Harley
Katerina Fragkiadaki
3DPC
MDE
46
0
0
20 Apr 2025
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction
Weirong Chen
Ganlin Zhang
Felix Wimbauer
Rui Wang
Nikita Araslanov
Andrea Vedaldi
Daniel Cremers
50
0
0
20 Apr 2025
Seurat: From Moving Points to Depth
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPC
MDE
31
0
0
20 Apr 2025
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World
Haiwen Feng
Junyi Zhang
Qianqian Wang
Yufei Ye
Pengcheng Yu
Michael J. Black
Trevor Darrell
Angjoo Kanazawa
VGen
3DV
52
1
0
17 Apr 2025
ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos
Zetong Zhang
Manuel Kaufmann
Lixin Xue
Jie Song
Martin R. Oswald
3DH
64
0
0
17 Apr 2025
Regist3R: Incremental Registration with Stereo Foundation Model
Sidun Liu
Wenyu Li
Peng Qiao
Yong Dou
3DV
48
0
0
16 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
27
0
0
09 Apr 2025
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
Luo Ling
Bai Qianqian
LM&Ro
39
0
0
09 Apr 2025
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
Songyan Zhang
Yongtao Ge
Jinyuan Tian
Guangkai Xu
Hao Chen
Chen Lv
Chunhua Shen
3DPC
24
0
0
08 Apr 2025
VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets
Alejandro Fontan
Tobias Fischer
Javier Civera
Michael Milford
33
0
0
06 Apr 2025
A Self-Supervised Learning Approach with Differentiable Optimization for UAV Trajectory Planning
Yufei Jiang
Yuanzhu Zhan
Harsh Vardhan Gupta
Chinmay Borde
Junyi Geng
SSL
34
0
0
05 Apr 2025
Multi-identity Human Image Animation with Structural Video Diffusion
Zhenzhi Wang
Y. Li
Yanhong Zeng
Yuwei Guo
D. Lin
Tianfan Xue
Bo Dai
VGen
24
0
0
05 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Xin Zhang
Robby T. Tan
Mamba
48
0
0
04 Apr 2025
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
Jianhao Zheng
Zihan Zhu
Valentin Bieri
Marc Pollefeys
Songyou Peng
Iro Armeni
3DGS
24
0
0
04 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
65
1
0
01 Apr 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPC
VGen
38
1
0
31 Mar 2025
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Felix Wimbauer
Weirong Chen
Dominik Muhle
Christian Rupprecht
Daniel Cremers
VGen
65
0
0
30 Mar 2025
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
David Yifan Yao
Albert Zhai
Shenlong Wang
VGen
46
1
0
27 Mar 2025
HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM
Ziren Gong
Fabio Tosi
Youmin Zhang
S. Mattoccia
Matteo Poggi
39
0
0
27 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
70
0
0
27 Mar 2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Shijie Zhou
Hui Ren
Yijia Weng
Shuwang Zhang
Zhen Wang
...
Zhiwen Fan
Suya You
Z. Wang
Leonidas J. Guibas
A. Kadambi
VGen
3DGS
83
0
0
26 Mar 2025
DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios
Xiangting Meng
Jiaqi Yang
Mingshu Chen
C. Yan
Yujiao Shi
Wenchao Ding
L. Kneip
34
0
0
25 Mar 2025
GI-SLAM: Gaussian-Inertial SLAM
Xulang Liu
Ning Tan
3DGS
GP
39
0
0
24 Mar 2025
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Yingping Liang
Yutao Hu
Wenqi Shao
Ying Fu
MDE
42
0
0
21 Mar 2025
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Wonbong Jang
Philippe Weinzaepfel
Vincent Leroy
Lourdes Agapito
Jérôme Revaud
46
0
0
21 Mar 2025
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image
Jerred Chen
Ronald Clark
45
1
0
21 Mar 2025
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
Longbin Ji
Lei Zhong
Pengfei Wei
Changjian Li
DiffM
VGen
41
0
0
20 Mar 2025
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
Edgar Sucar
Zihang Lai
Eldar Insafutdinov
Andrea Vedaldi
46
0
0
20 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
50
0
0
16 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
50
7
0
14 Mar 2025
MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry
Yuheng Qiu
Yutian Chen
Zihao Zhang
Wenshan Wang
Sebastian A. Scherer
50
0
0
13 Mar 2025
GigaSLAM: Large-Scale Monocular SLAM with Hierachical Gaussian Splats
K. Deng
Jian Yang
Shenlong Wang
J. Xie
3DGS
45
0
0
11 Mar 2025
HumanMM: Global Human Motion Recovery from Multi-shot Videos
Y. Zhang
Guanlin Wu
Ling-Hao Chen
Zhuokai Zhao
Jing Lin
...
Jiamin Wu
Z. Li
Hao Frank Yang
Haoqian Wang
Lei Zhang
3DH
55
0
0
10 Mar 2025
AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones
Xiaowei Li
Kuan Xu
Fen Liu
Ruofei Bai
Shenghai Yuan
Lihua Xie
59
0
0
10 Mar 2025
Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation
Sihao Lin
Daqi Liu
Ruochong Fu
Dongrui Liu
A. Song
Hongwei Xie
Zhihui Li
Bing Wang
Xiaojun Chang
72
0
0
10 Mar 2025
Unified Human Localization and Trajectory Prediction with Monocular Vision
Po-Chien Luan
Yang Gao
Celine Demonsant
Alexandre Alahi
36
0
0
05 Mar 2025
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Xuanchi Ren
Tianchang Shen
Jiahui Huang
Huan Ling
Yifan Lu
Merlin Nimier-David
Thomas Muller
Alexander Keller
Sanja Fidler
Jun Gao
DiffM
VGen
74
8
0
05 Mar 2025
1
2
3
4
5
6
7
8
Next