v1v2 (latest)

DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras

Neural Information Processing Systems (NeurIPS), 2021

24 August 2021

ArXiv (abs)PDF HTML Github (2072★)

Papers citing "DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras"

50 / 456 papers shown

MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction

179

03 Dec 2025

What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models

...

03 Dec 2025

TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction

262

02 Dec 2025

VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM

02 Dec 2025

EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly

119

01 Dec 2025

IGen: Scalable Data Generation for Robot Learning from Open-World Images

...

161

01 Dec 2025

KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM

136

01 Dec 2025

EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes

30 Nov 2025

Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation

101

27 Nov 2025

Dual-Agent Reinforcement Learning for Adaptive and Cost-Aware Visual-Inertial Odometry

306

26 Nov 2025

4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models

...

231

25 Nov 2025

Zoo3D: Zero-Shot 3D Object Detection at Scene Level

440

25 Nov 2025

AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend

Hengyi Wang

Lourdes Agapito

25 Nov 2025

HABIT: Human Action Benchmark for Interactive Traffic in CARLA

139

24 Nov 2025

Any4D: Open-Prompt 4D Generation from Natural Language and Images

Hao Li

Qiao Sun

VGen LM&Ro

183

24 Nov 2025

SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes

101

23 Nov 2025

4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation

23 Nov 2025

SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors

220

21 Nov 2025

iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting InversionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2025

397

18 Nov 2025

BEDLAM2.0: Synthetic Humans and Cameras in Motion

J. Tesch

Giorgio Becherini

Prerana Achar

Anastasios Yiannakidis

196

18 Nov 2025

4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular VideosEducational Data Mining (EDM), 2025

153

07 Nov 2025

AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM

213

30 Oct 2025

Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation

Marziyeh Bamdad

Hans-Peter Hutter

Alireza Darvishy

168

23 Oct 2025

MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes

123

17 Oct 2025

C4D: 4D Made from 3D through Dual Correspondences

182

16 Oct 2025

Leveraging Cycle-Consistent Anchor Points for Self-Supervised RGB-D RegistrationIEEE International Conference on Robotics and Automation (ICRA), 2024

228

16 Oct 2025

SUM-AgriVLN: Spatial Understanding Memory for Agricultural Vision-and-Language Navigation

Xiaobei Zhao

Xingqi Lyu

Xiang Li

130

16 Oct 2025

SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding

117

14 Oct 2025

Online Video Depth Anything: Temporally-Consistent Depth Prediction with Low Memory Consumption

Johann-Friedrich Feiden

122

10 Oct 2025

Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians

124

10 Oct 2025

ReSplat: Learning Recurrent Gaussian Splats

153

09 Oct 2025

ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation

171

09 Oct 2025

Dropping the D: RGB-D SLAM Without the Depth Sensor

472

07 Oct 2025

Human3R: Everyone Everywhere All at Once

206

07 Oct 2025

OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSSIEEE Transactions on robotics (IEEE TRO), 2025

Simon Boche

Jaehyung Jung

Sebastián Barbas Laina

Stefan Leutenegger

204

06 Oct 2025

EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction

113

02 Oct 2025

Instant4D: 4D Gaussian Splatting in Minutes

177

01 Oct 2025

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

30 Sep 2025

TTT3R: 3D Reconstruction as Test-Time Training

282

30 Sep 2025

GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State

131

28 Sep 2025

MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM

189

25 Sep 2025

iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning

228

23 Sep 2025

TUN3D: Towards Real-World Scene Understanding from Unposed Images

181

23 Sep 2025

VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video

119

22 Sep 2025

ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos

155

22 Sep 2025

SLAM-Former: Putting SLAM into One Transformer

127

21 Sep 2025

ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM

Amanuel T. Dufera

Yuan-Li Cai

3DGS MDE

187

21 Sep 2025

BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots

177

18 Sep 2025

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes

Fang Li

Hao Zhang

Narendra Ahuja

165

18 Sep 2025

MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

239

17 Sep 2025