v1v2v3v4v5 (latest)

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

4 November 2020

Miguel Angel Bautista

Papers citing "Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding"

50 / 358 papers shown

COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

...

239

04 Dec 2025

LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging

...

280

04 Dec 2025

UniLight: A Unified Representation for Lighting

Zitian Zhang

Iliyan Georgiev

Michael Fischer

Yannick Hold-Geoffroy

Jean-François Lalonde

Valentin Deschaintre

132

03 Dec 2025

ReasonX: MLLM-Guided Intrinsic Image Decomposition

106

03 Dec 2025

MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models

195

03 Dec 2025

LumiX: Structured and Coherent Text-to-Intrinsic Generation

230

02 Dec 2025

FOD-S2R: A FOD Dataset for Sim2Real Transfer Learning based Object Detection

Ashish Vashist

Qiranul Saadiyean

Suresh Sundaram

Chandra Sekhar Seelamantula

106

01 Dec 2025

Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

252

30 Nov 2025

MARVO: Marine-Adaptive Radiance-aware Visual Odometry

426

28 Nov 2025

Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation

149

27 Nov 2025

CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion

...

280

26 Nov 2025

^2

VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

355

26 Nov 2025

Qwen3-VL Technical Report

...

2.2K

446

26 Nov 2025

AmodalGen3D: Generative Amodal 3D Object Reconstruction from Sparse Unposed Views

Junwei Zhou

Yu-Wing Tai

26 Nov 2025

LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight

173

25 Nov 2025

AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend

Hengyi Wang

Lourdes Agapito

176

25 Nov 2025

DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video

189

24 Nov 2025

Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers

374

24 Nov 2025

4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation

145

23 Nov 2025

Muskie: Multi-view Masked Image Modeling for 3D Vision Pre-training

222

22 Nov 2025

MuM: Multi-View Masked Image Modeling for 3D Vision

322

21 Nov 2025

Multi-Order Matching Network for Alignment-Free Depth Super-Resolution

326

20 Nov 2025

RoMa v2: Harder Better Faster Denser Feature Matching

592

19 Nov 2025

Lightweight Optimal-Transport Harmonization on Edge Devices

120

16 Nov 2025

Depth Anything 3: Recovering the Visual Space from Any Views

989

137

13 Nov 2025

Visual Spatial Tuning

...

409

07 Nov 2025

Room Envelopes: A Synthetic Dataset for Indoor Layout Reconstruction from Images

Sam Bahrami

Dylan Campbell

3DV

270

06 Nov 2025

Cambrian-S: Towards Spatial Supersensing in Video

...

213

06 Nov 2025

Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis

172

31 Oct 2025

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

201

30 Oct 2025

Rethinking Visual Intelligence: Insights from Video Pretraining

245

28 Oct 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

582

27 Oct 2025

Symmetria: A Synthetic Dataset for Learning in Point Clouds

138

27 Oct 2025

M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Monocular Spatial Perception

U.V.B.L Udugama

G. Vosselman

F. Nex

174

20 Oct 2025

DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning

183

15 Oct 2025

WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting

221

12 Oct 2025

Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding

285

10 Oct 2025

OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference

187

09 Oct 2025

Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation

Sam Sartor

Pieter Peers

DiffM

205

07 Oct 2025

Benchmark on Monocular Metric Depth Estimation in Wildlife Setting

250

06 Oct 2025

Improved probabilistic regression using diffusion models

216

06 Oct 2025

DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance

216

30 Sep 2025

$DA$^{2}$: Depth Anything in Any Direction$

^{2}

: Depth Anything in Any Direction

637

30 Sep 2025

BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

368

29 Sep 2025

EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model

224

26 Sep 2025

ControlEvents: Controllable Synthesis of Event Camera Datawith Foundational Prior from Image Diffusion Models

210

26 Sep 2025

SLAM-Former: Putting SLAM into One Transformer

155

21 Sep 2025

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes

253

19 Sep 2025

SPATIALGEN: Layout-guided 3D Indoor Scene Generation

454

18 Sep 2025

Efficient 3D Perception on Embedded Systems via Interpolation-Free Tri-Plane Lifting and Volume Fusion

Sibaek Lee

Jiung Yeon

Hyeonwoo Yu

154

18 Sep 2025