Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.02523
Cited By
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
4 November 2020
Mike Roberts
Jason Ramapuram
Anurag Ranjan
Atulit Kumar
Miguel Angel Bautista
Nathan Paczan
Russ Webb
Joshua M. Susskind
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding"
50 / 89 papers shown
Title
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Qi Dai
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
59
0
0
01 May 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Wufei Ma
Luoxin Ye
Nessa McWeeney
Celso M de Melo
A. Yuille
Jieneng Chen
LRM
57
1
0
01 May 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
57
0
0
21 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
40
0
0
19 Apr 2025
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding
Yuchen Rao
Stefan Ainetter
Sinisa Stekovic
Vincent Lepetit
Friedrich Fraundorfer
3DPC
3DV
131
0
0
18 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
36
0
0
15 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
60
0
0
15 Apr 2025
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang
Yurui Chen
Yanpeng Zhou
Yueming Xu
Ze Huang
...
Xinyue Cai
G. Huang
Xingyue Quan
Hang Xu
Li Zhang
LRM
89
0
0
29 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Y. Yang
85
1
0
16 Mar 2025
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Fanqi Pu
Yifan Wang
Jiru Deng
Wenming Yang
MDE
ViT
59
2
0
13 Mar 2025
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model
Zhiqiang Yan
Zhengxue Wang
Haoye Dong
Jun Yu Li
Jian Yang
Gim Hee Lee
64
0
0
06 Mar 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
108
4
0
11 Feb 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
40
0
0
24 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
69
16
0
23 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
46
1
0
29 Dec 2024
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Z. Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
136
1
0
18 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
91
0
0
01 Dec 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
98
2
0
24 Nov 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
50
30
0
24 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
66
5
0
14 Oct 2024
SceneCraft: Layout-Guided 3D Scene Generation
Xiuyu Yang
Yunze Man
Jun-Kun Chen
Yu-Xiong Wang
3DV
82
8
0
11 Oct 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
32
4
0
10 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
43
6
0
10 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Renhe Jiang
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
58
4
0
07 Oct 2024
Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection
Hongru Yan
Yu Zheng
Yueqi Duan
3DGS
69
2
0
02 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
44
40
0
26 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
55
10
0
23 Sep 2024
Colorful Diffuse Intrinsic Image Decomposition in the Wild
Chris Careaga
Yağız Aksoy
25
5
0
20 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
37
24
0
17 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
35
3
0
16 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
70
15
0
05 Sep 2024
Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization
Cho-Ying Wu
Yiqi Zhong
Junying Wang
Ulrich Neumann
MDE
54
0
0
04 Sep 2024
Disparity Estimation Using a Quad-Pixel Sensor
Zhuofeng Wu
Doehyung Lee
Zihua Liu
Kazunori Yoshizaki
Yusuke Monno
Masatoshi Okutomi
MDE
20
1
0
01 Sep 2024
Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation
Sai Prasanna
Daniel Honerkamp
Kshitij Sirohi
Tim Welschehold
Wolfram Burgard
Abhinav Valada
37
1
0
05 Aug 2024
CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning
Dongmyeong Lee
Amanda Adkins
Joydeep Biswas
40
0
0
12 Jul 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
Chongjie Ye
Lingteng Qiu
Xiaodong Gu
Qi Zuo
Yushuang Wu
Zilong Dong
Liefeng Bo
Yuliang Xiu
Xiaoguang Han
DiffM
32
39
0
24 Jun 2024
Latent Intrinsics Emerge from Training to Relight
Xiao Zhang
William Gao
Seemandhar Jain
Michael Maire
David.A.Forsyth
Anand Bhattad
38
3
0
31 May 2024
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
Xiankang He
Guangkai Xu
Bo Zhang
Hao Chen
Ying Cui
Dongyan Guo
DiffM
38
6
0
24 May 2024
WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users
William Huang
Sam Ghahremani
Siyou Pei
Yang Zhang
32
3
0
25 Apr 2024
EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting
Chia-Chen Lin
Mahesh Reddy
Guillaume Berger
Michel Sarkis
Fatih Porikli
Ning Bi
25
2
0
15 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
59
4
0
04 Apr 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
52
115
0
22 Mar 2024
IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images
Zhi-Hao Lin
Jia-Bin Huang
Zhengqin Li
Zhao Dong
Christian Richardt
Tuotuo Li
Michael Zollhöfer
Johannes Kopf
Shenlong Wang
Changil Kim
3DV
57
2
0
23 Jan 2024
A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification
Preeti Meena
Himanshu Kumar
Sandeep Yadav
3DH
27
2
0
19 Jan 2024
All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes
J. L. Gómez
Manuel Silva
Antonio Seoane
Agnes Borrás
Mario Noriega
Germán Ros
Jose A. Iglesias-Guitian
Antonio M. López
3DPC
64
12
0
19 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
39
62
0
11 Dec 2023
Intrinsic Image Decomposition via Ordinal Shading
Chris Careaga
Yagiz Aksoy
26
26
0
21 Nov 2023
Efficient Multi-Task Scene Analysis with RGB-D Transformers
Söhnke Benedikt Fischedick
Daniel Seichter
Robin M. Schmidt
Leonard Rabes
H. Groß
17
9
0
08 Jun 2023
SIDAR: Synthetic Image Dataset for Alignment & Restoration
Monika Kwiatkowski
Simon Matern
Olaf Hellwich
21
3
0
19 May 2023
Virtual Occlusions Through Implicit Depth
Jamie Watson
Mohamed Sayed
Z. Qureshi
Gabriel J. Brostow
Sara Vicente
Oisin Mac Aodha
Michael Firman
32
7
0
11 May 2023
1
2
Next