Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.04994
Cited By
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
11 October 2021
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans"
50 / 221 papers shown
Title
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization
Yueh-Cheng Liu
Lukas Höllein
Matthias Nießner
Angela Dai
3DGS
24
0
0
08 May 2025
Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
Weizhen He
Yunfeng Yan
Shixiang Tang
Yiheng Deng
Yangyang Zhong
Pengxin Luo
Donglian Qi
VLM
86
1
0
29 Apr 2025
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation
Shivam Duggal
Yushi Hu
Oscar Michel
Aniruddha Kembhavi
William T. Freeman
Noah A. Smith
Ranjay Krishna
Antonio Torralba
Ali Farhadi
Wei-Chiu Ma
EGVM
ELM
70
0
0
25 Apr 2025
The Fourth Monocular Depth Estimation Challenge
Anton Obukhov
Matteo Poggi
Fabio Tosi
Ripudaman Singh Arora
Jaime Spencer
...
Tuan-Anh Yang
Minh-Quang Nguyen
T. Tran
Albert Luginov
Muhammad Shahzad
MDE
55
0
0
24 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
57
0
0
21 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
40
0
0
19 Apr 2025
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Yanrui Bin
Wenbo Hu
Haoyuan Wang
Xinya Chen
Bing Wang
DiffM
45
0
0
15 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
60
0
0
15 Apr 2025
VibrantLeaves: A principled parametric image generator for training deep restoration models
Raphaël Achddou
Y. Gousseau
Saïd Ladjal
Sabine Süsstrunk
23
0
0
14 Apr 2025
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images
Byeongjun Kwon
Munchurl Kim
VLM
MDE
55
0
0
28 Mar 2025
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
Wenyuan Zhang
Yixiao Yang
Han Huang
Liang Han
Kanle Shi
Yu-Shen Liu
Zhizhong Han
MDE
55
3
0
24 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGS
VGen
63
0
0
20 Mar 2025
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Junfeng Ni
Yu Liu
Ruijie Lu
Zirui Zhou
Song-Chun Zhu
Yixin Chen
Siyuan Huang
DiffM
59
4
0
19 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
50
0
0
16 Mar 2025
Online Language Splatting
Saimouli Katragadda
Cho-Ying Wu
Yuliang Guo
Xinyu Huang
G. Huang
Liu Ren
3DGS
OffRL
60
0
0
12 Mar 2025
LBM: Latent Bridge Matching for Fast Image-to-Image Translation
Clement Chadebec
O. Tasar
Sanjeev Sreetharan
Benjamin Aubin
37
0
0
10 Mar 2025
Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering
Idris O. Sunmola
Zhenjun Zhao
Samuel Schmidgall
Yumeng Wang
Paul Maria Scheikl
A. Krieger
3DGS
53
0
0
06 Mar 2025
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Luigi Piccinelli
Christos Sakaridis
Y. Yang
Mattia Segu
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
41
6
0
27 Feb 2025
IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360
∘
^\circ
∘
Cameras
Dongki Jung
Jaehoon Choi
Yonghan Lee
Dinesh Manocha
47
0
0
20 Feb 2025
Challenges of Multi-Modal Coreset Selection for Depth Prediction
Viktor Moskvoretskii
Narek Alvandian
39
0
0
20 Feb 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
66
16
0
23 Jan 2025
A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation
S. Landgraf
Rongjun Qin
Markus Ulrich
UQCV
70
0
0
14 Jan 2025
OneLLM: One Framework to Align All Modalities with Language
Jiaming Han
Kaixiong Gong
Yiyuan Zhang
Jiaqi Wang
Kaipeng Zhang
D. Lin
Yu Qiao
Peng Gao
Xiangyu Yue
MLLM
104
107
0
10 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
46
1
0
29 Dec 2024
SDRS: Shape-Differentiable Robot Simulator
Xiaohan Ye
Xifeng Gao
Kui Wu
Zherong Pan
Taku Komura
33
0
0
26 Dec 2024
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions
Sijia Jiang
Tong Wu
Jing Hua
Zhizhong Han
3DV
66
1
0
21 Dec 2024
Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion
Adam Bethell
Ravi Garg
Ian Reid
DiffM
68
0
0
16 Dec 2024
Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors
Alex Rich
Noah Stier
P. Sen
Tobias Höllerer
MDE
72
0
0
08 Dec 2024
Planar Gaussian Splatting
F. G. Zanjani
H. Cai
Hanno Ackermann
Leila Mirvakhabova
Fatih Porikli
3DGS
69
1
0
02 Dec 2024
Explorations in Self-Supervised Learning: Dataset Composition Testing for Object Classification
Raynor Kirkson E. Chavez
Kyle Gabriel M. Reynoso
66
0
0
01 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
91
0
0
01 Dec 2024
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
Yuze He
Wang Zhao
Shaohui Liu
Yubin Hu
Yushi Bai
Yu-Hui Wen
Y. Liu
87
1
0
29 Nov 2024
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Duc-Hai Pham
Tung Do
P. Nguyen
Binh-Son Hua
K. Nguyen
Rang Nguyen
MDE
78
1
0
27 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
59
5
0
25 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
98
2
0
24 Nov 2024
Direct and Explicit 3D Generation from a Single Image
Haoyu Wu
Meher Gitika Karumuri
Chuhang Zou
Seungbae Bang
Yuelong Li
Dimitris Samaras
Sunil Hadap
3DGS
MDE
3DV
30
0
0
17 Nov 2024
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation
Ansh Shah
K Madhava Krishna
DiffM
27
0
0
16 Nov 2024
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
Hao-Yu Hsu
Zhi-Hao Lin
Albert Zhai
Hongchi Xia
Shenlong Wang
VGen
40
9
0
04 Nov 2024
DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Yean Cheng
Ziqi Cai
Ming Ding
Wendi Zheng
Shiyu Huang
Yuxiao Dong
J. Tang
Boxin Shi
DiffM
32
0
0
03 Nov 2024
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
Wang Zhao
Jiachen Liu
Sheng Zhang
Y. Li
Sili Chen
S. X. Huang
Y. Liu
Hengkai Guo
24
0
0
02 Nov 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
50
30
0
24 Oct 2024
SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
Cheng-De Fan
Chen-Wei Chang
Yi-Ruei Liu
Jie-Ying Lee
Jiun-Long Huang
Yu-Chee Tseng
Yu-Lun Liu
3DGS
59
3
0
22 Oct 2024
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
Junsheng Zhou
Yu-Shen Liu
Zhizhong Han
ViT
32
9
0
21 Oct 2024
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
50
29
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
36
0
0
16 Oct 2024
TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt
Jiahui Yang
Donglin Di
Baorui Ma
Xun Yang
Yongjia Ma
...
Wei Chen
Jianxun Cui
Zhou Xue
Meng Wang
Yebin Liu
DiffM
32
1
0
16 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
66
16
0
14 Oct 2024
UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images
Zeyu Chen
Jingyi Tang
Gu Wang
Shengquan Li
Xinghui Li
Xiangyang Ji
Xiu Li
23
0
0
10 Oct 2024
O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out
Mısra Yavuz
Fatma Guney
19
0
0
10 Oct 2024
GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering
Hongze Chen
Zehong Lin
Jun Zhang
3DGS
38
4
0
03 Oct 2024
1
2
3
4
5
Next