2110.04994
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
11 October 2021
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
Papers citing
"Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans"
50 / 221 papers shown
Learning from the Giants: A Practical Approach to Underwater Depth and Surface Normals Estimation
Alzayat Saleh
Melanie Olsen
Bouchra Senadji
M. R. Azghadi
16
0
0
02 Oct 2024
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
Jeff Tan
Donglai Xiang
Shubham Tulsiani
Deva Ramanan
Gengshan Yang
3DH
26
3
0
30 Sep 2024
KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation
Soofiyan Atar
Yuheng Zhi
Florian Richter
Michael C. Yip
MDE
29
0
0
29 Sep 2024
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
Shaowei Liu
Zhongzheng Ren
Saurabh Gupta
Shenlong Wang
VGen
DiffM
PINN
39
33
0
27 Sep 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
44
38
0
26 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
37
23
0
17 Sep 2024
RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models
Başak Melis Öcal
Maxim Tatarchenko
Sezer Karaoglu
Theo Gevers
DiffM
25
0
0
16 Sep 2024
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Vitor Campagnolo Guizilini
P. Tokmakov
Achal Dave
Rares Ambrus
DiffM
23
2
0
15 Sep 2024
Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss
Qifan Fu
Xiaohang Yang
Muhammad Asad
Changjae Oh
Shanxin Yuan
Gregory Slabaugh
35
2
0
13 Sep 2024
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
Denis Zavadski
Damjan Kalšan
Carsten Rother
DiffM
MDE
20
5
0
13 Sep 2024
PVP-Recon: Progressive View Planning via Warping Consistency for Sparse-View Surface Reconstruction
Sheng Ye
Yuze He
Matthieu Lin
Jenny Sheng
Ruoyu Fan
...
Yubin Hu
Ran Yi
Yu-Hui Wen
Yong-Jin Liu
Wenping Wang
18
3
0
09 Sep 2024
Incorporating dense metric depth into neural 3D representations for view synthesis and relighting
A. N. Chaudhury
Igor Vasiljevic
Sergey Zakharov
Vitor Campagnolo Guizilini
Rares Ambrus
Srinivasa Narasimhan
C. Atkeson
3DH
3DV
29
0
0
04 Sep 2024
Ray-Distance Volume Rendering for Neural Scene Reconstruction
Ruihong Yin
Yunlu Chen
Sezer Karaoglu
Theo Gevers
22
2
0
28 Aug 2024
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
36
63
0
22 Aug 2024
Transientangelo: Few-Viewpoint Surface Reconstruction Using Single-Photon Lidar
Weihan Luo
Anagh Malik
David B. Lindell
36
0
0
22 Aug 2024
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction
Ziyu Tang
Weicai Ye
Yifan Wang
Di Huang
Hujun Bao
Tong He
Guofeng Zhang
AI4CE
33
7
0
22 Aug 2024
NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
Yifan Wang
Di Huang
Weicai Ye
Guofeng Zhang
Wanli Ouyang
Tong He
40
7
0
19 Aug 2024
Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video
Shuxian Wang
Akshay Paruchuri
Zhaoxi Zhang
Sarah K. McGill
Roni Sengupta
MedIm
MDE
24
3
0
19 Aug 2024
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration
Hao Ai
Lin Wang
27
0
0
18 Aug 2024
GS-ID: Illumination Decomposition on Gaussian Splatting via Diffusion Prior and Parametric Light Source Optimization
Kang Du
Zhihao Liang
Zeyu Wang
3DGS
26
5
0
16 Aug 2024
IG-SLAM: Instant Gaussian SLAM
F. Aykut Sarikamis
A. Aydin Alatan
3DGS
39
6
0
02 Aug 2024
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
Xiang Zhang
B. Ke
Hayko Riemenschneider
Nando Metzger
Anton Obukhov
Markus H. Gross
Konrad Schindler
Christopher Schroers
DiffM
MDE
41
7
0
25 Jul 2024
DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
Xiaobiao Du
Haiyang Sun
Ming Lu
Tianqing Zhu
Xin Yu
28
2
0
24 Jul 2024
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie
Chun-Han Yao
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
VGen
70
39
0
24 Jul 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffM
MQ
MDE
31
7
0
23 Jul 2024
Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry
Marvin Alberts
Oliver Schilter
F. Zipoli
Nina Hartrampf
Teodoro Laino
25
5
0
04 Jul 2024
VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors
Sungwon Hwang
Min-Jung Kim
Taewoong Kang
Jayeon Kang
Jaegul Choo
3DGS
33
10
0
03 Jul 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
Chongjie Ye
Lingteng Qiu
Xiaodong Gu
Qi Zuo
Yushuang Wu
Zilong Dong
Liefeng Bo
Yuliang Xiu
Xiaoguang Han
DiffM
32
38
0
24 Jun 2024
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao
Junshu Tang
Jiangning Zhang
Ran Yi
Yijia Hong
Moran Li
Weijian Cao
Yating Wang
Lizhuang Ma
DiffM
41
0
0
24 Jun 2024
Explore the Limits of Omni-modal Pretraining at Scale
Yiyuan Zhang
Handong Li
Jing Liu
Xiangyu Yue
VLM
LRM
45
1
0
13 Jun 2024
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann
Oğuzhan Fatih Kar
David Mizrahi
Ali Garjani
Mingfei Gao
David Griffiths
Jiaming Hu
Afshin Dehghan
Amir Zamir
MoE
VLM
MLLM
31
14
0
13 Jun 2024
Scale-Invariant Monocular Depth Estimation via SSI Depth
S. M. H. Miangoleh
Mahesh Kumar Krishna Reddy
Yağız Aksoy
MDE
19
5
0
13 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
34
37
0
03 Jun 2024
Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians
Erik Sandström
Keisuke Tateno
Michael Oechsle
Michael Niemeyer
Luc Van Gool
Martin R. Oswald
Federico Tombari
3DGS
27
24
0
26 May 2024
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
Yunhao Ge
Yihe Tang
Jiashu Xu
Cem Gokmen
Chengshu Li
...
Miao Liu
Pengchuan Zhang
Ruohan Zhang
Fei-Fei Li
Jiajun Wu
VGen
33
6
0
15 May 2024
Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview
Yuhang Ming
Xingrui Yang
Weihan Wang
Zheng Chen
Jinglun Feng
Yifan Xing
Guofeng Zhang
27
8
0
09 May 2024
NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation
Ziyi Chen
Xiaolong Wu
Yu Zhang
25
2
0
01 May 2024
High-quality Surface Reconstruction using Gaussian Surfels
Pinxuan Dai
Jiamin Xu
Wenxiang Xie
Xinguo Liu
Huamin Wang
Weiwei Xu
3DGS
33
76
0
27 Apr 2024
The Third Monocular Depth Estimation Challenge
Jaime Spencer
Fabio Tosi
Matteo Poggi
Ripudaman Singh Arora
Chris Russell
...
Albert Luginov
Muhammad Shahzad
Seyed Hosseini
Aleksander Trajcevski
James H. Elder
MDE
33
7
0
25 Apr 2024
PhyRecon: Physically Plausible Neural Scene Reconstruction
Junfeng Ni
Yixin Chen
Bohan Jing
Nan Jiang
Bin Wang
Bo Dai
Puhao Li
Yixin Zhu
Song-Chun Zhu
Siyuan Huang
42
12
0
25 Apr 2024
Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration
Jing Zeng
Yanxu Li
Jiahao Sun
Qi Ye
Yunlong Ran
Jiming Chen
19
2
0
16 Apr 2024
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Hongchi Xia
Zhi-Hao Lin
Wei-Chiu Ma
Shenlong Wang
VGen
38
13
0
15 Apr 2024
MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
Yuqun Wu
Jae Yong Lee
Chuhang Zou
Shenlong Wang
Derek Hoiem
32
0
0
12 Apr 2024
BRAVE: Broadening the visual encoding of vision-language models
Oğuzhan Fatih Kar
A. Tonioni
Petra Poklukar
Achin Kulshrestha
Amir Zamir
Federico Tombari
MLLM
VLM
42
25
0
10 Apr 2024
NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization
Peng Tu
Xun Zhou
Mingming Wang
Xiaojun Yang
Bo Peng
Ping Chen
Xiu Su
Yawen Huang
Yefeng Zheng
Chang Xu
24
1
0
07 Apr 2024
RaSim: A Range-aware High-fidelity RGB-D Data Simulation Pipeline for Real-world Applications
Xingyu Liu
Chenyangguang Zhang
Gu Wang
Ruida Zhang
Xiangyang Ji
34
1
0
05 Apr 2024
Stable Surface Regularization for Fast Few-Shot NeRF
Byeongin Joung
Byeong-uk Lee
Jaesung Choe
Ukcheol Shin
Minjun Kang
Taeyeop Lee
In So Kweon
Kuk-Jin Yoon
28
0
0
29 Mar 2024
GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM
Ganlin Zhang
Erik Sandström
Youmin Zhang
Manthan Patel
Luc Van Gool
Martin R. Oswald
36
19
0
28 Mar 2024
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu
Chirui Chang
Peng Dai
Yang-tian Sun
Xiaojuan Qi
3DGS
33
3
0
28 Mar 2024
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Christos Sakaridis
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
73
126
0
27 Mar 2024