TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

5 July 2021
Aljaž Božič
Pablo Rodríguez Palafox
Justus Thies
Angela Dai
Matthias Nießner
    ViT

Papers citing "TransformerFusion: Monocular RGB Scene Reconstruction using Transformers"

50 / 89 papers shown
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization
Yueh-Cheng Liu
Lukas Höllein
Matthias Nießner
Angela Dai
3DGS
24
0
0
08 May 2025
ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping
Shun Iwase
Zubair Irshad
Katherine Liu
Vitor Campagnolo Guizilini
Robert Lee
...
Ayako Amma
Koichi Nishiwaki
Kris M. Kitani
Rares Ambrus
Sergey Zakharov
29
0
0
15 Apr 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
39
3
0
28 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
50
0
0
16 Mar 2025
H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision
Y. Shi
H. Cai
Amin Ansari
Fatih Porikli
38
0
0
06 Mar 2025
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations
Mingjie Xu
Mengyang Wu
Yuzhi Zhao
Jason Chun Lok Li
Weifeng Ou
LRM
SyDa
VLM
57
2
0
09 Dec 2024
PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching
Chen Ziwen
Zexiang Xu
Li Fuxin
3DPC
26
0
0
30 Oct 2024
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats
Chen Ziwen
Hao Tan
Kai Zhang
Sai Bi
Fujun Luan
Yicong Hong
Li Fuxin
Zexiang Xu
3DGS
3DV
29
16
0
16 Oct 2024
Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor
Andrea Conti
Matteo Poggi
Valerio Cambareri
S. Mattoccia
MDE
38
3
0
12 Sep 2024
Geometry-guided Feature Learning and Fusion for Indoor Scene Reconstruction
Ruihong Yin
Sezer Karaoglu
Theo Gevers
3DV
32
5
0
28 Aug 2024
Ray-Distance Volume Rendering for Neural Scene Reconstruction
Ruihong Yin
Yunlu Chen
Sezer Karaoglu
Theo Gevers
25
2
0
28 Aug 2024
HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
Xiao Zhao
Bo Chen
Mingyang Sun
Dingkang Yang
Youxing Wang
Xukun Zhang
Mingcheng Li
Dongliang Kou
Xiaoyi Wei
Lihua Zhang
36
6
0
17 Aug 2024
GroundUp: Rapid Sketch-Based 3D City Massing
Gizem Esra Unlu
Mohamed Sayed
Yulia Gryaditskaya
Gabriel J. Brostow
36
1
0
17 Jul 2024
Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey
Uchitha Rajapaksha
Ferdous Sohel
Hamid Laga
D. Diepeveen
Mohammed Bennamoun
MDE
26
11
0
28 Jun 2024
DoubleTake: Geometry Guided Depth Estimation
Mohamed Sayed
Filippo Aleotti
Jamie Watson
Z. Qureshi
Guillermo Garcia-Hernando
Gabriel J. Brostow
Sara Vicente
Michael Firman
MDE
3DH
3DV
27
2
0
26 Jun 2024
FAWN: Floor-And-Walls Normal Regularization for Direct Neural TSDF Reconstruction
Anna Sokolova
Anna Vorontsova
Bulat Gabdullin
Alexander Limonov
3DV
21
0
0
17 Jun 2024
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
Jamie Watson
Filippo Aleotti
Mohamed Sayed
Z. Qureshi
Oisin Mac Aodha
Gabriel J. Brostow
Michael Firman
Sara Vicente
3DPC
27
0
0
13 Jun 2024
Gated Fields: Learning Scene Reconstruction from Gated Videos
Andrea Ramazzina
Stefanie Walz
Pragyan Dahal
Mario Bijelic
Felix Heide
40
1
0
30 May 2024
Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians
Erik Sandström
Keisuke Tateno
Michael Oechsle
Michael Niemeyer
Luc Van Gool
Martin R. Oswald
Federico Tombari
3DGS
27
24
0
26 May 2024
MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving
Yiqun Duan
Xianda Guo
Zheng Zhu
Zhen Wang
Yu-Kai Wang
Chin-Teng Lin
29
2
0
13 May 2024
NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation
Ziyi Chen
Xiaolong Wu
Yu Zhang
25
2
0
01 May 2024
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
Yiqun Duan
Qiang Zhang
Renjing Xu
36
9
0
07 Apr 2024
NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising
Tianchen Deng
Yanbo Wang
Hongle Xie
Hesheng Wang
Jingchuan Wang
Danwei W. Wang
Weidong Chen
3DV
35
22
0
29 Mar 2024
GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM
Ganlin Zhang
Erik Sandström
Youmin Zhang
Manthan Patel
Luc Van Gool
Martin R. Oswald
36
19
0
28 Mar 2024
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos
Florian Langer
Jihong Ju
Georgi Dikov
Gerhard Reitmayr
Mohsen Ghafoorian
3DPC
29
3
0
22 Mar 2024
Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution
Samuel Sze
Lars Kunze
3DPC
26
3
0
13 Mar 2024
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Lukas Höllein
Aljaž Božič
Norman Muller
David Novotny
Hung-Yu Tseng
Christian Richardt
Michael Zollhöfer
Matthias Nießner
DiffM
39
39
0
04 Mar 2024
Loopy-SLAM: Dense Neural SLAM with Loop Closures
Lorenzo Liso
Erik Sandström
V. Yugay
Luc Van Gool
Martin R. Oswald
16
29
0
14 Feb 2024
Range-Agnostic Multi-View Depth Estimation With Keyframe Selection
Andrea Conti
Matteo Poggi
Valerio Cambareri
S. Mattoccia
3DV
21
3
0
25 Jan 2024
Fully Sparse 3D Occupancy Prediction
Haisong Liu
Yang Chen
Haiguang Wang
Zetong Yang
Tianyu Li
Jia Zeng
Li Chen
Hongyang Li
Limin Wang
27
12
0
28 Dec 2023
Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition
Xiao Wang
Yao Rong
Shiao Wang
Yuan Chen
Zhe Wu
Bowei Jiang
Yonghong Tian
Jin Tang
ViT
76
3
0
18 Dec 2023
PLGSLAM: Progressive Neural Scene Represenation with Local to Global Bundle Adjustment
Tianchen Deng
Guole Shen
Tong Qin
Jianyu Wang
Wentao Zhao
Jingchuan Wang
Danwei W. Wang
Weidong Chen
19
60
0
15 Dec 2023
SuperPrimitive: Scene Reconstruction at a Primitive Level
Kirill Mazur
Gwangbin Bae
Andrew J. Davison
3DH
14
3
0
10 Dec 2023
Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting
V. Yugay
Yue Li
Theo Gevers
Martin R. Oswald
3DGS
19
118
0
06 Dec 2023
Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors
Pengchong Hu
Zhizhong Han
33
12
0
17 Oct 2023
GradientSurf: Gradient-Domain Neural Surface Reconstruction from RGB Video
Crane He Chen
Joerg Liebelt
17
0
0
09 Oct 2023
DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction
Yuting Xiao
Jingwei Xu
Zehao Yu
Shenghua Gao
29
12
0
29 Aug 2023
SimpleMapping: Real-Time Visual-Inertial Dense Mapping with Deep Multi-View Stereo
Yingye Xin
Xingxing Zuo
Dongyue Lu
Stefan Leutenegger
8
6
0
14 Jun 2023
SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding
Paul-Edouard Sarlin
Eduard Trulls
Marc Pollefeys
J. Hosang
Simon Lynen
3DPC
SSL
20
25
0
08 Jun 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
41
156
0
01 Jun 2023
BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Tao Chu
Pan Zhang
Qiong Liu
Jiaqi Wang
30
6
0
01 Jun 2023
DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
Xiaoliang Ju
Zhaoyang Huang
Yijin Li
Guofeng Zhang
Yu Qiao
Hongsheng Li
16
7
0
01 Jun 2023
Incremental Dense Reconstruction from Monocular Video with Guided Sparse Feature Volume Fusion
Xingxing Zuo
Nan Yang
Nate Merrill
Binbin Xu
Stefan Leutenegger
MDE
17
7
0
24 May 2023
CVRecon: Rethinking 3D Geometric Feature Learning For Neural Reconstruction
Ziyue Feng
Le Yang
Pengsheng Guo
Bing Li
27
14
0
28 Apr 2023
VisFusion: Visibility-aware Online 3D Scene Reconstruction from Videos
Huiyu Gao
Wei Mao
Miaomiao Liu
13
7
0
21 Apr 2023
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
Yiming Gao
Yan-Pei Cao
Ying Shan
22
31
0
18 Apr 2023
Point-SLAM: Dense Neural Point Cloud-based SLAM
Erik Sandström
Yue Li
Luc Van Gool
Martin R. Oswald
3DPC
11
129
0
09 Apr 2023
FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction
Noah Stier
Anurag Ranjan
Alex Colburn
Yajie Yan
Liang Yang
Fangchang Ma
Baptiste Angles
3DV
17
13
0
04 Apr 2023
LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses
Noah Stier
Baptiste Angles
Liang Yang
Yajie Yan
Alex Colburn
Ming Chuang
3DH
8
3
0
31 Mar 2023
MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation
Yunsong Zhou
Quanpan Liu
Hongzi Zhu
Yunzhe Li
Shan Chang
Minyi Guo
3DPC
MDE
45
13
0
23 Mar 2023