An End-to-End Transformer Model for 3D Object Detection

16 September 2021

Papers citing "An End-to-End Transformer Model for 3D Object Detection"

50 / 294 papers shown

GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection

Md Sohag Mia

Md Nahid Hasan

Tawhid Ahmed

Muhammad Abdullah Adnan

3DPC ViT

205

02 Dec 2025

Real-Time 3D Object Detection with Inference-Aligned Learning

237

20 Nov 2025

DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning

136

24 Oct 2025

Learning Global Representation from Queries for Vectorized HD Map Construction

108

08 Oct 2025

Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds

112

21 Sep 2025

Sparse Multiview Open-Vocabulary 3D Detection

Olivier Moliner

Viktor Larsson

Kalle Åström

116

19 Sep 2025

White Aggregation and Restoration for Few-shot 3D Point Cloud Semantic Segmentation

208

17 Sep 2025

PI3DETR: Parametric Instance Detection of 3D Point Cloud Edges With a Geometry-Aware 3DETR

Fabio Francisco Oberweger

Michael Schwingshackl

Vanessa Staderini

3DPC

156

03 Sep 2025

Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views

206

01 Sep 2025

RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation

119

26 Aug 2025

Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

Xinhao Xiang

Kuan-Chuan Peng

Suhas Lohit

Michael Jeffrey Jones

Jiawei Zhang

3DPC

154

22 Aug 2025

Masked Clustering Prediction for Unsupervised Point Cloud Pre-training

168

12 Aug 2025

Multi-modal Multi-task Pre-training for Improved Point Cloud Understanding

189

23 Jul 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

232

09 Jun 2025

Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs

382

05 Jun 2025

Detection of Endangered Deer Species Using UAV Imagery: A Comparative Study Between Efficient Deep Learning Approaches

International Conference on Unmanned Aircraft Systems (ICUAS), 2025

186

30 May 2025

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter

Computer Vision and Pattern Recognition (CVPR), 2025

286

27 May 2025

Sketchy Bounding-box Supervision for 3D Instance Segmentation

Computer Vision and Pattern Recognition (CVPR), 2025

307

22 May 2025

Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection

IEEE International Conference on Robotics and Automation (ICRA), 2025

255

21 May 2025

Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey

Calvin Galagain

Martyna Poreba

François Goulette

307

18 May 2025

Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding

Computer Vision and Pattern Recognition (CVPR), 2025

396

09 Apr 2025

Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework

Nikhil Shivakumar Nayak

411

04 Apr 2025

GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection

353

26 Mar 2025

Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection

Computer Vision and Pattern Recognition (CVPR), 2025

Jiangyi Wang

Na Zhao

355

20 Mar 2025

GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation

Computer Vision and Pattern Recognition (CVPR), 2025

307

19 Mar 2025

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection

International Conference on Learning Representations (ICLR), 2025

487

18 Mar 2025

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

187

09 Mar 2025

HexPlane Representation for 3D Semantic Scene Understanding

355

07 Mar 2025

MESC-3D: Mining Effective Semantic Cues for 3D Reconstruction from a Single Image

Computer Vision and Pattern Recognition (CVPR), 2025

180

28 Feb 2025

DeepInteraction++: Multi-Modality Interaction for Autonomous Driving

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

471

24 Feb 2025

Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition

Computer Vision and Pattern Recognition (CVPR), 2025

Khanh Nguyen

Ghulam Mubashar Hassan

Lin Wang

3DPC

350

15 Feb 2025

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding

IEEE Transactions on Multimedia (TMM), 2025

237

14 Jan 2025

PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation

329

06 Jan 2025

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

739

02 Jan 2025

RelationField: Relate Anything in Radiance Fields

Computer Vision and Pattern Recognition (CVPR), 2024

409

18 Dec 2024

PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion

AAAI Conference on Artificial Intelligence (AAAI), 2024

395

11 Dec 2024

Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting

272

27 Nov 2024

Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data

373

23 Nov 2024

PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024

156

09 Nov 2024

ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images

Neural Information Processing Systems (NeurIPS), 2024

316

31 Oct 2024

NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery

Lars Petersson

181

30 Oct 2024

MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Neural Information Processing Systems (NeurIPS), 2024

298

28 Oct 2024

Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding

International Conference on Pattern Recognition (ICPR), 2024

Yang Liu

Daizong Liu

Wei Hu

3DPC

378

21 Oct 2024

SAM-Guided Masked Token Prediction for 3D Scene Understanding

Neural Information Processing Systems (NeurIPS), 2024

Yingwei Li

350

16 Oct 2024

Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2024

381

13 Oct 2024

Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

350

10 Oct 2024

Diffusion Models in 3D Vision: A Survey

Zhen Wang

Dongyuan Li

Xue Liu

Tianyu He

Jiang Bian

Renhe Jiang

MedIm

745

07 Oct 2024

Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection

238

01 Oct 2024

Formula-Supervised Visual-Geometric Pre-training

European Conference on Computer Vision (ECCV), 2024

Hirokatsu Kataoka

158

20 Sep 2024

UNIT: Unsupervised Online Instance Segmentation through Time

International Conference on 3D Vision (3DV), 2024

243

12 Sep 2024