An End-to-End Transformer Model for 3D Object Detection

16 September 2021

Papers citing "An End-to-End Transformer Model for 3D Object Detection"

50 / 294 papers shown

GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection

Md Sohag Mia

Md Nahid Hasan

Tawhid Ahmed

Muhammad Abdullah Adnan

3DPC ViT

205

02 Dec 2025

Real-Time 3D Object Detection with Inference-Aligned Learning

244

20 Nov 2025

DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning

140

24 Oct 2025

Learning Global Representation from Queries for Vectorized HD Map Construction

111

08 Oct 2025

Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds

116

21 Sep 2025

Sparse Multiview Open-Vocabulary 3D Detection

Olivier Moliner

Viktor Larsson

Kalle Åström

118

19 Sep 2025

White Aggregation and Restoration for Few-shot 3D Point Cloud Semantic Segmentation

212

17 Sep 2025

PI3DETR: Parametric Instance Detection of 3D Point Cloud Edges With a Geometry-Aware 3DETR

Fabio Francisco Oberweger

Michael Schwingshackl

Vanessa Staderini

3DPC

174

03 Sep 2025

Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views

206

01 Sep 2025

RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation

120

26 Aug 2025

Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

Xinhao Xiang

Kuan-Chuan Peng

Suhas Lohit

Michael Jeffrey Jones

Jiawei Zhang

3DPC

162

22 Aug 2025

Masked Clustering Prediction for Unsupervised Point Cloud Pre-training

170

12 Aug 2025

Multi-modal Multi-task Pre-training for Improved Point Cloud Understanding

189

23 Jul 2025

SpatialLM: Training Large Language Models for Structured Indoor Modeling

235

09 Jun 2025

Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs

389

05 Jun 2025

Detection of Endangered Deer Species Using UAV Imagery: A Comparative Study Between Efficient Deep Learning ApproachesInternational Conference on Unmanned Aircraft Systems (ICUAS), 2025

192

30 May 2025

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba AdapterComputer Vision and Pattern Recognition (CVPR), 2025

289

27 May 2025

Sketchy Bounding-box Supervision for 3D Instance SegmentationComputer Vision and Pattern Recognition (CVPR), 2025

308

22 May 2025

Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detectionIEEE International Conference on Robotics and Automation (ICRA), 2025

258

21 May 2025

Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey

Calvin Galagain

Martyna Poreba

François Goulette

310

18 May 2025

Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2025

400

09 Apr 2025

Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework

Nikhil Shivakumar Nayak

417

04 Apr 2025

GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection

356

26 Mar 2025

Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2025

Jiangyi Wang

Na Zhao

364

20 Mar 2025

GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose EstimationComputer Vision and Pattern Recognition (CVPR), 2025

307

19 Mar 2025

State Space Model Meets Transformer: A New Paradigm for 3D Object DetectionInternational Conference on Learning Representations (ICLR), 2025

495

18 Mar 2025

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

195

09 Mar 2025

HexPlane Representation for 3D Semantic Scene Understanding

356

07 Mar 2025

MESC-3D:Mining Effective Semantic Cues for 3D Reconstruction from a Single ImageComputer Vision and Pattern Recognition (CVPR), 2025

180

28 Feb 2025

DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

472

24 Feb 2025

Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object RecognitionComputer Vision and Pattern Recognition (CVPR), 2025

Khanh Nguyen

Ghulam Mubashar Hassan

Lin Wang

3DPC

355

15 Feb 2025

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingIEEE transactions on multimedia (TMM), 2025

247

14 Jan 2025

PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation

329

06 Jan 2025

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

763

02 Jan 2025

RelationField: Relate Anything in Radiance FieldsComputer Vision and Pattern Recognition (CVPR), 2024

409

18 Dec 2024

PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud CompletionAAAI Conference on Artificial Intelligence (AAAI), 2024

400

11 Dec 2024

Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting

272

27 Nov 2024

Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data

374

23 Nov 2024

PointCG: Self-supervised Point Cloud Learning via Joint Completion and GenerationIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024

156

09 Nov 2024

ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesNeural Information Processing Systems (NeurIPS), 2024

319

31 Oct 2024

NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery

Lars Petersson

187

30 Oct 2024

MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane SweepsNeural Information Processing Systems (NeurIPS), 2024

301

28 Oct 2024

Joint Top-Down and Bottom-Up Frameworks for 3D Visual GroundingInternational Conference on Pattern Recognition (ICPR), 2024

Yang Liu

Daizong Liu

Wei Hu

3DPC

378

21 Oct 2024

SAM-Guided Masked Token Prediction for 3D Scene UnderstandingNeural Information Processing Systems (NeurIPS), 2024

Yingwei Li

353

16 Oct 2024

Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

393

13 Oct 2024

Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

359

10 Oct 2024

Diffusion Models in 3D Vision: A Survey

Zhen Wang

Dongyuan Li

Xue Liu

Tianyu He

Jiang Bian

Renhe Jiang

MedIm

763

07 Oct 2024

Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection

241

01 Oct 2024

Formula-Supervised Visual-Geometric Pre-trainingEuropean Conference on Computer Vision (ECCV), 2024

Hirokatsu Kataoka

160

20 Sep 2024

UNIT: Unsupervised Online Instance Segmentation through TimeInternational Conference on 3D Vision (3DV), 2024

252

12 Sep 2024