An End-to-End Transformer Model for 3D Object Detection

16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT

Papers citing "An End-to-End Transformer Model for 3D Object Detection"

50 / 294 papers shown
GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection
Md Sohag Mia
Md Nahid Hasan
Tawhid Ahmed
Muhammad Abdullah Adnan
3DPC
ViT
205
0
0
02 Dec 2025
Real-Time 3D Object Detection with Inference-Aligned Learning
Chenyu Zhao
Xianwei Zheng
Zimin Xia
Linwei Yue
Nan Xue
3DPC
237
0
0
20 Nov 2025
DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning
Ziqi Gao
Qiufu Li
Linlin Shen
3DPC
136
1
0
24 Oct 2025
Learning Global Representation from Queries for Vectorized HD Map Construction
Shoumeng Qiu
Xinrun Li
Yang Long
Xiangyang Xue
Varun Ojha
Jian Pu
108
1
0
08 Oct 2025
Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds
Gunner Stone
Youngsook Choi
Alireza Tavakkoli
Ankita Shukla
3DPC
112
0
0
21 Sep 2025
Sparse Multiview Open-Vocabulary 3D Detection
Olivier Moliner
Viktor Larsson
Kalle Åström
116
0
0
19 Sep 2025
White Aggregation and Restoration for Few-shot 3D Point Cloud Semantic Segmentation
Jiyun Im
Subeen Lee
Miso Lee
Jae-Pil Heo
3DPC
208
0
0
17 Sep 2025
PI3DETR: Parametric Instance Detection of 3D Point Cloud Edges With a Geometry-Aware 3DETR
Fabio Francisco Oberweger
Michael Schwingshackl
Vanessa Staderini
3DPC
156
1
0
03 Sep 2025
Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views
Xiangdong Zhang
Shaofeng Zhang
Junchi Yan
3DPC
206
1
0
01 Sep 2025
RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation
Siyuan You
Guozheng Xu
Pengwei Zhou
Qiwen Jin
Jian Yao
Li Li
ViT
119
0
0
26 Aug 2025
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang
Kuan-Chuan Peng
Suhas Lohit
Michael Jeffrey Jones
Jiawei Zhang
3DPC
154
1
0
22 Aug 2025
Masked Clustering Prediction for Unsupervised Point Cloud Pre-training
Bin Ren
Xiaoshui Huang
Mengyuan Liu
Hong Liu
Fabio Poiesi
Andrii Zadaianchuk
Guofeng Mei
3DPC
ISeg
168
0
0
12 Aug 2025
Multi-modal Multi-task Pre-training for Improved Point Cloud Understanding
Liwen Liu
Weidong Yang
Lipeng Ma
Ben Fei
3DPC
189
0
0
23 Jul 2025
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Yongsen Mao
Junhao Zhong
Chuan Fang
Jia Zheng
Rui Tang
Hao Zhu
Ping Tan
Zihan Zhou
3DV
232
18
0
09 Jun 2025
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
Haoyuan Li
Yanpeng Zhou
Yufei Gao
Tao Tang
J. N. Han
Yujie Yuan
Dave Zhenyu Chen
Jiawang Bian
Hang Xu
Xiaodan Liang
382
3
0
05 Jun 2025
Detection of Endangered Deer Species Using UAV Imagery: A Comparative Study Between Efficient Deep Learning Approaches
International Conference on Unmanned Aircraft Systems (ICUAS), 2025
Agustín Roca
Gastón Castro
Gabriel Torre
Leonardo J. Colombo
Ignacio Mas
Javier Pereira
Juan Giribet
186
3
0
30 May 2025
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
Computer Vision and Pattern Recognition (CVPR), 2025
Yaohua Zha
Yanzi Wang
Hang Guo
Jinpeng Wang
Tao Dai
Bin Chen
Zhihao Ouyang
Xue Yuerong
Ke Chen
Shu-Tao Xia
286
3
0
27 May 2025
Sketchy Bounding-box Supervision for 3D Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Qian Deng
Renjie He
Jin Xie
Zhiqiang Wang
ISeg
307
1
0
22 May 2025
Learning better representations for crowded pedestrians in offboard LiDAR-camera 3D tracking-by-detection
IEEE International Conference on Robotics and Automation (ICRA), 2025
Shichao Li
Peiliang Li
Qing Lian
Peng Yun
Xiaozhi Chen
3DPC
3DV
255
0
0
21 May 2025
Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey
Calvin Galagain
Martyna Poreba
François Goulette
307
5
0
18 May 2025
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Pedro Hermosilla
Christian Stippel
Leon Sick
SSL
3DPC
396
0
0
09 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
411
12
0
04 Apr 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
353
2
0
26 Mar 2025
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Jiangyi Wang
Na Zhao
355
0
0
20 Mar 2025
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Zinqin Huang
Gu Wang
Chenyangguang Zhang
Ruida Zhang
Xiu Li
Xiangyang Ji
307
4
0
19 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
487
3
0
18 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
187
1
0
09 Mar 2025
HexPlane Representation for 3D Semantic Scene Understanding
Zeren Chen
Yuenan Hou
Yulin Chen
Li Liu
Xiao Sun
Lu Sheng
3DPC
355
0
0
07 Mar 2025
MESC-3D:Mining Effective Semantic Cues for 3D Reconstruction from a Single Image
Computer Vision and Pattern Recognition (CVPR), 2025
Shaoming Li
Qing Cai
Songqi Kong
Runqing Tan
Heng Tong
Shiji Qiu
Yongguo Jiang
Ziqiang Liu
3DV
3DPC
180
0
0
28 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
Guang Dai
Philip H. S. Torr
471
8
0
24 Feb 2025
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Computer Vision and Pattern Recognition (CVPR), 2025
Khanh Nguyen
Ghulam Mubashar Hassan
Lin Wang
3DPC
350
0
0
15 Feb 2025
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
IEEE Transactions on Multimedia (TMM), 2025
Haomiao Xiong
Yunzhi Zhuge
Jiawen Zhu
Lu Zhang
Huchuan Lu
237
11
0
14 Jan 2025
PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation
Xu Ma
Mengsheng Chen
Junhui Zhang
Lijuan Song
Fang Du
Zhenhua Yu
ViT
MedIm
329
0
0
06 Jan 2025
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Yuan Liu
Hengshuang Zhao
739
50
0
02 Jan 2025
RelationField: Relate Anything in Radiance Fields
Computer Vision and Pattern Recognition (CVPR), 2024
Sebastian Koch
Johanna Wald
Mirco Colosi
Narunas Vaskevicius
Pedro Hermosilla
F. Tombari
Timo Ropinski
409
7
0
18 Dec 2024
PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yi Zhong
Weize Quan
Dong-ming Yan
J. Jiang
Yingmei Wei
3DPC
395
2
0
11 Dec 2024
Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
Hao Liu
Minglin Chen
Yanni Ma
Haihong Xiao
Ying He
3DGS
3DPC
272
2
0
27 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
373
1
0
23 Nov 2024
PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Yi Liu
Peng Li
Xuefeng Yan
Liangliang Nan
Bing Wang
Honghua Chen
Lina Gong
Wei Zhao
Mingqiang Wei
SSL
3DPC
156
1
0
09 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Neural Information Processing Systems (NeurIPS), 2024
Timing Yang
Yuanliang Ju
Li Yi
3DPC
316
14
0
31 Oct 2024
NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery
Xuesong Li
Zeeshan Hayder
Ali Zia
Connor Cassidy
Shiming Liu
W. Stiller
Eric A. Stone
Warren C. Conaty
Lars Petersson
V. Rolland
181
4
0
30 Oct 2024
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
Neural Information Processing Systems (NeurIPS), 2024
Yating Xu
Chen Li
G. Lee
3DPC
298
6
0
28 Oct 2024
Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding
International Conference on Pattern Recognition (ICPR), 2024
Yang Liu
Daizong Liu
Wei Hu
3DPC
378
9
0
21 Oct 2024
SAM-Guided Masked Token Prediction for 3D Scene Understanding
Neural Information Processing Systems (NeurIPS), 2024
Zhimin Chen
Liang Yang
Yingwei Li
Longlong Jing
Bing Li
350
6
0
16 Oct 2024
Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Yaohua Zha
Tao Dai
Yanzi Wang
Hang Guo
Bin Chen
Zhihao Ouyang
Chunlin Fan
3DPC
381
1
0
13 Oct 2024
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Dingkang Liang
Tianrui Feng
Xin Zhou
Yumeng Zhang
Zhikang Zou
Xiang Bai
350
27
0
10 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
745
12
0
07 Oct 2024
Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection
Pengxi Zeng
Alberto Presta
Jonah Reinis
Dinesh Bharadia
Hang Qiu
Pamela Cosman
3DPC
238
1
0
01 Oct 2024
Formula-Supervised Visual-Geometric Pre-training
European Conference on Computer Vision (ECCV), 2024
Ryosuke Yamada
Kensho Hara
Hirokatsu Kataoka
Koshi Makihara
Nakamasa Inoue
Rio Yokota
Y. Satoh
158
2
0
20 Sep 2024
UNIT: Unsupervised Online Instance Segmentation through Time
International Conference on 3D Vision (3DV), 2024
Corentin Sautier
Gilles Puy
Alexandre Boulch
Renaud Marlet
Vincent Lepetit
243
5
0
12 Sep 2024