ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.08141
  4. Cited By
An End-to-End Transformer Model for 3D Object Detection

An End-to-End Transformer Model for 3D Object Detection

16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
    3DPCViT
ArXiv (abs)PDFHTML

Papers citing "An End-to-End Transformer Model for 3D Object Detection"

50 / 294 papers shown
PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer
PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel TransformerComputer Vision and Pattern Recognition (CVPR), 2023
Honghui Yang
Wenxiao Wang
Minghao Chen
Binbin Lin
Tong He
Huaguan Chen
Xiaofei He
Wanli Ouyang
3DPCViT
278
64
0
11 May 2023
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene
  Understanding
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Li Jiang
Zetong Yang
Shaoshuai Shi
Vladislav Golyanik
Dengxin Dai
Bernt Schiele
3DPC
265
13
0
08 May 2023
3D Small Object Detection with Dynamic Spatial Pruning
3D Small Object Detection with Dynamic Spatial PruningEuropean Conference on Computer Vision (ECCV), 2023
Xiuwei Xu
Zhihao Sun
Ziwei Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
3DPC
470
9
0
05 May 2023
OctFormer: Octree-based Transformers for 3D Point Clouds
OctFormer: Octree-based Transformers for 3D Point CloudsACM Transactions on Graphics (TOG), 2023
Peng-Shuai Wang
ViT3DPC
318
147
0
04 May 2023
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor
  3D Object Detection
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023
Yichen Xie
Chenfeng Xu
Marie-Julie Rakotosaona
Patrick Rim
F. Tombari
Kurt Keutzer
Masayoshi Tomizuka
Wei Zhan
3DPC
266
112
0
27 Apr 2023
Spatial-Language Attention Policies for Efficient Robot Learning
Spatial-Language Attention Policies for Efficient Robot LearningConference on Robot Learning (CoRL), 2023
Priyam Parashar
Vidhi Jain
Xiaohan Zhang
Jay Vakil
Sam Powers
Yonatan Bisk
Chris Paxton
LM&Ro
269
5
0
21 Apr 2023
Align-DETR: Improving DETR with Simple IoU-aware BCE loss
Align-DETR: Improving DETR with Simple IoU-aware BCE lossBritish Machine Vision Conference (BMVC), 2023
Zhi Cai
Songtao Liu
Guodong Wang
Zheng Ge
Xiangyu Zhang
Di Huang
214
15
0
15 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention
  and Residual Connection in Kernel Space
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
159
2
0
13 Apr 2023
CLIP-Guided Vision-Language Pre-training for Question Answering in 3D
  Scenes
CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes
Maria Parelli
Alexandros Delitzas
Nikolas Hars
G. Vlassis
Sotiris Anagnostidis
Gregor Bachmann
Thomas Hofmann
CLIP
219
73
0
12 Apr 2023
PointCAT: Cross-Attention Transformer for point cloud
PointCAT: Cross-Attention Transformer for point cloud
Xincheng Yang
Mingze Jin
Weiji He
Qian Chen
3DPCViT
228
7
0
06 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World
  3D Scene Understanding
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
354
104
0
03 Apr 2023
Open-Vocabulary Point-Cloud Object Detection without 3D AnnotationComputer Vision and Pattern Recognition (CVPR), 2023
Yuheng Lu
Chenfeng Xu
Xi Wei
Xiaodong Xie
Masayoshi Tomizuka
Kurt Keutzer
Shanghang Zhang
3DPC
423
92
0
03 Apr 2023
Learning Second-Order Attentive Context for Efficient Correspondence
  Pruning
Learning Second-Order Attentive Context for Efficient Correspondence PruningAAAI Conference on Artificial Intelligence (AAAI), 2023
Xinyi Ye
Weiyue Zhao
Hao Lu
Zhiguo Cao
168
12
0
28 Mar 2023
Context-Aware Transformer for 3D Point Cloud Automatic Annotation
Context-Aware Transformer for 3D Point Cloud Automatic AnnotationAAAI Conference on Artificial Intelligence (AAAI), 2023
Xiaoyan Qian
Chang Liu
Xiaojuan Qi
Siew-Chong Tan
E. Lam
Ngai Wong
3DPCViT
201
9
0
27 Mar 2023
RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning
Yabin Zhu
Chenglong Li
Tianlin Li
Jin Tang
Zhixiang Huang
236
15
0
26 Mar 2023
ViPFormer: Efficient Vision-and-Pointcloud Transformer for Unsupervised
  Pointcloud Understanding
ViPFormer: Efficient Vision-and-Pointcloud Transformer for Unsupervised Pointcloud UnderstandingIEEE International Conference on Robotics and Automation (ICRA), 2023
Hongyu Sun
Yongcai Wang
Xudong Cai
Xuewei Bai
Deying Li
ViT3DPC
297
8
0
25 Mar 2023
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based
  Self-Supervised Pre-Training
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2023
Runsen Xu
Tai Wang
Wenwei Zhang
Runjian Chen
Jinkun Cao
Jiangmiao Pang
Dahua Lin
3DPC
215
37
0
23 Mar 2023
POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery
POTTER: Pooling Attention Transformer for Efficient Human Mesh RecoveryComputer Vision and Pattern Recognition (CVPR), 2023
Ce Zheng
Xianpeng Liu
Guo-Jun Qi
Chong Chen
3DH
290
43
0
23 Mar 2023
MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth
  Estimation
MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth EstimationNeural Information Processing Systems (NeurIPS), 2023
Yunsong Zhou
Quanpan Liu
Hongzi Zhu
Yunzhe Li
Shan Chang
Minyi Guo
3DPCMDE
242
18
0
23 Mar 2023
OcTr: Octree-based Transformer for 3D Object Detection
OcTr: Octree-based Transformer for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Chao Zhou
Yanan Zhang
Jiaxin Chen
Di Huang
3DPCViT
271
65
0
22 Mar 2023
CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World
  Point Cloud Data
CLIP2^22: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud DataComputer Vision and Pattern Recognition (CVPR), 2023
Yi Zeng
Chenhan Jiang
Jiageng Mao
Jianhua Han
Chao Ye
Qingqiu Huang
Dit-Yan Yeung
Zhen Yang
Xiaodan Liang
Hang Xu
3DPCVLMCLIP
247
103
0
22 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR
  Perception
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR PerceptionIEEE International Conference on Robotics and Automation (ICRA), 2023
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
355
12
0
21 Mar 2023
CurveCloudNet: Processing Point Clouds with 1D Structure
CurveCloudNet: Processing Point Clouds with 1D StructureComputer Vision and Pattern Recognition (CVPR), 2023
Jiahui Lei
Davis Rempe
Jiateng Liu
Alex Fu
Sebastien Mascha
Jeong Joon Park
Despoina Paschalidou
Leonidas Guibas
3DPC
314
4
0
21 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for
  3D Point Cloud Analysis
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Shiyang Feng
Jiaming Song
Jianbo Shi
3DPC
232
80
0
14 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D
  Object Detection
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
249
94
0
14 Mar 2023
Efficient Transformer-based 3D Object Detection with Dynamic Token
  Halting
Efficient Transformer-based 3D Object Detection with Dynamic Token HaltingIEEE International Conference on Computer Vision (ICCV), 2023
Mao Ye
Gregory P. Meyer
Yuning Chai
Qiang Liu
248
10
0
09 Mar 2023
MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box
  Priors
MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box PriorsIEEE International Conference on Computer Vision (ICCV), 2023
Tianhan Xu
Yuanchen Guo
Yunyu Lai
Songiie Zhang
3DPC
274
27
0
09 Mar 2023
Full Point Encoding for Local Feature Aggregation in 3D Point Clouds
Full Point Encoding for Local Feature Aggregation in 3D Point CloudsIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yong-xing He
Hongshan Yu
Zhengeng Yang
Xiaoguang Liu
Wei Sun
Lin Wang
ViT3DPC
271
8
0
08 Mar 2023
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature
  Augmentation
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature AugmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Rongyao Fang
Shiyang Feng
Aojun Zhou
Yingjie Cai
Si Liu
Jifeng Dai
Jiaming Song
ViT
282
17
0
02 Mar 2023
Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis
Nearest Neighbors Meet Deep Neural Networks for Point Cloud AnalysisIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Renrui Zhang
Liuhui Wang
Ziyu Guo
Jianbo Shi
3DPC
275
11
0
01 Mar 2023
Applying Plain Transformers to Real-World Point Clouds
Applying Plain Transformers to Real-World Point Clouds
Lanxiao Li
M. Heizmann
3DPCViT
373
3
0
28 Feb 2023
Sampled Transformer for Point Sets
Sampled Transformer for Point Sets
Shidi Li
Christian J. Walder
Alexander Soen
Lexing Xie
Miaomiao Liu
3DPC
183
1
0
28 Feb 2023
CLR-GAM: Contrastive Point Cloud Learning with Guided Augmentation and
  Feature Mapping
CLR-GAM: Contrastive Point Cloud Learning with Guided Augmentation and Feature Mapping
Srikanth Malla
Yi-Ting Chen
3DPC
176
4
0
28 Feb 2023
PointWavelet: Learning in Spectral Domain for 3D Point Cloud Analysis
PointWavelet: Learning in Spectral Domain for 3D Point Cloud AnalysisIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Cheng Wen
Jia-Li Long
B. Yu
Dacheng Tao
3DPC
287
15
0
10 Feb 2023
Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
  Autonomous Driving
Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving
Jiawei Liu
Xingping Dong
Sanyuan Zhao
Jianbing Shen
3DPC
332
10
0
08 Feb 2023
TR3D: Towards Real-Time Indoor 3D Object Detection
TR3D: Towards Real-Time Indoor 3D Object DetectionInternational Conference on Information Photonics (ICIP), 2023
D. Rukhovich
Anna Vorontsova
Anton Konushin
3DPC
343
47
0
06 Feb 2023
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting
Exploiting Optical Flow Guidance for Transformer-Based Video InpaintingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Kaiwen Zhang
Jialun Peng
Jingjing Fu
Dong Liu
ViT
243
18
0
24 Jan 2023
Slice Transformer and Self-supervised Learning for 6DoF Localization in
  3D Point Cloud Maps
Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud MapsIEEE International Conference on Robotics and Automation (ICRA), 2023
Muhammad Ibrahim
Naveed Akhtar
Saeed Anwar
Michael Wise
Lin Wang
ViT3DPC
213
6
0
21 Jan 2023
Joint Representation Learning for Text and 3D Point Cloud
Joint Representation Learning for Text and 3D Point CloudPattern Recognition (Pattern Recogn.), 2023
Rui Huang
Xuran Pan
Henry Zheng
Haojun Jiang
Zhifeng Xie
Qing Xiao
Gao Huang
271
26
0
18 Jan 2023
TarViS: A Unified Approach for Target-based Video Segmentation
TarViS: A Unified Approach for Target-based Video SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
A. Athar
Alexander Hermans
Jonathon Luiten
Deva Ramanan
Bastian Leibe
VOS
365
37
0
06 Jan 2023
Hierarchical Point Attention for Indoor 3D Object Detection
Hierarchical Point Attention for Indoor 3D Object DetectionIEEE International Conference on Robotics and Automation (ICRA), 2023
Manli Shu
Le Xue
Ning Yu
Roberto Martín-Martín
Caiming Xiong
Tom Goldstein
Juan Carlos Niebles
Ran Xu
3DPC
368
2
0
06 Jan 2023
End-to-End 3D Dense Captioning with Vote2Cap-DETR
End-to-End 3D Dense Captioning with Vote2Cap-DETRComputer Vision and Pattern Recognition (CVPR), 2023
Sijin Chen
Erik Cambria
Xin Chen
Yinjie Lei
Tao Chen
YU Gang
ViT
210
87
0
06 Jan 2023
CAT: LoCalization and IdentificAtion Cascade Detection Transformer for
  Open-World Object Detection
CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Shuailei Ma
Yuefeng Wang
Jiaqi Fan
Ying-yu Wei
Thomas H. Li
Hongli Liu
Fanbing Lv
417
56
0
05 Jan 2023
Ponder: Point Cloud Pre-training via Neural Rendering
Ponder: Point Cloud Pre-training via Neural RenderingIEEE International Conference on Computer Vision (ICCV), 2022
Di Huang
Sida Peng
Tong He
Honghui Yang
Xiaowei Zhou
Wanli Ouyang
SSL3DPC
267
53
0
31 Dec 2022
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with
  Informative-Preserved Reconstruction and Self-Distilled Consistency
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled ConsistencyComputer Vision and Pattern Recognition (CVPR), 2022
Mingye Xu
Mutian Xu
Tong He
Wanli Ouyang
Yali Wang
Xiaoguang Han
Yu Qiao
217
14
0
20 Dec 2022
3D Point Cloud Pre-training with Knowledge Distillation from 2D Images
3D Point Cloud Pre-training with Knowledge Distillation from 2D Images
Xingtai Lv
Yuanhan Zhang
Zhen-fei Yin
Jiebo Luo
Wanli Ouyang
Xiaoshui Huang
3DPC
224
11
0
17 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?International Conference on Learning Representations (ICLR), 2022
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
312
139
0
16 Dec 2022
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
ConQueR: Query Contrast Voxel-DETR for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Jiaming Song
150
37
0
14 Dec 2022
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound SeparationComputer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
279
40
0
07 Dec 2022
GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds
GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point CloudsComputer Vision and Pattern Recognition (CVPR), 2022
Honghui Yang
Tong He
Jiaheng Liu
Huaguan Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wanli Ouyang
365
93
0
06 Dec 2022
Previous
123456
Next
Page 4 of 6