ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.08141
  4. Cited By
An End-to-End Transformer Model for 3D Object Detection

An End-to-End Transformer Model for 3D Object Detection

16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
    3DPC
    ViT
ArXivPDFHTML

Papers citing "An End-to-End Transformer Model for 3D Object Detection"

50 / 274 papers shown
Title
Situational Awareness Matters in 3D Vision Language Reasoning
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man
Liang-Yan Gui
Yu-Xiong Wang
38
12
0
11 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal
  Alignment for Open-Vocabulary 3D Object Detection
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
28
6
0
02 Jun 2024
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Weitai Kang
Mengxue Qu
Jyoti Kini
Yunchao Wei
Mubarak Shah
Yan Yan
LM&Ro
3DPC
45
10
0
28 May 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point
  Modeling
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
Yaohua Zha
Naiqi Li
Yanzi Wang
Tao Dai
Hang Guo
Bin Chen
Zhi Wang
Zhihao Ouyang
Shu-Tao Xia
Mamba
42
8
0
27 May 2024
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud
  Learning
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning
Qingdong He
Jiangning Zhang
Jinlong Peng
Haoyang He
Yabiao Wang
Chengjie Wang
3DPC
35
12
0
24 May 2024
PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object
  Detection
PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection
Zhaoqi Leng
Pei Sun
Tong He
Drago Anguelov
Mingxing Tan
ViT
3DPC
29
3
0
05 May 2024
Dexterous Grasp Transformer
Dexterous Grasp Transformer
Guo-Hao Xu
Yi-Lin Wei
Dian Zheng
Xiao-Ming Wu
Wei-Shi Zheng
ViT
27
11
0
28 Apr 2024
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Ye Mao
Junpeng Jing
K. Mikolajczyk
VLM
32
0
0
25 Apr 2024
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
Shuofeng Sun
Yongming Rao
Jiwen Lu
Haibin Yan
27
6
0
23 Apr 2024
Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR
  Data
Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data
Aakash Kumar
Chen Chen
Ajmal Saeed Mian
N. Lobo
Mubarak Shah
3DPC
38
1
0
10 Apr 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via
  Cycle-Modality Propagation
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
33
7
0
28 Mar 2024
On permutation-invariant neural networks
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
33
12
0
26 Mar 2024
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos
Florian Langer
Jihong Ju
Georgi Dikov
Gerhard Reitmayr
Mohsen Ghafoorian
3DPC
32
3
0
22 Mar 2024
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance
  Segmentation
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
Jiahao Lu
Jiacheng Deng
Tianzhu Zhang
45
4
0
22 Mar 2024
3D Object Detection from Point Cloud via Voting Step Diffusion
3D Object Detection from Point Cloud via Voting Step Diffusion
Haoran Hou
Mingtao Feng
Zijie Wu
Weisheng Dong
Qing Zhu
Yaonan Wang
Ajmal Saeed Mian
DiffM
3DPC
30
4
0
21 Mar 2024
SceneScript: Reconstructing Scenes With An Autoregressive Structured
  Language Model
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model
A. Avetisyan
Christopher Xie
Henry Howard-Jenkins
Tsun-Yi Yang
Samir Aroudj
...
Campbell Orme
Jakob Engel
Edward Miller
Richard A. Newcombe
Vasileios Balntas
LM&Ro
33
17
0
19 Mar 2024
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Anushri Dixit
Zhiting Mei
Meghan Booker
Mariko Storey-Matsutani
Mariko Storey-Matsutani
Allen Z. Ren
Ola Shorinwa
Anirudha Majumdar
29
5
0
13 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing
  Objects in 3D Scenes
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
35
10
0
12 Mar 2024
Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted
  Sample Selection
Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection
Shi-tao Chen
Haolin Zhang
Nanning Zheng
3DPC
31
0
0
04 Mar 2024
SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud
  Tracking
SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Yu Lin
Zhiheng Li
Yubo Cui
Zheng Fang
3DPC
39
1
0
26 Feb 2024
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with
  Fine-Grained Feature Representation
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
Qingdong He
Jinlong Peng
Zhengkai Jiang
Kai Wu
Xiaozhong Ji
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Mingang Chen
Yunsheng Wu
3DPC
13
7
0
21 Jan 2024
Surface Normal Estimation with Transformers
Surface Normal Estimation with Transformers
Barry Shichen Hu
Siyun Liang
Johannes Paetzold
H. Nguyen
Isao Echizen
Jiapeng Tang
ViT
3DPC
28
0
0
11 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
MS-DETR: Efficient DETR Training with Mixed Supervision
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
20
20
0
08 Jan 2024
ScatterFormer: Efficient Voxel Transformer with Scattered Linear
  Attention
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
Chenhang He
Ruihuang Li
Guowen Zhang
Lei Zhang
15
4
0
01 Jan 2024
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for
  Open-Vocabulary 3D Detection
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Dongmei Zhang
Chang Li
Ray Zhang
Shenghao Xie
Wei Xue
Xiaodong Xie
Shanghang Zhang
VLM
25
13
0
22 Dec 2023
SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
Yun Zhu
Le Hui
Yaqi Shen
Jin Xie
3DPC
25
3
0
21 Dec 2023
ConDaFormer: Disassembled Transformer with Local Structure Enhancement
  for 3D Point Cloud Understanding
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
Lunhao Duan
Shanshan Zhao
Nan Xue
Mingming Gong
Gui-Song Xia
Dacheng Tao
ViT
16
18
0
18 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Hongyuan Zhu
Fukun Yin
Gang Yu
Tao Chen
17
23
0
17 Dec 2023
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point
  Cloud Registration
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration
Kezheng Xiong
Maoji Zheng
Qingshan Xu
Chenglu Wen
Siqi Shen
Cheng-Yu Wang
3DPC
26
10
0
14 Dec 2023
Cross-BERT for Point Cloud Pretraining
Cross-BERT for Point Cloud Pretraining
Xin Li
Peng Li
Zeyong Wei
Zhe Zhu
Mingqiang Wei
Junhui Hou
Liangliang Nan
J. Qin
H. Xie
F. Wang
SSL
3DPC
23
0
0
08 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
26
3
0
05 Dec 2023
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding,
  Reasoning, and Planning
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Sijin Chen
Xin Chen
C. Zhang
Mingsheng Li
Gang Yu
Hao Fei
Hongyuan Zhu
Jiayuan Fan
Tao Chen
MLLM
24
77
0
30 Nov 2023
Point Cloud Pre-training with Diffusion Models
Point Cloud Pre-training with Diffusion Models
Xiao Zheng
Xiaoshui Huang
Guofeng Mei
Yuenan Hou
Zhaoyang Lyu
Bo Dai
Wanli Ouyang
Yongshun Gong
19
19
0
25 Nov 2023
Multiple View Geometry Transformers for 3D Human Pose Estimation
Multiple View Geometry Transformers for 3D Human Pose Estimation
Ziwei Liao
Jialiang Zhu
Chunyu Wang
Han Hu
Steven L. Waslander
ViT
23
2
0
18 Nov 2023
Point Cloud Self-supervised Learning via 3D to Multi-view Masked
  Autoencoder
Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder
Zhimin Chen
Yingwei Li
Longlong Jing
Liang Yang
Bing Li
3DPC
29
9
0
17 Nov 2023
3DifFusionDet: Diffusion Model for 3D Object Detection with Robust
  LiDAR-Camera Fusion
3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion
Xinhao Xiang
Simon Dräger
Jiawei Zhang
30
1
0
07 Nov 2023
FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision
  Transformer Fusion
FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision Transformer Fusion
Xinhao Xiang
Jiawei Zhang
3DPC
ViT
39
1
0
07 Nov 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive
  Survey and Evaluation
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
27
8
0
24 Oct 2023
SoybeanNet: Transformer-Based Convolutional Neural Network for Soybean
  Pod Counting from Unmanned Aerial Vehicle (UAV) Images
SoybeanNet: Transformer-Based Convolutional Neural Network for Soybean Pod Counting from Unmanned Aerial Vehicle (UAV) Images
Jiajia Li
Raju Thada Magar
Dong Chen
Feng Lin
Dechun Wang
Xiang Yin
Weichao Zhuang
Zhao Li
22
11
0
16 Oct 2023
Multimodal Object Query Initialization for 3D Object Detection
Multimodal Object Query Initialization for 3D Object Detection
Mathijs R. van Geerenstein
Felicia Ruppel
Klaus C. J. Dietmayer
D. Gavrila
3DPC
16
2
0
16 Oct 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
...
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
SSL
69
42
0
12 Oct 2023
3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic
  Indoor Environments
3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments
G. S. Krishna
Kundrapu Supriya
S. Baidya
3DPC
18
2
0
10 Oct 2023
Uni3DETR: Unified 3D Detection Transformer
Uni3DETR: Unified 3D Detection Transformer
Zhenyu Wang
Yali Li
Xi Chen
Hengshuang Zhao
Shengjin Wang
3DPC
42
18
0
09 Oct 2023
Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
Zhenyu Wu
Xiuwei Xu
Ziwei Wang
Chong Xia
Linqing Zhao
Jiwen Lu
Haibin Yan
3DPC
11
3
0
09 Oct 2023
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for
  Open-vocabulary 3D Object Detection
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
8
33
0
04 Oct 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and
  Favorable Transferability For ViTs
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
J. Han
Guiguang Ding
ViT
24
6
0
27 Sep 2023
Unsupervised 3D Perception with 2D Vision-Language Distillation for
  Autonomous Driving
Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Mahyar Najibi
Jingwei Ji
Yin Zhou
C. Qi
Xinchen Yan
Scott Ettinger
Drago Anguelov
14
27
0
25 Sep 2023
Regress Before Construct: Regress Autoencoder for Point Cloud
  Self-supervised Learning
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning
Yang Liu
C. L. P. Chen
Can Wang
Xulin King
Mengyuan Liu
3DPC
27
7
0
25 Sep 2023
Holistic Geometric Feature Learning for Structured Reconstruction
Holistic Geometric Feature Learning for Structured Reconstruction
Ziqiong Lu
Linxi Huan
Qiyuan Ma
Xianwei Zheng
10
1
0
18 Sep 2023
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D
  Detection
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection
Chenming Zhu
Wenwei Zhang
Tai Wang
Xihui Liu
Kai-xiang Chen
3DPC
37
18
0
18 Sep 2023
Previous
123456
Next