2109.08141
Cited By
An End-to-End Transformer Model for 3D Object Detection
16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT
Papers citing
"An End-to-End Transformer Model for 3D Object Detection"
50 / 274 papers shown
Title
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Sijin Chen
Hongyuan Zhu
Mingsheng Li
Xin Chen
Peng Guo
Yinjie Lei
Gang Yu
Taihao Li
Tao Chen
11
17
0
06 Sep 2023
Dense Object Grounding in 3D Scenes
Wencan Huang
Daizong Liu
Wei Hu
13
17
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGen
DiffM
24
1
0
05 Sep 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation
Xin Lai
Yuhui Yuan
Ruihang Chu
Yukang Chen
Han Hu
Jiaya Jia
MedIm
ISeg
3DPC
35
29
0
04 Sep 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
39
46
0
01 Sep 2023
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Yiwen Tang
Xianzheng Ma
...
Ke Chen
Peng Gao
Xianzhi Li
Hongsheng Li
Pheng-Ann Heng
MLLM
22
124
0
01 Sep 2023
Group Regression for Query Based Object Detection and Tracking
Felicia Ruppel
F. Faion
Claudius Gläser
Klaus C. J. Dietmayer
13
1
0
28 Aug 2023
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
Tao Tu
Shun-Po Chuang
Yu-Lun Liu
Cheng Sun
Kecheng Zhang
D. Roy
Cheng-Hao Kuo
Min Sun
3DPC
25
5
0
17 Aug 2023
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Zehan Wang
Haifeng Huang
Yang Zhao
Ziang Zhang
Zhou Zhao
19
58
0
17 Aug 2023
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection
Yichao Shen
Zigang Geng
Yuhui Yuan
Yutong Lin
Ze Liu
Chunyu Wang
Han Hu
Nanning Zheng
B. Guo
3DPC
26
24
0
08 Aug 2023
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
3DV
VLM
16
28
0
01 Aug 2023
Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Ziyi Wang
Xumin Yu
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
3DPC
19
18
0
27 Jul 2023
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Zehan Wang
Haifeng Huang
Yang Zhao
Lin Li
Xize Cheng
Yichen Zhu
Aoxiong Yin
Zhou Zhao
3DPC
25
20
0
25 Jul 2023
GaPro: Box-Supervised 3D Point Cloud Instance Segmentation Using Gaussian Processes as Pseudo Labelers
T.D. Ngo
Binh-Son Hua
Khoi Duc Minh Nguyen
3DPC
18
4
0
25 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
24
32
0
18 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Bernard Ghanem
Dacheng Tao
ObjD
VLM
27
134
0
28 Jun 2023
UniG3D: A Unified 3D Object Generation Dataset
Qinghong Sun
Yangguang Li
Zexia Liu
Xiaoshui Huang
Fenggang Liu
Xihui Liu
Wanli Ouyang
Jing Shao
22
6
0
19 Jun 2023
Randomized 3D Scene Generation for Generalizable Self-Supervised Pre-Training
Lanxiao Li
M. Heizmann
19
0
0
07 Jun 2023
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan
Chen Song
Youkang Kong
Qi-Xing Huang
3DPC
25
2
0
05 Jun 2023
Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes
Alexandros Delitzas
Maria Parelli
Nikolas Hars
G. Vlassis
Sotiris Anagnostidis
Gregor Bachmann
Thomas Hofmann
CLIP
12
19
0
04 Jun 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
41
158
0
01 Jun 2023
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
Guo Fan
Zekun Qi
Wenkai Shi
Kaisheng Ma
3DPC
SSL
20
9
0
31 May 2023
VoxDet: Voxel Learning for Novel Instance Detection
Bowen Li
Jiashun Wang
Yaoyu Hu
Chen Wang
S. Scherer
25
6
0
26 May 2023
Hierarchical Adaptive Voxel-guided Sampling for Real-time Applications in Large-scale Point Clouds
Ju Ouyang
Xiao Liu
Haoyao Chen
3DPC
24
2
0
23 May 2023
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans
Taiki Miyanishi
Daichi Azuma
Shuhei Kurita
M. Kawanabe
28
2
0
23 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
24
31
0
15 May 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
Le Xue
Ning Yu
Shu Zhen Zhang
Artemis Panagopoulou
Junnan Li
...
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
19
115
0
14 May 2023
PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer
Honghui Yang
Wenxiao Wang
Minghao Chen
Binbin Lin
Tong He
Huaguan Chen
Xiaofei He
Wanli Ouyang
3DPC
ViT
27
32
0
11 May 2023
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding
Li Jiang
Zetong Yang
Shaoshuai Shi
Vladislav Golyanik
Dengxin Dai
Bernt Schiele
3DPC
14
13
0
08 May 2023
3D Small Object Detection with Dynamic Spatial Pruning
Xiuwei Xu
Zhihao Sun
Ziwei Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
3DPC
13
3
0
05 May 2023
OctFormer: Octree-based Transformers for 3D Point Clouds
Peng-Shuai Wang
ViT
3DPC
19
81
0
04 May 2023
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
Yichen Xie
Chenfeng Xu
Marie-Julie Rakotosaona
Patrick Rim
F. Tombari
Kurt Keutzer
M. Tomizuka
Wei Zhan
3DPC
41
52
0
27 Apr 2023
Spatial-Language Attention Policies for Efficient Robot Learning
Priyam Parashar
Vidhi Jain
Xiaohan Zhang
Jay Vakil
Sam Powers
Yonatan Bisk
Chris Paxton
LM&Ro
32
5
0
21 Apr 2023
Align-DETR: Improving DETR with Simple IoU-aware BCE loss
Zhi Cai
Songtao Liu
Guodong Wang
Zheng Ge
Xiangyu Zhang
Di Huang
26
2
0
15 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
11
2
0
13 Apr 2023
CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes
Maria Parelli
Alexandros Delitzas
Nikolas Hars
G. Vlassis
Sotiris Anagnostidis
Gregor Bachmann
Thomas Hofmann
CLIP
13
50
0
12 Apr 2023
PointCAT: Cross-Attention Transformer for point cloud
Xincheng Yang
Mingze Jin
Weiji He
Qian Chen
3DPC
ViT
16
3
0
06 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
10
61
0
03 Apr 2023
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Yuheng Lu
Chenfeng Xu
Xi Wei
Xiaodong Xie
M. Tomizuka
Kurt Keutzer
Shanghang Zhang
3DPC
16
53
0
03 Apr 2023
Learning Second-Order Attentive Context for Efficient Correspondence Pruning
Xinyi Ye
Weiyue Zhao
Hao Lu
Zhiguo Cao
8
8
0
28 Mar 2023
Context-Aware Transformer for 3D Point Cloud Automatic Annotation
Xiaoyan Qian
Chang Liu
Xiaojuan Qi
Siew-Chong Tan
E. Lam
Ngai Wong
3DPC
ViT
51
6
0
27 Mar 2023
RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning
Yabin Zhu
Chenglong Li
Xiao Wang
Jin Tang
Zhixiang Huang
24
7
0
26 Mar 2023
ViPFormer: Efficient Vision-and-Pointcloud Transformer for Unsupervised Pointcloud Understanding
Hongyu Sun
Yongcai Wang
Xudong Cai
Xuewei Bai
Deying Li
ViT
3DPC
24
8
0
25 Mar 2023
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
Runsen Xu
Tai Wang
Wenwei Zhang
Runjian Chen
Jinkun Cao
Jiangmiao Pang
Dahua Lin
3DPC
29
29
0
23 Mar 2023
POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery
Ce Zheng
Xianpeng Liu
Guo-Jun Qi
C. L. P. Chen
3DH
113
32
0
23 Mar 2023
MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation
Yunsong Zhou
Quanpan Liu
Hongzi Zhu
Yunzhe Li
Shan Chang
Minyi Guo
3DPC
MDE
45
13
0
23 Mar 2023
OcTr: Octree-based Transformer for 3D Object Detection
Chao Zhou
Yanan Zhang
Jiaxin Chen
Di Huang
3DPC
ViT
19
41
0
22 Mar 2023
CLIP²: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Yi Zeng
Chenhan Jiang
Jiageng Mao
Jianhua Han
Chao Ye
Qingqiu Huang
Dit-Yan Yeung
Zhen Yang
Xiaodan Liang
Hang Xu
3DPC
VLM
CLIP
14
68
0
22 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
29
10
0
21 Mar 2023
CurveCloudNet: Processing Point Clouds with 1D Structure
Colton Stearns
Davis Rempe
Jiateng Liu
Alex Fu
Sebastien Mascha
Jeong Joon Park
Despoina Paschalidou
Leonidas J. Guibas
3DPC
11
1
0
21 Mar 2023