Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2109.08141
Cited By
An End-to-End Transformer Model for 3D Object Detection
16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An End-to-End Transformer Model for 3D Object Detection"
50 / 294 papers shown
Title
UniDet3D: Multi-dataset Indoor 3D Object Detection
Maksim Kolodiazhnyi
Anna Vorontsova
Matvey Skripkin
D. Rukhovich
Anton Konushin
3DPC
290
13
0
06 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Neural Information Processing Systems (NeurIPS), 2024
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
245
9
0
06 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
192
33
0
27 Aug 2024
See It All: Contextualized Late Aggregation for 3D Dense Captioning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Minjung Kim
Hyung Suk Lim
Seung Hwan Kim
Soonyoung Lee
Bumsoo Kim
Gunhee Kim
190
4
0
14 Aug 2024
Bi-directional Contextual Attention for 3D Dense Captioning
European Conference on Computer Vision (ECCV), 2024
Minjung Kim
Hyung Suk Lim
Soonyoung Lee
Bumsoo Kim
Gunhee Kim
197
5
0
13 Aug 2024
MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas
Feng Qiao
Zhexiao Xiong
Xinge Zhu
Yuexin Ma
Qiumeng He
Nathan Jacobs
MDE
248
1
0
03 Aug 2024
Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection
European Conference on Computer Vision (ECCV), 2024
Sachin Pathiyan Cherumanal
Jiahao Lu
Damiano Spina
DiffM
304
9
0
01 Aug 2024
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
Shuting He
Henghui Ding
223
18
0
25 Jul 2024
Multi-modal Relation Distillation for Unified 3D Representation Learning
Huiqun Wang
Yiping Bao
Panwang Pan
Zeming Li
Xiao Liu
Ruijie Yang
Di Huang
244
3
0
19 Jul 2024
General Geometry-aware Weakly Supervised 3D Object Detection
Guowen Zhang
Junsong Fan
Liyi Chen
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
3DPC
245
5
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
222
10
0
18 Jul 2024
SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation
Lei Yao
Yi Wang
Moyun Liu
Lap-Pui Chau
279
7
0
16 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
Yanfeng Guo
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
169
19
0
15 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
212
10
0
13 Jul 2024
Pre-training Point Cloud Compact Model with Partial-aware Reconstruction
Yaohua Zha
Yanzi Wang
Tao Dai
Shu-Tao Xia
214
1
0
12 Jul 2024
Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
Zihao Li
Pan Gao
Kang-Soo You
Chuan Yan
Manoranjan Paul
3DPC
331
1
0
12 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
207
8
0
12 Jul 2024
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
Pengkun Jiao
Na Zhao
Yue Yu
Yu-Gang Jiang
VLM
ObjD
233
12
0
07 Jul 2024
Towards Open-set Camera 3D Object Detection
Zhuolin He
Xinrun Li
Heng Gao
Jiachen Tang
Shoumeng Qiu
Wenfu Wang
Lvjian Lu
Xuchong Qiu
Xiangyang Xue
Jian Pu
3DPC
226
1
0
25 Jun 2024
Video Frame Interpolation for Polarization via Swin-Transformer
Feng Huang
Xin Zhang
Yixuan Xu
Xuesong Wang
Xianyu Wu
236
0
0
17 Jun 2024
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding
Yunsong Wang
Na Zhao
Gim Hee Lee
SSL
3DV
245
0
0
17 Jun 2024
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
Julian Straub
Daniel DeTone
Tianwei Shen
Nan Yang
Chris Sweeney
Richard Newcombe
EgoV
226
14
0
14 Jun 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
221
4
0
12 Jun 2024
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man
Liang-Yan Gui
Yu-Xiong Wang
288
36
0
11 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
276
14
0
02 Jun 2024
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Weitai Kang
Mengxue Qu
Jyoti Kini
Yunchao Wei
Mubarak Shah
Yan Yan
LM&Ro
3DPC
222
16
0
28 May 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
Yaohua Zha
Naiqi Li
Yanzi Wang
Tao Dai
Hang Guo
Bin Chen
Zhi Wang
Zhihao Ouyang
Shu-Tao Xia
Mamba
310
20
0
27 May 2024
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning
Qingdong He
Jiangning Zhang
Jinlong Peng
Haoyang He
Yabiao Wang
Chengjie Wang
3DPC
191
30
0
24 May 2024
PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection
IEEE International Conference on Robotics and Automation (ICRA), 2024
Zhaoqi Leng
Pei Sun
Tong He
Drago Anguelov
Mingxing Tan
ViT
3DPC
180
5
0
05 May 2024
Dexterous Grasp Transformer
Guo-Hao Xu
Yi-Lin Wei
Dian Zheng
Xiao-Ming Wu
Wei-Shi Zheng
ViT
190
17
0
28 Apr 2024
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Ye Mao
Junpeng Jing
K. Mikolajczyk
VLM
127
0
0
25 Apr 2024
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
Shuofeng Sun
Yongming Rao
Jiwen Lu
Haibin Yan
256
10
0
23 Apr 2024
Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Aakash Kumar
Chen Chen
Lin Wang
N. Lobo
Mubarak Shah
3DPC
228
3
0
10 Apr 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
261
15
0
28 Mar 2024
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
305
16
0
26 Mar 2024
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos
European Conference on Computer Vision (ECCV), 2024
Florian Langer
Jihong Ju
Georgi Dikov
Gerhard Reitmayr
Mohsen Ghafoorian
3DPC
248
5
0
22 Mar 2024
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Jiahao Lu
Jiacheng Deng
Tianzhu Zhang
290
10
0
22 Mar 2024
3D Object Detection from Point Cloud via Voting Step Diffusion
Haoran Hou
Mingtao Feng
Zijie Wu
Weisheng Dong
Qing Zhu
Yaonan Wang
Lin Wang
DiffM
3DPC
201
6
0
21 Mar 2024
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model
A. Avetisyan
Christopher Xie
Henry Howard-Jenkins
Tsun-Yi Yang
Samir Aroudj
...
Campbell Orme
Jakob Engel
Edward Miller
Richard Newcombe
Vasileios Balntas
LM&Ro
182
54
0
19 Mar 2024
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Conference on Robot Learning (CoRL), 2024
Anushri Dixit
Zhiting Mei
Meghan Booker
Mariko Storey-Matsutani
Mariko Storey-Matsutani
Allen Z. Ren
Ola Shorinwa
Anirudha Majumdar
577
10
0
13 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
208
16
0
12 Mar 2024
Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection
Shi-tao Chen
Haolin Zhang
Nanning Zheng
3DPC
165
2
0
04 Mar 2024
SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking
Yu Lin
Zhiheng Li
Yubo Cui
Zheng Fang
3DPC
196
4
0
26 Feb 2024
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Qingdong He
Jinlong Peng
Zhengkai Jiang
Kai Wu
Xiaozhong Ji
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Mingang Chen
Yunsheng Wu
3DPC
171
14
0
21 Jan 2024
Surface Normal Estimation with Transformers
Barry Shichen Hu
Siyun Liang
Johannes Paetzold
H. Nguyen
Isao Echizen
Jiapeng Tang
ViT
3DPC
127
0
0
11 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
Computer Vision and Pattern Recognition (CVPR), 2024
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
143
44
0
08 Jan 2024
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
European Conference on Computer Vision (ECCV), 2024
Chenhang He
Ruihuang Li
Guowen Zhang
Lei Zhang
213
12
0
01 Jan 2024
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Dongmei Zhang
Chang Li
Ray Zhang
Shenghao Xie
Wei Xue
Xiaodong Xie
Shanghang Zhang
VLM
189
21
0
22 Dec 2023
SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
Yun Zhu
Le Hui
Yaqi Shen
Jin Xie
3DPC
122
13
0
21 Dec 2023
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
Lunhao Duan
Shanshan Zhao
Nan Xue
Biwei Huang
Gui-Song Xia
Dacheng Tao
ViT
367
32
0
18 Dec 2023
Previous
1
2
3
4
5
6
Next