Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.08141
Cited By
An End-to-End Transformer Model for 3D Object Detection
16 September 2021
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An End-to-End Transformer Model for 3D Object Detection"
50 / 274 papers shown
Title
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Pedro Hermosilla
Christian Stippel
Leon Sick
SSL
3DPC
74
0
0
09 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
47
0
0
04 Apr 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
62
0
0
26 Mar 2025
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Jiangyi Wang
Na Zhao
39
0
0
20 Mar 2025
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation
Zinqin Huang
Gu Wang
Chenyangguang Zhang
Ruida Zhang
Xiu Li
Xiangyang Ji
46
0
0
19 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
59
0
0
18 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
46
0
0
09 Mar 2025
HexPlane Representation for 3D Semantic Scene Understanding
Zeren Chen
Yuenan Hou
Yulin Chen
Li Liu
Xiao Sun
Lu Sheng
3DPC
59
0
0
07 Mar 2025
MESC-3D:Mining Effective Semantic Cues for 3D Reconstruction from a Single Image
Shaoming Li
Qing Cai
Songqi Kong
Runqing Tan
Heng Tong
Shiji Qiu
Yongguo Jiang
Z. Liu
3DV
3DPC
47
0
0
28 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
L. Zhang
Philip H. S. Torr
69
4
0
24 Feb 2025
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Khanh Nguyen
Ghulam Mubashar Hassan
Ajmal Saeed Mian
3DPC
42
0
0
15 Feb 2025
PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation
Xu Ma
Mengsheng Chen
Junhui Zhang
Lijuan Song
Fang Du
Zhenhua Yu
ViT
MedIm
26
0
0
06 Jan 2025
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Jiaqi Wang
Hengshuang Zhao
83
6
0
02 Jan 2025
RelationField: Relate Anything in Radiance Fields
Sebastian Koch
Johanna Wald
Mirco Colosi
Narunas Vaskevicius
Pedro Hermosilla
F. Tombari
Timo Ropinski
109
1
0
18 Dec 2024
PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion
Yi Zhong
Weize Quan
Dong-ming Yan
J. Jiang
Yingmei Wei
3DPC
81
1
0
11 Dec 2024
Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
Hao Liu
Minglin Chen
Yanni Ma
Haihong Xiao
Ying He
3DGS
3DPC
73
0
0
27 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
83
1
0
23 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Timing Yang
Yuanliang Ju
Li Yi
3DPC
32
3
0
31 Oct 2024
NeFF-BioNet: Crop Biomass Prediction from Point Cloud to Drone Imagery
Xuesong Li
Zeeshan Hayder
Ali Zia
Connor Cassidy
Shiming Liu
W. Stiller
Eric A. Stone
Warren C. Conaty
Lars Petersson
V. Rolland
30
0
0
30 Oct 2024
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps
Yating Xu
Chen Li
G. Lee
3DPC
26
1
0
28 Oct 2024
Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding
Yang Liu
Daizong Liu
Wei Hu
3DPC
16
1
0
21 Oct 2024
SAM-Guided Masked Token Prediction for 3D Scene Understanding
Zhimin Chen
Liang Yang
Yingwei Li
Longlong Jing
Bing Li
32
3
0
16 Oct 2024
Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders
Yaohua Zha
Tao Dai
Yanzi Wang
Hang Guo
Taolin Zhang
Zhihao Ouyang
Chunlin Fan
B. Chen
Ke Chen
Shu-Tao Xia
3DPC
28
1
0
13 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Renhe Jiang
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
58
4
0
07 Oct 2024
Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection
Pengxi Zeng
Alberto Presta
Jonah Reinis
Dinesh Bharadia
Hang Qiu
Pamela Cosman
3DPC
14
1
0
01 Oct 2024
Formula-Supervised Visual-Geometric Pre-training
Ryosuke Yamada
Kensho Hara
Hirokatsu Kataoka
Koshi Makihara
Nakamasa Inoue
Rio Yokota
Y. Satoh
21
1
0
20 Sep 2024
UNIT: Unsupervised Online Instance Segmentation through Time
Corentin Sautier
Gilles Puy
Alexandre Boulch
Renaud Marlet
Vincent Lepetit
27
1
0
12 Sep 2024
UniDet3D: Multi-dataset Indoor 3D Object Detection
Maksim Kolodiazhnyi
Anna Vorontsova
Matvey Skripkin
D. Rukhovich
Anton Konushin
3DPC
23
2
0
06 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
29
4
0
06 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
30
7
0
27 Aug 2024
See It All: Contextualized Late Aggregation for 3D Dense Captioning
Minjung Kim
Hyung Suk Lim
Seung Hwan Kim
Soonyoung Lee
Bumsoo Kim
Gunhee Kim
47
4
0
14 Aug 2024
Bi-directional Contextual Attention for 3D Dense Captioning
Minjung Kim
Hyung Suk Lim
Soonyoung Lee
Bumsoo Kim
Gunhee Kim
35
3
0
13 Aug 2024
MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas
Feng Qiao
Zhexiao Xiong
Xinge Zhu
Yuexin Ma
Qiumeng He
Nathan Jacobs
MDE
16
1
0
03 Aug 2024
Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Sachin Pathiyan Cherumanal
Jiahao Lu
Damiano Spina
DiffM
12
4
0
01 Aug 2024
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
Shuting He
Henghui Ding
44
10
0
25 Jul 2024
Multi-modal Relation Distillation for Unified 3D Representation Learning
Huiqun Wang
Yiping Bao
Panwang Pan
Zeming Li
Xiao Liu
Ruijie Yang
Di Huang
50
0
0
19 Jul 2024
General Geometry-aware Weakly Supervised 3D Object Detection
Guowen Zhang
Junsong Fan
Liyi Chen
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
3DPC
45
2
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
38
2
0
18 Jul 2024
SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation
Lei Yao
Yi Wang
Moyun Liu
Lap-Pui Chau
31
0
0
16 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
Zhe Liu
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
33
6
0
15 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
34
5
0
13 Jul 2024
Pre-training Point Cloud Compact Model with Partial-aware Reconstruction
Yaohua Zha
Yanzi Wang
Tao Dai
Shu-Tao Xia
40
0
0
12 Jul 2024
Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
Zihao Li
Pan Gao
Kang-Soo You
Chuan Yan
Manoranjan Paul
3DPC
21
0
0
12 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
32
3
0
12 Jul 2024
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
Pengkun Jiao
Na Zhao
Jingjing Chen
Yu-Gang Jiang
VLM
ObjD
21
3
0
07 Jul 2024
Towards Open-set Camera 3D Object Detection
Zhuolin He
Xinrun Li
Heng Gao
Jiachen Tang
Shoumeng Qiu
Wenfu Wang
Lvjian Lu
Xuchong Qiu
Xiangyang Xue
Jian Pu
3DPC
26
0
0
25 Jun 2024
Video Frame Interpolation for Polarization via Swin-Transformer
Feng Huang
Xin Zhang
Yixuan Xu
Xuesong Wang
Xianyu Wu
27
0
0
17 Jun 2024
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding
Yunsong Wang
Na Zhao
Gim Hee Lee
SSL
3DV
21
0
0
17 Jun 2024
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
Julian Straub
Daniel DeTone
Tianwei Shen
Nan Yang
Chris Sweeney
Richard A. Newcombe
EgoV
30
6
0
14 Jun 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
40
0
0
12 Jun 2024
1
2
3
4
5
6
Next