Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.05625
Cited By
PETR: Position Embedding Transformation for Multi-View 3D Object Detection
10 March 2022
Yingfei Liu
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PETR: Position Embedding Transformation for Multi-View 3D Object Detection"
50 / 384 papers shown
Title
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
M. Tomizuka
W. Zhan
DiffM
41
2
0
02 Nov 2024
GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection
Xiaotian Li
Baojie Fan
Jiandong Tian
Huijie Fan
3DPC
56
9
0
01 Nov 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
N. Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV
EDL
3DPC
54
1
0
31 Oct 2024
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving
Maciej K. Wozniak
Hariprasath Govindarajan
Marvin Klingner
Camille Maurice
B Ravi Kiran
S. Yogamani
3DPC
55
1
0
30 Oct 2024
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
Gyusam Chang
Jiwon Lee
Donghyun Kim
Jinkyu Kim
Dongwook Lee
Daehyun Ji
Sujin Jang
Sangpil Kim
42
1
0
29 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
M. Hosseinzadeh
Ian Reid
33
1
0
28 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
43
1
0
17 Oct 2024
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho
Yulong Cao
Jiachen Sun
Qingzhao Zhang
Marco Pavone
Jeong Joon Park
Heng Yang
Z. Morley Mao
34
0
0
16 Oct 2024
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Jinxiang Wang
Mamba
72
4
0
16 Oct 2024
UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles
Hui Ye
Rajshekhar Sunderraman
Shihao Ji
26
2
0
14 Oct 2024
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object
Jiwei Chen
Laiyan Ding
Chi Zhang
Feifei Li
Rui Huang
28
0
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
52
0
0
14 Oct 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
34
77
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
43
3
0
09 Oct 2024
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
36
0
0
09 Oct 2024
Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
40
0
0
09 Oct 2024
Cross-Camera Data Association via GNN for Supervised Graph Clustering
Đorđe Nedeljković
26
0
0
01 Oct 2024
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
134
32
0
26 Sep 2024
Enhancing Fruit and Vegetable Detection in Unconstrained Environment with a Novel Dataset
Sandeep Khanna
Chiranjoy Chattopadhyay
Suman Kundu
25
2
0
20 Sep 2024
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework
Xiaoyu Li
Peidong Li
Lijun Zhao
Dedong Liu
Jinghan Gao
Xian Wu
Yitao Wu
Dixiao Cui
VOT
38
1
0
18 Sep 2024
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Jinrang Jia
Guangqi Yi
Yifeng Shi
34
0
0
18 Sep 2024
DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving
Haisheng Su
Wei Wu
Junchi Yan
39
0
0
15 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
38
9
0
14 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
31
0
0
09 Sep 2024
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping
Shuang Zeng
Xinyuan Chang
Xinran Liu
Zheng Pan
Xing Wei
37
1
0
09 Sep 2024
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Zhiwei Lin
Zhe Liu
Yongtao Wang
Le Zhang
Ce Zhu
39
4
0
08 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
Yukai Ma
Xuemeng Yang
Licheng Wen
Tiantian Wei
Min Dou
Yukai Ma
Min Dou
Botian Shi
Yong Liu
DiffM
VGen
69
3
0
06 Sep 2024
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Dingyuan Zhang
Dingkang Liang
Zichang Tan
Xiaoqing Ye
Cheng Zhang
Jingdong Wang
Xiang Bai
ViT
51
2
0
01 Sep 2024
Enhancing Vectorized Map Perception with Historical Rasterized Maps
Xiaoyu Zhang
Guangwei Liu
Zihao Liu
Ningyi Xu
Yunhui Liu
Ji Zhao
36
7
0
01 Sep 2024
RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning
Sha Lu
Xuecheng Xu
Yuxuan Wu
Haojian Lu
Xieyuanli Chen
R. Xiong
Yue Wang
43
2
0
30 Aug 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
32
0
0
29 Aug 2024
AdaOcc: Adaptive-Resolution Occupancy Prediction
Chao-Yeh Chen
Ruoyu Wang
Yuliang Guo
Cheng Zhao
Xinyu Huang
Chen Feng
Liu Ren
50
0
0
24 Aug 2024
Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection
Tamás Matuszka
Peter Hajas
Dávid Szeghy
42
0
0
22 Aug 2024
HeightLane: BEV Heightmap guided 3D Lane Detection
Chaesong Park
Eunbin Seo
Jongwoo Lim
106
2
0
15 Aug 2024
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
40
2
0
13 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
51
4
0
12 Aug 2024
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning
Changze Li
Ziheng Ji
Zhe Chen
Tong Qin
Ming Yang
50
4
0
04 Aug 2024
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation
Shiyuan Chen
Jiaxin Zhang
Ruohong Mei
Yingfeng Cai
Haoran Yin
Tao Chen
Wei Sui
Cong Yang
31
0
0
31 Jul 2024
CardioSyntax: end-to-end SYNTAX score prediction -- dataset, benchmark and method
Alexander Ponomarchuk
Ivan Kruzhilov
Galina Zubkova
Artem Shadrin
Ruslan Utegenov
Ivan Bessonov
Pavel Blinov
38
0
0
29 Jul 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
43
2
0
27 Jul 2024
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
50
7
0
24 Jul 2024
Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images
Dooseop Choi
Jungyu Kang
Taeghyun An
Kyounghwan Ahn
Kyoung‐Wook Min
27
0
0
24 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
46
0
0
24 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
58
1
0
22 Jul 2024
RoadPainter: Points Are Ideal Navigators for Topology transformER
Zhongxing Ma
Shuang Liang
Yongkun Wen
Weixin Lu
Guowei Wan
ViT
3DPC
33
6
0
22 Jul 2024
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang
Xu Gao
Tong Wang
Xin Hao
Yifeng Shi
Xiao Tan
Xiaoqing Ye
Jingdong Wang
3DPC
37
0
0
22 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
32
3
0
15 Jul 2024
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim
Youngseok Kim
Sihwan Hwang
H. Jeong
Dongsuk Kum
40
4
0
14 Jul 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang
Jinqing Zhang
Yanan Zhang
Qingjie Liu
Zhenghui Hu
Baohui Wang
Yunhong Wang
32
2
0
14 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
45
2
0
13 Jul 2024
Previous
1
2
3
4
5
6
7
8
Next