Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.01256
Cited By
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
2 June 2022
Yingfei Liu
Junjie Yan
Fan Jia
Shuailin Li
Q. Gao
Tiancai Wang
X. Zhang
Jian-jun Sun
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images"
50 / 55 papers shown
Title
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
39
0
0
13 May 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
64
0
0
28 Apr 2025
Towards Latency-Aware 3D Streaming Perception for Autonomous Driving
Jiaqi Peng
Tai Wang
Jiangmiao Pang
Yuan Shen
33
0
0
27 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
27
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
35
0
0
17 Apr 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
71
0
0
18 Mar 2025
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
Weiyi Xiong
Zean Zou
Qiuchi Zhao
Fengchun He
Bing Zhu
59
0
0
21 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
61
1
0
04 Feb 2025
SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
Yuhang He
Sangyun Shin
Anoop Cherian
Niki Trigoni
Andrew Markham
70
0
0
31 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Houqiang Li
Yanyong Zhang
97
0
0
17 Dec 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
79
0
0
25 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
45
0
0
16 Nov 2024
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
93
29
0
26 Sep 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Yao Li
Yanyong Zhang
31
3
0
20 Jul 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
32
0
0
12 Jun 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
41
1
0
17 May 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
25
7
0
22 Apr 2024
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Hou-I Liu
Christine Wu
Jen-Hao Cheng
Wenhao Chai
Shian-Yun Wang
...
Jenq-Neng Hwang
Hong-Han Shuai
Wen-Huang Cheng
Hong-Han Shuai
Wen-Huang Cheng
34
2
0
07 Apr 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
57
13
0
18 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
55
12
0
12 Mar 2024
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Rui Song
Chenwei Liang
Hu Cao
Zhiran Yan
Walter Zimmer
Markus Gross
Andreas Festag
Alois C. Knoll
16
21
0
12 Feb 2024
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
Yifeng Bai
Zhirong Chen
Pengpeng Liang
Erkang Cheng
Erkang Cheng
ViT
20
6
0
09 Feb 2024
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
19
8
0
19 Dec 2023
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
22
63
0
22 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
25
14
0
16 Oct 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
Jianbing Shen
VLM
33
73
0
08 Sep 2023
HeightFormer: Explicit Height Modeling without Extra Data for Camera-only 3D Object Detection in Bird's Eye View
Yiming Wu
Rui Li
Zequn Qin
Xinhai Zhao
Xi Li
25
11
0
25 Jul 2023
EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps
Yuzhe He
Shuang Liang
Xiaofei Rui
Chengying Cai
Guowei Wan
16
6
0
18 Jul 2023
BEVScope: Enhancing Self-Supervised Depth Estimation Leveraging Bird's-Eye-View in Dynamic Scenarios
Yucheng Mao
Ruowen Zhao
Tianbao Zhang
Hang Zhao
8
3
0
20 Jun 2023
Geometric-aware Pretraining for Vision-centric 3D Object Detection
Linyan Huang
Huijie Wang
J. Zeng
Shengchuan Zhang
Liujuan Cao
Junchi Yan
Hongyang Li
3DPC
57
9
0
06 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
22
35
0
03 Apr 2023
BEVFusion4D: Learning LiDAR-Camera Fusion Under Bird's-Eye-View via Cross-Modality Guidance and Temporal Aggregation
Hongxiang Cai
Zeyuan Zhang
Zhenyu Zhou
Ziyin Li
Wenbo Ding
Jiu-Yang Zhao
3DPC
16
29
0
30 Mar 2023
3D Video Object Detection with Learnable Object-Centric Global Optimization
Jiawei He
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
3DH
3DPC
51
9
0
27 Mar 2023
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
39
188
0
21 Mar 2023
X
3
^3
3
KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection
Marvin Klingner
Shubhankar Borse
V. Kumar
B. Rezaei
V. Narayanan
S. Yogamani
Fatih Porikli
29
21
0
03 Mar 2023
Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey
Apoorv Singh
Varun Bankiti
3DPC
15
23
0
13 Feb 2023
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Yangguang Li
Bin Huang
Zeren Chen
Yufeng Cui
Feng Liang
...
Fenggang Liu
Enze Xie
Lu Sheng
Wanli Ouyang
Jing Shao
22
41
0
29 Jan 2023
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation
Hao Dong
Xianjing Zhang
Jintao Xu
Rui Ai
Weihao Gu
Huimin Lu
Juho Kannala
Xieyuanli Chen
13
31
0
28 Nov 2022
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection
Linfeng Zhang
Yukang Shi
Hung-Shuo Tai
Zhipeng Zhang
Yuan He
Ke Wang
Kaisheng Ma
18
2
0
14 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Hongxiang Jiang
Wenming Meng
Hongmei Zhu
Q. Zhang
Jihao Yin
24
4
0
31 Oct 2022
Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
Georg Hess
Johan Jaxing
Elias Svensson
David Hagerman
Christoffer Petersson
Lennart Svensson
3DPC
ViT
16
33
0
01 Jul 2022
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
Wei-Chih Hung
Vincent Casser
Henrik Kretzschmar
Jyh-Jing Hwang
Drago Anguelov
13
26
0
15 Jun 2022
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection
Kaicheng Yu
Tao Tang
Hongwei Xie
Zhiwei Lin
Zhongwei Wu
...
Jiong Deng
Dayang Hao
Yongtao Wang
Xi Liang
Bing Wang
3DPC
19
52
0
30 May 2022
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
14
863
0
26 May 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
Hongsheng Li
ViT
MDE
37
78
0
24 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
14
131
0
08 Mar 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
138
703
0
28 Jan 2022
Efficiently Identifying Task Groupings for Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
201
235
1
10 Sep 2021
Multi-Modal 3D Object Detection in Autonomous Driving: a Survey
Yingjie Wang
Qi Mao
Hanqi Zhu
Jiajun Deng
Yu Zhang
Jianmin Ji
Houqiang Li
Yanyong Zhang
3DPC
23
133
0
24 Jun 2021
1
2
Next