PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

2 June 2022
Yingfei Liu
Junjie Yan
Fan Jia
Shuailin Li
Q. Gao
Tiancai Wang
X. Zhang
Jian-jun Sun
    3DPC

Papers citing "PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images"

50 / 51 papers shown
Title
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
39
0
0
13 May 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
50
0
0
28 Apr 2025
Towards Latency-Aware 3D Streaming Perception for Autonomous Driving
Jiaqi Peng
Tai Wang
Jiangmiao Pang
Yuan Shen
33
0
0
27 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
35
0
0
17 Apr 2025
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
Weiyi Xiong
Zean Zou
Qiuchi Zhao
Fengchun He
Bing Zhu
59
0
0
21 Feb 2025
SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
Yuhang He
Sangyun Shin
Anoop Cherian
Niki Trigoni
Andrew Markham
70
0
0
31 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Houqiang Li
Yanyong Zhang
90
0
0
17 Dec 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
79
0
0
25 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
43
0
0
16 Nov 2024
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
87
29
0
26 Sep 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Yao Li
Yanyong Zhang
31
3
0
20 Jul 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
29
0
0
12 Jun 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
39
1
0
17 May 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
25
7
0
22 Apr 2024
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Hou-I Liu
Christine Wu
Jen-Hao Cheng
Wenhao Chai
Shian-Yun Wang
...
Jenq-Neng Hwang
Hong-Han Shuai
Wen-Huang Cheng
34
2
0
07 Apr 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
51
13
0
18 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
49
12
0
12 Mar 2024
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Rui Song
Chenwei Liang
Hu Cao
Zhiran Yan
Walter Zimmer
Markus Gross
Andreas Festag
Alois C. Knoll
16
21
0
12 Feb 2024
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
Yifeng Bai
Zhirong Chen
Pengpeng Liang
Erkang Cheng
ViT
20
6
0
09 Feb 2024
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
19
8
0
19 Dec 2023
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
16
63
0
22 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
25
14
0
16 Oct 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
VLM
27
71
0
08 Sep 2023
HeightFormer: Explicit Height Modeling without Extra Data for Camera-only 3D Object Detection in Bird's Eye View
Yiming Wu
Rui Li
Zequn Qin
Xinhai Zhao
Xi Li
25
11
0
25 Jul 2023
EgoVM: Achieving Precise Ego-Localization using Lightweight Vectorized Maps
Yuzhe He
Shuang Liang
Xiaofei Rui
Chengying Cai
Guowei Wan
14
6
0
18 Jul 2023
BEVScope: Enhancing Self-Supervised Depth Estimation Leveraging Bird's-Eye-View in Dynamic Scenarios
Yucheng Mao
Ruowen Zhao
Tianbao Zhang
Hang Zhao
8
3
0
20 Jun 2023
Geometric-aware Pretraining for Vision-centric 3D Object Detection
Linyan Huang
Huijie Wang
J. Zeng
Shengchuan Zhang
Liujuan Cao
Junchi Yan
Hongyang Li
3DPC
51
9
0
06 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
22
35
0
03 Apr 2023
BEVFusion4D: Learning LiDAR-Camera Fusion Under Bird's-Eye-View via Cross-Modality Guidance and Temporal Aggregation
Hongxiang Cai
Zeyuan Zhang
Zhenyu Zhou
Ziyin Li
Wenbo Ding
Jiu-Yang Zhao
3DPC
13
29
0
30 Mar 2023
3D Video Object Detection with Learnable Object-Centric Global Optimization
Jiawei He
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
3DH
3DPC
33
9
0
27 Mar 2023
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
39
188
0
21 Mar 2023
X$^3$KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection
Marvin Klingner
Shubhankar Borse
V. Kumar
B. Rezaei
V. Narayanan
S. Yogamani
Fatih Porikli
29
21
0
03 Mar 2023
Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey
Apoorv Singh
Varun Bankiti
3DPC
13
23
0
13 Feb 2023
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Yangguang Li
Bin Huang
Zeren Chen
Yufeng Cui
Feng Liang
...
Fenggang Liu
Enze Xie
Lu Sheng
Wanli Ouyang
Jing Shao
19
41
0
29 Jan 2023
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation
Hao Dong
Xianjing Zhang
Jintao Xu
Rui Ai
Weihao Gu
Huimin Lu
Juho Kannala
Xieyuanli Chen
13
31
0
28 Nov 2022
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection
Linfeng Zhang
Yukang Shi
Hung-Shuo Tai
Zhipeng Zhang
Yuan He
Ke Wang
Kaisheng Ma
18
2
0
14 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Hongxiang Jiang
Wenming Meng
Hongmei Zhu
Q. Zhang
Jihao Yin
21
4
0
31 Oct 2022
Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
Georg Hess
Johan Jaxing
Elias Svensson
David Hagerman
Christoffer Petersson
Lennart Svensson
3DPC
ViT
13
33
0
01 Jul 2022
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
Wei-Chih Hung
Vincent Casser
Henrik Kretzschmar
Jyh-Jing Hwang
Drago Anguelov
13
26
0
15 Jun 2022
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection
Kaicheng Yu
Tao Tang
Hongwei Xie
Zhiwei Lin
Zhongwei Wu
...
Jiong Deng
Dayang Hao
Yongtao Wang
Xi Liang
Bing Wang
3DPC
13
52
0
30 May 2022
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
14
863
0
26 May 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
ViT
MDE
37
78
0
24 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
14
131
0
08 Mar 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
138
703
0
28 Jan 2022
Efficiently Identifying Task Groupings for Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
201
235
1
10 Sep 2021
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
Anthony Hu
Zak Murez
Nikhil C. Mohan
Sofía Dudas
Jeffrey Hawke
Vijay Badrinarayanan
R. Cipolla
Alex Kendall
131
254
0
21 Apr 2021
Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection
Yuliang Guo
Guang Chen
Peitao Zhao
Weide Zhang
Jinghao Miao
Jingao Wang
Tae Eun Choe
3DPC
58
104
0
24 Mar 2020
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
167
596
0
12 Mar 2020
Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection
Benjin Zhu
Zhengkai Jiang
Xiangxin Zhou
Zeming Li
Gang Yu
3DPC
155
482
0
26 Aug 2019