Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.17270
Cited By
v1
v2 (latest)
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
Re-assign community
ArXiv (abs)
PDF
HTML
Github (18★)
Papers citing
"BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"
50 / 973 papers shown
MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering
Yonglin Tian
Songlin Bai
Zhiyao Luo
Yutong Wang
Yisheng Lv
Fei-Yue Wang
Mamba
230
6
0
21 Aug 2024
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
Wanshui Gan
Fang Liu
Hongbin Xu
Ningkai Mo
Xiangwei Zhu
3DGS
521
37
0
21 Aug 2024
Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception
Jiaru Zhong
Haibao Yu
Tianyi Zhu
Jiahui Xu
Wenxian Yang
Zaiqing Nie
Chao Sun
304
11
0
20 Aug 2024
MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
ACM Multimedia (MM), 2024
Xiao Zhao
Xukun Zhang
Dingkang Yang
Mingyang Sun
Mingcheng Li
Shunli Wang
Lihua Zhang
MoE
193
7
0
17 Aug 2024
HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
IEEE Robotics and Automation Letters (RA-L), 2024
Xiao Zhao
Bo Chen
Mingyang Sun
Dingkang Yang
Youxing Wang
Xukun Zhang
Mingcheng Li
Dongliang Kou
Xiaoyi Wei
Lihua Zhang
301
14
0
17 Aug 2024
PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors
Rongxuan Wang
Xin Lu
Xiaoyang Liu
Xiaoyi Zou
Tongyi Cao
Ying Li
165
9
0
16 Aug 2024
HeightLane: BEV Heightmap guided 3D Lane Detection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Chaesong Park
Eunbin Seo
Jongwoo Lim
342
5
0
15 Aug 2024
MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers
Zichao Dong
Yilin Zhang
Xufeng Huang
Hang Ji
Zhan Shi
Xin Zhan
Junbo Chen
ViT
209
0
0
13 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
382
13
0
12 Aug 2024
Camera Perspective Transformation to Bird's Eye View via Spatial Transformer Model for Road Intersection Monitoring
ACM Symposium on Solid Modeling and Applications (SMA), 2024
Rukesh Prajapati
Amr S. El-Wakeel
266
1
0
10 Aug 2024
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Changze Li
Ziheng Ji
Zhe Chen
Tong Qin
Ming Yang
268
14
0
04 Aug 2024
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
Junyan Ye
Jun He
Weijia Li
Zhutao Lv
Yi Lin
Jinhua Yu
Haote Yang
Conghui He
401
1
0
03 Aug 2024
Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Hengyuan Zhang
David Paz
Yuliang Guo
Arun Das
Xinyu Huang
Karsten Haug
Henrik I. Christensen
Liu Ren
228
9
0
01 Aug 2024
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation
Shiyuan Chen
Jiaxin Zhang
Ruohong Mei
Yingfeng Cai
Haoran Yin
Tao Chen
Wei Sui
Cong Yang
279
3
0
31 Jul 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
430
2
0
27 Jul 2024
PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction
Nan Peng
Xun Zhou
Mingming Wang
Xiaojun Yang
Songming Chen
Guisong Chen
246
10
0
24 Jul 2024
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
323
14
0
24 Jul 2024
Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images
Dooseop Choi
Jungyu Kang
Taeghyun An
Kyounghwan Ahn
Kyoung‐Wook Min
214
0
0
24 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
238
2
0
24 Jul 2024
Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles
Seamie Hayes
Sushil Sharma
Ciarán Eising
316
2
0
23 Jul 2024
CarFormer: Self-Driving with Learned Object-Centric Representations
Shadi S. Hamdan
Fatma Guney
3DPC
OCL
266
10
0
22 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
286
5
0
22 Jul 2024
RoadPainter: Points Are Ideal Navigators for Topology transformER
Zhongxing Ma
Shuang Liang
Yongkun Wen
Weixin Lu
Guowei Wan
ViT
3DPC
276
11
0
22 Jul 2024
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang
Xu Gao
Tong Wang
Xin Hao
Yifeng Shi
Xiao Tan
Xiaoqing Ye
Jingdong Wang
3DPC
157
0
0
22 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
263
20
0
21 Jul 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
YiFan Duan
Yao Li
Yanyong Zhang
413
5
0
20 Jul 2024
The Research of Group Re-identification from Multiple Cameras
Hao Xiao
208
0
0
19 Jul 2024
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation
Florian Chabot
Nicolas Granger
G. Lapouge
3DGS
261
16
0
19 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
270
25
0
18 Jul 2024
Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement
Yulin He
Wei Chen
Tianci Xun
Yusong Tan
3DPC
289
1
0
18 Jul 2024
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation
Jian Sun
Yuqi Dai
Chi-Man Vong
Qing Xu
Shengbo Eben Li
Jianqiang Wang
Lei He
Keqiang Li
339
3
0
18 Jul 2024
Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving
Yuqi Dai
Jian Sun
Shengbo Eben Li
Qing Xu
Jianqiang Wang
Lei He
Keqiang Li
295
3
0
17 Jul 2024
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation
Ruijie Xu
Chuyu Zhang
Hui Ren
Xuming He
3DPC
278
9
0
17 Jul 2024
Monocular Occupancy Prediction for Scalable Indoor Scenes
Hongxiao Yu
Yu-Quan Wang
Yuntao Chen
Zhaoxiang Zhang
227
11
0
16 Jul 2024
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
Xiaoshuai Hao
Ruikai Li
Hui Zhang
Dingzhe Li
Rong Yin
Sangil Jung
Seungsang Park
ByungIn Yoo
Haimei Zhao
Jing Zhang
263
29
0
16 Jul 2024
Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures
Guoliang You
Xiaomeng Chu
YiFan Duan
Wenyu Zhang
Xingchen Li
Sha Zhang
Yao Li
Jianmin Ji
Yanyong Zhang
238
1
0
16 Jul 2024
Continuity Preserving Online CenterLine Graph Learning
Yunhui Han
Kun Yu
Zhiwei Li
GNN
3DPC
319
3
0
16 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
227
10
0
15 Jul 2024
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim
Youngseok Kim
Sihwan Hwang
H. Jeong
Dongsuk Kum
279
11
0
14 Jul 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang
Jinqing Zhang
Yanan Zhang
Qingjie Liu
Zhenghui Hu
Baohui Wang
Yunhong Wang
223
8
0
14 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
326
8
0
13 Jul 2024
Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data
Cherie Ho
Jiaye Zou
Omar Alama
Sai Mitheran Jagadesh Kumar
Benjamin Chiang
Taneesh Gupta
Chen Wang
Nikhil Varma Keetha
Katia Sycara
Sebastian Scherer
160
2
0
11 Jul 2024
MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps
Hang Wu
Zhenghao Zhang
Siyuan Lin
Xiangru Mu
Qiang Zhao
Ming Yang
Tong Qin
241
18
0
11 Jul 2024
BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight
Hang Wu
Zhenghao Zhang
Siyuan Lin
Tong Qin
Jin Pan
Qiang Zhao
Chunjing Xu
Ming Yang
180
16
0
11 Jul 2024
Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction
Yili Liu
Linzhan Mou
Xuan Yu
Chenrui Han
Sitong Mao
R. Xiong
Yue Wang
3DPC
260
20
0
10 Jul 2024
Window-to-Window BEV Representation Learning for Limited FoV Cross-View Geo-localization
Lei Cheng
Teng Wang
Lingquan Meng
Changyin Sun
267
2
0
09 Jul 2024
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
Xunjiang Gu
Guanyu Song
Igor Gilitschenski
Marco Pavone
Boris Ivanovic
225
8
0
09 Jul 2024
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
Jinhua Zhang
Hualian Sheng
Sijia Cai
Bing Deng
Qiao Liang
Wen Li
Ying Fu
Jieping Ye
Shuhang Gu
DiffM
698
5
0
08 Jul 2024
Towards Stable 3D Object Detection
Jiabao Wang
Qiang Meng
Guochao Liu
Liujiang Yan
Ke Wang
Ming-Ming Cheng
Qibin Hou
209
4
0
05 Jul 2024
Occupancy as Set of Points
Yiang Shi
Tianheng Cheng
Qian Zhang
Wenyu Liu
Xinggang Wang
3DPC
280
28
0
04 Jul 2024
Previous
1
2
3
...
7
8
9
...
18
19
20
Next
Page 8 of 20
Page
of 20
Go