Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.07732
Cited By
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
15 August 2023
Haiyang Wang
Hao Tang
Shaoshuai Shi
Aoxue Li
Zhenguo Li
Bernt Schiele
Liwei Wang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation"
46 / 46 papers shown
Title
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
28
0
0
12 May 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
L. Wang
Zhipeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
J. Ji
Y. Zhang
3DPC
36
0
0
17 Apr 2025
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
Konyul Park
Yecheol Kim
Daehun Kim
Jun-Won Choi
34
0
0
25 Mar 2025
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
Hongyu Sun
Qiuhong Ke
Ming Cheng
Y. Wang
Deying Li
Chenhui Gou
Jianfei Cai
3DPC
84
0
0
15 Mar 2025
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Xuzhong Hu
Zaipeng Duan
Pei An
Jun zhang
Jie Ma
3DPC
80
0
0
12 Mar 2025
SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection
Hyeongseok Son
Jia He
Seung-In Park
Ying Min
Yunhao Zhang
ByungIn Yoo
45
0
0
11 Mar 2025
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
Shijia Zhao
Qiming Xia
Xusheng Guo
Pufan Zou
Maoji Zheng
Hai Wu
Chenglu Wen
Cheng-Yu Wang
3DPC
60
0
0
09 Mar 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
L. Zhang
Philip H. S. Torr
63
4
0
24 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
58
1
0
04 Feb 2025
CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Y. Li
Yang Yang
Zhen Lei
3DPC
46
2
0
11 Jan 2025
Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation
Christian Witte
Jens Behley
Cyrill Stachniss
Marvin Raaijmakers
78
0
0
02 Dec 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
J. T. Wang
90
0
0
25 Nov 2024
A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data
Kavin Chandrasekaran
Sorin Grigorescu
Gijs Dubbelman
P. Jancura
67
0
0
20 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
43
0
0
16 Nov 2024
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Haiyang Wang
Yue Fan
Muhammad Ferjad Naeem
Yongqin Xian
J. E. Lenssen
Liwei Wang
F. Tombari
Bernt Schiele
36
2
0
30 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
25
3
0
09 Oct 2024
OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping
Jiale Wei
Junwei Zheng
Ruiping Liu
Jie Hu
Jiaming Zhang
Rainer Stiefelhagen
13
1
0
20 Sep 2024
Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception
Jiaru Zhong
Haibao Yu
Tianyi Zhu
Jiahui Xu
Wenxian Yang
Zaiqing Nie
Chao Sun
17
0
0
20 Aug 2024
OccMamba: Semantic Occupancy Prediction with State Space Models
Heng Li
Yuenan Hou
Xiaohan Xing
Xiao Sun
Xiao Sun
Yanyong Zhang
Mamba
45
2
0
19 Aug 2024
MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation
Xiao Zhao
Xukun Zhang
Dingkang Yang
Mingyang Sun
Mingcheng Li
Shunli Wang
Lihua Zhang
MoE
25
1
0
17 Aug 2024
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
19
0
0
13 Aug 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
26
2
0
27 Jul 2024
BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection
Yang Song
Lin Wang
19
3
0
27 Jun 2024
ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection
Ziying Song
Feiyang Jia
Hongyu Pan
Yadan Luo
Caiyan Jia
Guoxin Zhang
Lin Liu
Yang Ji
Lei Yang
Li-e Wang
35
9
0
27 May 2024
Addressing Diverging Training Costs using Local Restoration for Precise Bird's Eye View Map Construction
Minsu Kim
Giseop Kim
Sunwook Choi
19
0
0
02 May 2024
DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning
S. Yogamani
David Unger
Venkatraman Narayanan
Varun Ravi Kumar
18
1
0
09 Apr 2024
Weak-to-Strong 3D Object Detection with X-Ray Distillation
Alexander Gambashidze
Aleksandr Dadukin
Maksim Golyadkin
Maria Razzhivina
Ilya Makarov
27
2
0
31 Mar 2024
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
Junbo Yin
Jianbing Shen
Runnan Chen
Wei Li
Ruigang Yang
Pascal Frossard
Wenguan Wang
3DPC
21
31
0
22 Mar 2024
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception
Yiheng Li
Hongyang Li
Zehao Huang
Hong Chang
Naiyan Wang
33
2
0
15 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
22
10
0
14 Mar 2024
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Jiajun Deng
Sha Zhang
Feras Dayoub
Wanli Ouyang
Yanyong Zhang
Ian Reid
3DPC
28
4
0
14 Mar 2024
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
Hongcheng Zhang
Liu Liang
Pengxin Zeng
Xiao Song
Zhe Wang
24
7
0
12 Mar 2024
Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving
Zhili Chen
K. T. Pham
Maosheng Ye
Zhiqiang Shen
Qifeng Chen
3DPC
28
0
0
10 Mar 2024
MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection
Yuxue Yang
Lue Fan
Zhaoxiang Zhang
3DPC
34
6
0
29 Jan 2024
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
James Gunn
Zygmunt Lenyk
Anuj Sharma
Andrea Donati
Alexandru Buburuzan
John Redford
Romain Mueller
MDE
25
8
0
22 Dec 2023
Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection
Junjie Huang
Yun Ye
Zhujin Liang
Yi Shan
Dalong Du
3DPC
11
16
0
13 Nov 2023
PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds
Hao-Yu Yang
Haiyang Wang
Di Dai
Liwei Wang
3DPC
13
4
0
08 Nov 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
27
8
0
24 Oct 2023
FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection
Chunyong Hu
Hang Zheng
Kun Li
Jianyun Xu
Weibo Mao
...
Kaixuan Liu
Yiru Zhao
Peihan Hao
Minzhe Liu
Kaicheng Yu
ViT
3DPC
11
14
0
11 Sep 2023
CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds
Haiyang Wang
Lihe Ding
Shaocong Dong
Shaoshuai Shi
Aoxue Li
Jianan Li
Zhenguo Li
Liwei Wang
3DPC
132
67
0
09 Oct 2022
Multimodal Virtual Point 3D Detection
Tianwei Yin
Xingyi Zhou
Philipp Krahenbuhl
3DPC
143
243
0
12 Nov 2021
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Scott Ettinger
Shuyang Cheng
Benjamin Caine
Chenxi Liu
Hang Zhao
...
Jiquan Ngiam
Vijay Vasudevan
Alexander McCauley
Jonathon Shlens
Drago Anguelov
123
421
0
20 Apr 2021
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Hongsheng Li
3DPC
135
400
0
31 Jan 2021
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
Jiajun Deng
Shaoshuai Shi
Pei-Cian Li
Wen-gang Zhou
Yanyong Zhang
Houqiang Li
3DPC
216
660
0
31 Dec 2020
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu
Mingming Gong
Chaohui Wang
Kayhan Batmanghelich
Dacheng Tao
MDE
178
1,687
0
06 Jun 2018
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
210
13,886
0
02 Dec 2016
1