ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.12735
  4. Cited By
Multi-Modal 3D Object Detection in Autonomous Driving: a Survey

Multi-Modal 3D Object Detection in Autonomous Driving: a Survey

24 June 2021
Yingjie Wang
Qi Mao
Hanqi Zhu
Jiajun Deng
Yu Zhang
Jianmin Ji
Houqiang Li
Yanyong Zhang
    3DPC
ArXivPDFHTML

Papers citing "Multi-Modal 3D Object Detection in Autonomous Driving: a Survey"

50 / 59 papers shown
Title
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti
Roberto Basla
Riccardo Pieroni
Matteo Corno
S. Savaresi
Luca Magri
Giacomo Boracchi
3DPC
44
0
0
25 Apr 2025
Enhancing Novel Object Detection via Cooperative Foundational Models
Enhancing Novel Object Detection via Cooperative Foundational Models
Rohit K Bharadwaj
Muzammal Naseer
Salman Khan
F. Khan
ObjD
VLM
100
1
0
17 Jan 2025
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Shuang Cui
Yi Li
Jiangmeng Li
Xiongxin Tang
Bing-Huang Su
Fanjiang Xu
Hui Xiong
51
0
0
15 Jan 2025
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation
  via Hierarchical Modality Selection
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Jiazhou Zhou
Lin Wang
Xuming Hu
72
4
0
22 Dec 2024
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding
Yi Liu
Chengxin Li
Shoukun Xu
J. Han
ViT
35
1
0
19 Oct 2024
Advancing Object Detection in Transportation with Multimodal Large
  Language Models (MLLMs): A Comprehensive Review and Empirical Testing
Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing
Huthaifa I. Ashqar
Ahmed Jaber
Taqwa I. Alhadidi
Mohammed Elhenawy
26
7
0
26 Sep 2024
UniBEVFusion: Unified Radar-Vision BEVFusion for 3D Object Detection
UniBEVFusion: Unified Radar-Vision BEVFusion for 3D Object Detection
Haocheng Zhao
Runwei Guan
Taoyu Wu
Ka Lok Man
Limin Yu
Yutao Yue
MDE
26
2
0
23 Sep 2024
NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar
NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar
Runwei Guan
Jianan Liu
Liye Jia
Haocheng Zhao
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yutao Yue
49
5
0
30 Aug 2024
BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and
  Localization
BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization
Mario A. V. Saucedo
Nikolaos Stathoulopoulos
Vidya Sumathy
Christoforos Kanellakis
G. Nikolakopoulos
3DPC
27
0
0
27 Aug 2024
LLMI3D: MLLM-based 3D Perception from a Single 2D Image
LLMI3D: MLLM-based 3D Perception from a Single 2D Image
Fan Yang
Sicheng Zhao
Yanhao Zhang
Haoxiang Chen
Hui Chen
Wenbo Tang
Guiguang Ding
33
4
0
14 Aug 2024
Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object
  Detection
Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection
A. Benfenati
P. Causin
Hang Yu
Zhedong Zheng
3DPC
39
2
0
01 Aug 2024
InScope: A New Real-world 3D Infrastructure-side Collaborative
  Perception Dataset for Open Traffic Scenarios
InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios
M. Zarei
Yining Li
B. Hellinga
Xiangyi Qin
Ying Shen
P. Izadpanah
Xiaojun Tan
34
2
0
31 Jul 2024
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular
  Transformer
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer
Yang Wu
Kaihua Zhang
Jianjun Qian
Jin Xie
Jian Yang
DiffM
37
4
0
29 Jul 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via
  Ray-Centric Strategies
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Yao Li
Yanyong Zhang
31
3
0
20 Jul 2024
Learning Modality-agnostic Representation for Semantic Segmentation from
  Any Modalities
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities
Xueye Zheng
Yuanhuiyi Lyu
Lin Wang
VLM
47
10
0
16 Jul 2024
Centering the Value of Every Modality: Towards Efficient and Resilient
  Modality-agnostic Semantic Segmentation
Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation
Xueye Zheng
Yuanhuiyi Lyu
Jiazhou Zhou
Lin Wang
27
7
0
16 Jul 2024
SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data
  Augmentation and Confidence Correction Mechanism
SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data Augmentation and Confidence Correction Mechanism
Ao Liang
Wenyu Chen
Jian Fang
Huaici Zhao
3DPC
31
0
0
01 Jul 2024
HVDistill: Transferring Knowledge from Images to Point Clouds via
  Unsupervised Hybrid-View Distillation
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation
Sha Zhang
Jiajun Deng
Lei Bai
Houqiang Li
Wanli Ouyang
Yanyong Zhang
3DPC
48
8
0
18 Mar 2024
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of
  Interest
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Jiajun Deng
Sha Zhang
Feras Dayoub
Wanli Ouyang
Yanyong Zhang
Ian Reid
3DPC
30
4
0
14 Mar 2024
NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth
  Supervision for Indoor Multi-View 3D Detection
NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Chenxi Huang
Yuenan Hou
Weicai Ye
Di Huang
Xiaoshui Huang
Binbin Lin
Deng Cai
Wanli Ouyang
3DV
3DPC
MDE
29
12
0
22 Feb 2024
LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection
LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection
Jingyu Song
Lingjun Zhao
Katherine A. Skinner
3DPC
16
9
0
18 Feb 2024
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of
  LiDAR-Camera Fusion for 3D Object Detection
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection
Till Beemelmanns
Quan Zhang
Christian Geller
Lutz Eckstein
3DPC
13
3
0
18 Feb 2024
Stream Query Denoising for Vectorized HD Map Construction
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
20
18
0
17 Jan 2024
DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception
DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception
Kai Jiang
Jiaxing Huang
Weiying Xie
Yunsong Li
Ling Shao
Shijian Lu
27
4
0
13 Jan 2024
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM
Ziying Song
Guoxin Zhang
Lin Liu
Lei Yang
Shaoqing Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
21
16
0
08 Jan 2024
Spatiotemporal Attention Enhances Lidar-Based Robot Navigation in
  Dynamic Environments
Spatiotemporal Attention Enhances Lidar-Based Robot Navigation in Dynamic Environments
Jorge de Heuvel
Xiangyu Zeng
Weixian Shi
Tharun Sethuraman
Maren Bennewitz
16
6
0
30 Oct 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive
  Survey and Evaluation
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
27
8
0
24 Oct 2023
GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for
  Multi-Modal 3D Object Detection
GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection
Ziying Song
Haiyue Wei
Lin Bai
Lei Yang
Caiyan Jia
3DPC
16
27
0
12 Oct 2023
3D Multiple Object Tracking on Autonomous Driving: A Literature Review
3D Multiple Object Tracking on Autonomous Driving: A Literature Review
Peng Zhang
Xin Li
Liang He
Xinhua Lin
33
2
0
27 Sep 2023
MapTRv2: An End-to-End Framework for Online Vectorized HD Map
  Construction
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction
Bencheng Liao
Shaoyu Chen
Yunchi Zhang
Bo Jiang
Qian Zhang
Wenyu Liu
Chang Huang
Xinggang Wang
3DV
ViT
51
112
0
10 Aug 2023
Deep Transfer Learning for Intelligent Vehicle Perception: a Survey
Deep Transfer Learning for Intelligent Vehicle Perception: a Survey
Xinyi Liu
Jinlong Li
Jin Ma
Huiming Sun
Zhigang Xu
Tianyu Zhang
Hongkai Yu
38
21
0
26 Jun 2023
A survey on deep learning approaches for data integration in autonomous
  driving system
A survey on deep learning approaches for data integration in autonomous driving system
Xi Zhu
Likang Wang
Caifa Zhou
Xiya Cao
Yue Gong
L. Chen
23
1
0
17 Jun 2023
Radars for Autonomous Driving: A Review of Deep Learning Methods and
  Challenges
Radars for Autonomous Driving: A Review of Deep Learning Methods and Challenges
Arvind Srivastav
S. Mandal
17
29
0
15 Jun 2023
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object
  Detection
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection
Yingjie Wang
Jiajun Deng
Yao Li
Jinshui Hu
Cong Liu
Yu Zhang
Jianmin Ji
Wanli Ouyang
Yanyong Zhang
8
26
0
02 Jun 2023
SDVRF: Sparse-to-Dense Voxel Region Fusion for Multi-modal 3D Object
  Detection
SDVRF: Sparse-to-Dense Voxel Region Fusion for Multi-modal 3D Object Detection
Binglu Ren
Jianqin Yin
3DPC
8
1
0
17 Apr 2023
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global
  Cross-Modal Fusion
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion
Xin Li
Tengyu Ma
Yuenan Hou
Botian Shi
Yucheng Yang
...
Xingjiao Wu
Qingsheng Chen
Yikang Li
Yu Qiao
Liangbo He
3DPC
19
81
0
07 Mar 2023
Towards Domain Generalization for Multi-view 3D Object Detection in
  Bird-Eye-View
Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View
Shuo Wang
Xinhai Zhao
Haiting Xu
Zehui Chen
Dameng Yu
Jiahao Chang
Zhen Yang
Feng Zhao
35
18
0
03 Mar 2023
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry
  Learning
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning
Pei-Kai Huang
L. Liu
Renrui Zhang
Song Zhang
Xin Xu
Bai-Qi Wang
G. Liu
3DPC
MDE
25
42
0
28 Dec 2022
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection
Shaoqing Xu
Fang Li
Dingfu Zhou
Jin Fang
Sifen Wang
Liangjun Zhang
3DPC
23
9
0
10 Dec 2022
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object
  Detection
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
Zehui Chen
Zhenyu Li
Shiquan Zhang
Liangji Fang
Qinhong Jiang
Feng Zhao
32
60
0
17 Nov 2022
PointSee: Image Enhances Point Cloud
PointSee: Image Enhances Point Cloud
Lipeng Gu
Xu Yan
Peng Cui
Lina Gong
H. Xie
F. Wang
Jing Qin
Mingqiang Wei
3DPC
23
4
0
03 Nov 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A
  Comprehensive Survey
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
35
4
0
19 Oct 2022
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object
  Detection
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
Xin Li
Botian Shi
Yuenan Hou
Xingjiao Wu
Tianlong Ma
Yikang Li
Liangbo He
3DPC
10
49
0
18 Oct 2022
JPerceiver: Joint Perception Network for Depth, Pose and Layout
  Estimation in Driving Scenes
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
Haimei Zhao
Jing Zhang
Sen Zhang
Dacheng Tao
11
15
0
16 Jul 2022
3D Object Detection for Autonomous Driving: A Comprehensive Survey
3D Object Detection for Autonomous Driving: A Comprehensive Survey
Jiageng Mao
Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
3DPC
19
194
0
19 Jun 2022
Multi-modal Sensor Fusion for Auto Driving Perception: A Survey
Multi-modal Sensor Fusion for Auto Driving Perception: A Survey
Keli Huang
Botian Shi
Xiang Li
Xin Li
Siyuan Huang
Yikang Li
11
129
0
06 Feb 2022
Agent-Centric Relation Graph for Object Visual Navigation
Agent-Centric Relation Graph for Object Visual Navigation
X. Hu
Youfang Lin
Shuo Wang
Zhihao Wu
Kai Lv
23
18
0
29 Nov 2021
VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and
  Stereo Data Fusion
VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and Stereo Data Fusion
Hanqi Zhu
Jiajun Deng
Yu Zhang
J. Ji
Qiuyu Mao
Houqiang Li
Yanyong Zhang
3DPC
35
131
0
29 Nov 2021
TransVG: End-to-End Visual Grounding with Transformers
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
17
327
0
17 Apr 2021
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
Jiajun Deng
Shaoshuai Shi
Pei-Cian Li
Wen-gang Zhou
Yanyong Zhang
Houqiang Li
3DPC
216
660
0
31 Dec 2020
12
Next