ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.15676
  4. Cited By
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive
  Survey and Evaluation

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

24 October 2023
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
ArXivPDFHTML

Papers citing "Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation"

15 / 15 papers shown
Title
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding
Feng Xiao
Hongbin Xu
Guocan Zhao
Wenxiong Kang
34
0
0
07 May 2025
Non-contact Multimodal Indoor Human Monitoring Systems: A Survey
Non-contact Multimodal Indoor Human Monitoring Systems: A Survey
L. Nguyen
Praneeth Susarla
Anirban Mukherjee
Manuel Lage Cañellas
Constantino Álvarez Casado
Xiaoting Wu
Olli Silvén
D. Jayagopi
Miguel Bordallo López
8
1
0
11 Dec 2023
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor
  3D Object Detection
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
Yichen Xie
Chenfeng Xu
Marie-Julie Rakotosaona
Patrick Rim
F. Tombari
Kurt Keutzer
M. Tomizuka
Wei Zhan
3DPC
39
49
0
27 Apr 2023
Set-the-Scene: Global-Local Training for Generating Controllable NeRF
  Scenes
Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes
Dana Cohen-Bar
Elad Richardson
G. Metzer
Raja Giryes
Daniel Cohen-Or
63
52
0
23 Mar 2023
MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving
MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving
Jiale Li
Hang Dai
Hao Han
Yong Ding
3DPC
35
68
0
15 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual
  Grounding
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Yanmin Wu
Xinhua Cheng
Renrui Zhang
Zesen Cheng
Jian Zhang
41
62
0
29 Sep 2022
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds
Xu Yan
Jiantao Gao
Chaoda Zheng
Chao Zheng
Ruimao Zhang
Shenghui Cui
Zhen Li
3DPC
81
210
0
10 Jul 2022
VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and
  Stereo Data Fusion
VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and Stereo Data Fusion
Hanqi Zhu
Jiajun Deng
Yu Zhang
J. Ji
Qiuyu Mao
Houqiang Li
Yanyong Zhang
3DPC
29
131
0
29 Nov 2021
Multimodal Virtual Point 3D Detection
Multimodal Virtual Point 3D Detection
Tianwei Yin
Xingyi Zhou
Philipp Krahenbuhl
3DPC
140
243
0
12 Nov 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
275
3,784
0
18 Apr 2021
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding
  on Point Clouds through Instance Multi-level Contextual Referring
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Zhihao Yuan
Xu Yan
Yinghong Liao
Ruimao Zhang
Sheng Wang
Zhen Li
Shuguang Cui
59
128
0
01 Mar 2021
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
C. Qi
Xinlei Chen
Or Litany
Leonidas J. Guibas
3DPC
175
239
0
29 Jan 2020
Multi-view PointNet for 3D Scene Understanding
Multi-view PointNet for 3D Scene Understanding
M. Jaritz
Jiayuan Gu
Hao Su
3DPC
132
140
0
30 Sep 2019
PointNet: Deep Learning on Point Sets for 3D Classification and
  Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
210
13,886
0
02 Dec 2016
1