ResearchTrend.AI
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm


12 October 2023
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
Xianglong He
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
    SSL

Papers citing "PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm"

48 / 48 papers shown
Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D
Sergio Arnaud
Paul McVay
Ada Martin
Arjun Majumdar
Krishna Murthy Jatavallabhula
...
Nicolas Ballas
Mido Assran
Oleksandr Maksymets
Aravind Rajeswaran
Franziska Meier
3DPC
36
0
0
19 Apr 2025
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Pedro Hermosilla
Christian Stippel
Leon Sick
SSL
3DPC
62
0
0
09 Apr 2025
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Y. Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffM
VGen
51
2
0
24 Mar 2025
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data
Runjian Chen
Wenqi Shao
Bo-Wen Zhang
Shaoshuai Shi
Li Jiang
Ping Luo
48
0
0
11 Mar 2025
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
Ziliang Miao
Runjian Chen
Yixi Cai
Buwei He
Wenquan Zhao
Wenqi Shao
Bo-Wen Zhang
Fu Zhang
3DPC
44
0
0
10 Mar 2025
LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding
Ang Cao
Sergio Arnaud
Oleksandr Maksymets
Jianing Yang
Ayush Jain
...
Aravind Rajeswaran
Franziska Meier
Justin Johnson
Jeong Joon Park
Alexander Sax
52
0
0
27 Feb 2025
LeAP: Consistent multi-domain 3D labeling using Foundation Models
Simon Gebraad
Andras Palffy
Holger Caesar
77
1
0
06 Feb 2025
CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning
Runjian Chen
H. Zhang
Avinash Ravichandran
Wenqi Shao
Alex Wong
Ping Luo
3DPC
72
0
0
04 Dec 2024
Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
Hao Liu
Minglin Chen
Yanni Ma
Haihong Xiao
Ying He
3DGS
3DPC
68
0
0
27 Nov 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
31
3
0
17 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
32
5
0
10 Oct 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas J. Guibas
DD
36
5
0
30 Aug 2024
Text3DAug -- Prompted Instance Augmentation for LiDAR Perception
Laurenz Reichardt
Luca Uhr
Oliver Wasenmüller
21
4
0
26 Aug 2024
CooPre: Cooperative Pretraining for V2X Cooperative Perception
Seth Z. Zhao
Hao Xiang
Chenfeng Xu
Xin Xia
Bolei Zhou
Jiaqi Ma
3DPC
32
1
0
20 Aug 2024
InLUT3D: Challenging real indoor dataset for point cloud analysis
Jakub Walczak
3DPC
24
0
0
22 Jul 2024
Efficient Depth-Guided Urban View Synthesis
Sheng Miao
Jiaxin Huang
Dongfeng Bai
Weichao Qiu
Bingbing Liu
Andreas Geiger
Yiyi Liao
36
3
0
17 Jul 2024
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
Julian Straub
Daniel DeTone
Tianwei Shen
Nan Yang
Chris Sweeney
Richard A. Newcombe
EgoV
30
6
0
14 Jun 2024
Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Xinyu Zhang
Yuhan Liu
Haonan Chang
Abdeslam Boularias
36
1
0
12 Jun 2024
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models
Sijin Chen
Xin Chen
Anqi Pang
Xianfang Zeng
Wei Cheng
...
C. Zhang
Jingyi Yu
Gang Yu
Bin-Bin Fu
Tao Chen
AI4CE
42
35
0
31 May 2024
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
37
22
0
16 May 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakharov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
24
1
0
01 Apr 2024
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma
Peihao Wang
Zhiwen Fan
Zhangyang Wang
Hao Su
R. Ramamoorthi
VLM
24
7
0
27 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
37
110
0
22 Mar 2024
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
Bohao Peng
Xiaoyang Wu
Li Jiang
Yukang Chen
Hengshuang Zhao
Zhuotao Tian
Jiaya Jia
40
16
0
21 Mar 2024
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models
Lisa Weijler
Muhammad Jehanzeb Mirza
Leon Sick
Can Ekkazan
Pedro Hermosilla
TTA
20
0
0
18 Mar 2024
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Chengyao Wang
Li Jiang
Xiaoyang Wu
Zhuotao Tian
Bohao Peng
Hengshuang Zhao
Jiaya Jia
3DPC
SSL
69
13
0
14 Mar 2024
Point Cloud Mamba: Point Cloud Learning via State Space Model
Tao Zhang
Xiangtai Li
Haobo Yuan
Shunping Ji
Shuicheng Yan
24
18
0
01 Mar 2024
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding
Yu-Qi Yang
Yufeng Guo
Yang Liu
3DPC
27
2
0
22 Feb 2024
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
Haoyi Zhu
Yating Wang
Di Huang
Weicai Ye
Wanli Ouyang
Tong He
SSL
3DPC
23
18
0
04 Feb 2024
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration
Yifan Zhang
Siyu Ren
Junhui Hou
Jinjian Wu
Guangming Shi
SSL
3DPC
51
3
0
23 Jan 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
46
16
0
16 Jan 2024
Hulk: A Universal Knowledge Translator for Human-Centric Tasks
Yizhou Wang
YiXuan Wu
Shixiang Tang
Weizhen He
Xun Guo
...
Lei Bai
Rui Zhao
Jian Wu
Tong He
Wanli Ouyang
VLM
23
14
0
04 Dec 2023
Applications of Large Scale Foundation Models for Autonomous Driving
Yu Huang
Yue Chen
Zhu Li
ELM
AI4CE
LRM
ALM
LM&Ro
46
14
0
20 Nov 2023
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu
Xin Wen
Xihui Liu
Hengshuang Zhao
3DPC
109
23
0
24 Mar 2023
X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views
Haoyi Zhu
Haoshu Fang
Cewu Lu
19
9
0
11 Oct 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang
Ziyu Guo
Rongyao Fang
Bingyan Zhao
Dong Wang
Yu Qiao
Hongsheng Li
Peng Gao
3DPC
156
241
0
28 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding
Yujin Chen
Matthias Nießner
Angela Dai
3DPC
92
57
0
06 Dec 2021
Multimodal Virtual Point 3D Detection
Tianwei Yin
Xingyi Zhou
Philipp Krahenbuhl
3DPC
140
243
0
12 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
255
7,337
0
11 Nov 2021
Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation
Li Jiang
Shaoshuai Shi
Zhuotao Tian
Xin Lai
Shu-Lin Liu
Chi-Wing Fu
Jiaya Jia
SSL
3DPC
106
97
0
15 Oct 2021
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Zaiwei Zhang
Rohit Girdhar
Armand Joulin
Ishan Misra
3DPC
106
235
0
07 Jan 2021
Neural Sparse Voxel Fields
Lingjie Liu
Jiatao Gu
Kyaw Zaw Lin
Tat-Seng Chua
Christian Theobalt
177
1,234
0
22 Jul 2020
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
Saining Xie
Jiatao Gu
Demi Guo
C. Qi
Leonidas J. Guibas
Or Litany
3DPC
128
618
0
21 Jul 2020
Convolutional Occupancy Networks
Songyou Peng
Michael Niemeyer
L. Mescheder
Marc Pollefeys
Andreas Geiger
3DV
AI4CE
206
860
0
10 Mar 2020
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
229
3,029
0
09 Mar 2020
Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection
Benjin Zhu
Zhengkai Jiang
Xiangxin Zhou
Zeming Li
Gang Yu
3DPC
149
482
0
26 Aug 2019
Joint 2D-3D-Semantic Data for Indoor Scene Understanding
Iro Armeni
S. Sax
Amir Zamir
Silvio Savarese
3DV
3DPC
108
864
0
03 Feb 2017