ResearchTrend.AI
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

13 December 2022
Renrui Zhang
Liuhui Wang
Yu Qiao
Peng Gao
Hongsheng Li
    3DPC

Papers citing "Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders"

50 / 110 papers shown
Title
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models
Wan Xu
Tianyu Huang
Tianyu Qu
Guanglei Yang
Yiwen Guo
Wangmeng Zuo
11
0
0
28 Dec 2023
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Dongmei Zhang
Chang Li
Ray Zhang
Shenghao Xie
Wei Xue
Xiaodong Xie
Shanghang Zhang
VLM
17
13
0
22 Dec 2023
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
22
10
0
19 Dec 2023
Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders
Yaohua Zha
Huizhen Ji
Jinmin Li
Rongsheng Li
Tao Dai
Bin Chen
Zhi Wang
Shu-Tao Xia
3DPC
17
21
0
17 Dec 2023
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning
Weijie Wei
F. Karimi Nejadasl
Theo Gevers
Martin R. Oswald
3DPC
20
3
0
15 Dec 2023
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
Dingning Liu
Xiaomeng Dong
Renrui Zhang
Xu Luo
Peng Gao
Xiaoshui Huang
Yongshun Gong
Zhihui Wang
19
10
0
15 Dec 2023
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Haifeng Huang
Zehan Wang
Rongjie Huang
Luping Liu
Xize Cheng
Yang Zhao
Tao Jin
Zhou Zhao
52
40
0
13 Dec 2023
MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition
Dan Song
Xinwei Fu
Weizhi Nie
Wenhui Li
Lanjun Wang
You Yang
Anan Liu
VLM
16
6
0
30 Nov 2023
Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers
Bo Sun
Qixing Huang
Xiangru Huang
3DV
3DPC
17
0
0
21 Nov 2023
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
Yipeng Gao
Zeyu Wang
Wei-Shi Zheng
Cihang Xie
Yuyin Zhou
3DPC
19
8
0
03 Nov 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
85
18
0
26 Oct 2023
Can pre-trained models assist in dataset distillation?
Yao Lu
Xuguang Chen
Yuchen Zhang
Jianyang Gu
Tianle Zhang
Yifan Zhang
Xiaoniu Yang
Qi Xuan
Kai Wang
Yang You
DD
29
10
0
05 Oct 2023
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
Yiwen Tang
Ivan Tang
Ray Gu
Dong Wang
Eric Zhang
Bin Zhao
Xuelong Li
3DPC
24
19
0
04 Oct 2023
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning
Yang Liu
C. L. P. Chen
Can Wang
Xulin King
Mengyuan Liu
3DPC
19
7
0
25 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
28
168
0
20 Sep 2023
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Yiwen Tang
Xianzheng Ma
...
Ke Chen
Peng Gao
Xianzhi Li
Hongsheng Li
Pheng-Ann Heng
MLLM
14
123
0
01 Sep 2023
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
Zehan Wang
Haifeng Huang
Yang Zhao
Ziang Zhang
Zhou Zhao
16
58
0
17 Aug 2023
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Zekun Qi
Muzhou Yu
Runpei Dong
Kaisheng Ma
3DPC
11
11
0
28 Jul 2023
Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Ziyi Wang
Xumin Yu
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
3DPC
11
18
0
27 Jul 2023
A Universal Semantic-Geometric Representation for Robotic Manipulation
Tong Zhang
Yingdong Hu
Hanchen Cui
Hang Zhao
Yang Gao
60
16
0
18 Jun 2023
Explore In-Context Learning for 3D Point Cloud Understanding
Zhongbin Fang
Xiangtai Li
Xia Li
J. M. Buhmann
Chen Change Loy
Mengyuan Liu
3DPC
11
13
0
14 Jun 2023
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan
Chen Song
Youkang Kong
Qi-Xing Huang
3DPC
14
2
0
05 Jun 2023
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
Aoran Xiao
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DPC
27
18
0
31 May 2023
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
Guo Fan
Zekun Qi
Wenkai Shi
Kaisheng Ma
3DPC
SSL
12
9
0
31 May 2023
GTNet: Graph Transformer Network for 3D Point Cloud Classification and Semantic Segmentation
Wei Zhou
Qian Wang
Weiwei Jin
X. Shi
Yong Yu
ViT
3DPC
10
4
0
24 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
11
31
0
15 May 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
Le Xue
Ning Yu
Shu Zhen Zhang
Artemis Panagopoulou
Junnan Li
...
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
8
113
0
14 May 2023
VTPNet for 3D deep learning on point cloud
Wei Zhou
Weiwei Jin
Qian Wang
Yifan Wang
Dekui Wang
Xingxing Hao
Yong Yu
3DPC
ViT
14
0
0
10 May 2023
Self-supervised Learning for Pre-Training 3D Point Clouds: A Survey
Ben Fei
Weidong Yang
Liwen Liu
Tian-jian Luo
Rui Zhang
Yixuan Li
Ying He
3DPC
13
17
0
08 May 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
Xiang-yu Zhu
Renrui Zhang
Bowei He
A-Long Zhou
Dong Wang
Bingyan Zhao
Peng Gao
VLM
27
76
0
03 Apr 2023
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance
Zoey Guo
Yiwen Tang
Renrui Zhang
Dong Wang
Zhigang Wang
Bin Zhao
Xuelong Li
23
53
0
29 Mar 2023
Point2Vec for Self-Supervised Representation Learning on Point Clouds
Karim Abou Zeid
Jonas Schult
Alexander Hermans
Bastian Leibe
3DPC
12
26
0
29 Mar 2023
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
Yuxiao Chen
Jianbo Yuan
Yu Tian
Shijie Geng
Xinyu Li
Ding Zhou
Dimitris N. Metaxas
Hongxia Yang
12
30
0
27 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Peng Gao
Hongsheng Li
Jianbo Shi
3DPC
12
50
0
14 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
63
59
0
14 Mar 2023
Traj-MAE: Masked Autoencoders for Trajectory Prediction
Hao Chen
Jiaze Wang
Kun Shao
Furui Liu
Jianye Hao
Chenyong Guan
Guangyong Chen
Pheng-Ann Heng
50
37
0
12 Mar 2023
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
Junbo Zhang
Runpei Dong
Kaisheng Ma
CLIP
VLM
11
77
0
08 Mar 2023
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
Renrui Zhang
Xiangfei Hu
Bohao Li
Siyuan Huang
Hanqiu Deng
Hongsheng Li
Yu Qiao
Peng Gao
VLM
MLLM
13
167
0
03 Mar 2023
Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Jianbo Shi
3DPC
32
10
0
01 Mar 2023
Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzhi Li
Pheng-Ann Heng
3DPC
18
52
0
27 Feb 2023
Joint Representation Learning for Text and 3D Point Cloud
Rui Huang
Xuran Pan
Henry Zheng
Haojun Jiang
Zhifeng Xie
S. Song
Gao Huang
11
19
0
18 Jan 2023
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning
Pei-Kai Huang
L. Liu
Renrui Zhang
Song Zhang
Xin Xu
Bai-Qi Wang
G. Liu
3DPC
MDE
25
42
0
28 Dec 2022
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
Xiangyang Zhu
Renrui Zhang
Bowei He
Ziyu Guo
Ziyao Zeng
Zipeng Qin
Shanghang Zhang
Peng Gao
VLM
22
133
0
21 Nov 2022
Point-DAE: Denoising Autoencoders for Self-supervised Point Cloud Learning
Yabin Zhang
Jiehong Lin
Ruihuang Li
K. Jia
Lei Zhang
3DPC
9
6
0
13 Nov 2022
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Yanmin Wu
Xinhua Cheng
Renrui Zhang
Zesen Cheng
Jian Zhang
44
62
0
29 Sep 2022
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzheng Ma
Xupeng Miao
Xuming He
Bin Cui
VLM
AAML
55
108
0
28 Sep 2022
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds
Xu Yan
Jiantao Gao
Chaoda Zheng
Chao Zheng
Ruimao Zhang
Shenghui Cui
Zhen Li
3DPC
81
210
0
10 Jul 2022
Masked Autoencoders in 3D Point Cloud Representation Learning
Jincen Jiang
Xuequan Lu
Lizhi Zhao
Richard Dazeley
Meili Wang
3DPC
ViT
49
28
0
04 Jul 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang
Ziyu Guo
Rongyao Fang
Bingyan Zhao
Dong Wang
Yu Qiao
Hongsheng Li
Peng Gao
3DPC
167
241
0
28 May 2022
PointCLIP: Point Cloud Understanding by CLIP
Renrui Zhang
Ziyu Guo
Wei Zhang
Kunchang Li
Xupeng Miao
Bin Cui
Yu Qiao
Peng Gao
Hongsheng Li
VLM
3DPC
161
428
0
04 Dec 2021