2212.06785
Cited By
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
13 December 2022
Renrui Zhang
Liuhui Wang
Yu Qiao
Peng Gao
Hongsheng Li
3DPC
Papers citing
"Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders"
50 / 110 papers shown
GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Liangyu Xu
Yingxiu Zhao
J. Wang
Yingyao Wang
Bu Pi
...
Jihao Gu
X. Li
Xiaoyong Zhu
Jun Song
Bo Zheng
LRM
82
1
0
17 Apr 2025
MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning
Xu Han
Yuan Tang
Jinfeng Xu
Xianzhi Li
48
0
0
24 Mar 2025
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
Hariprasath Govindarajan
Maciej K. Wozniak
Marvin Klingner
Camille Maurice
B. R. Kiran
S. Yogamani
53
0
0
12 Mar 2025
Spectral Informed Mamba for Robust Point Cloud Processing
Ali Bahri
Moslem Yazdanpanah
Mehrdad Noori
Sahar Dastani
Milad Cheraghalikhani
David Osowiechi
G. A. V. Hakim
Farzad Beizaee
Ismail Ben Ayed
Christian Desrosiers
Mamba
3DPC
67
0
0
06 Mar 2025
CrossOver: 3D Scene Cross-Modal Alignment
S. Sarkar
O. Mikšík
Marc Pollefeys
Daniel Barath
Iro Armeni
3DPC
69
0
0
20 Feb 2025
Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis
Hongyu Sun
Qiuhong Ke
Y. Wang
Wang Chen
Kang Yang
Deying Li
Jianfei Cai
3DPC
63
3
0
17 Jan 2025
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Jiaqi Wang
Hengshuang Zhao
83
6
0
02 Jan 2025
Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging
Ali Bahri
Moslem Yazdanpanah
Mehrdad Noori
Sahar Dastani Oghani
Milad Cheraghalikhani
David Osowiechi
Farzad Beizaee
G. A. V. Hakim
Ismail Ben Ayed
Christian Desrosiers
3DPC
TTA
38
0
0
31 Dec 2024
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Yueru Jia
Jiaming Liu
Sixiang Chen
Chenyang Gu
Z. Wang
...
Lily Lee
Pengwei Wang
Zhongyuan Wang
Renrui Zhang
Shanghang Zhang
87
11
0
27 Nov 2024
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
Feifei Shao
Ping Liu
Zhao Wang
Yawei Luo
Hongwei Wang
Jun Xiao
3DPC
64
0
0
25 Nov 2024
Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders
Yaohua Zha
Tao Dai
Yanzi Wang
Hang Guo
Taolin Zhang
Zhihao Ouyang
Chunlin Fan
B. Chen
Ke Chen
Shu-Tao Xia
3DPC
23
1
0
13 Oct 2024
Pic@Point: Cross-Modal Learning by Local and Global Point-Picture Correspondence
Vencia Herzog
Stefan Suwelack
3DPC
21
0
0
12 Oct 2024
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning
Dingkang Liang
Tianrui Feng
Xin Zhou
Yumeng Zhang
Zhikang Zou
Xiang Bai
18
5
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
31
3
0
09 Oct 2024
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta
Shreyas Verma
Ujjwala Anantheswaran
Kevin Scaria
Mihir Parmar
Swaroop Mishra
Chitta Baral
ReLM
LRM
24
4
0
06 Oct 2024
Triple Point Masking
Jiaming Liu
Linghe Kong
Yue Wu
Maoguo Gong
Hao Li
Qiguang Miao
Wenping Ma
Can Qin
3DPC
20
0
0
26 Sep 2024
RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning
Kunming Su
Qiuxia Wu
Panpan Cai
Xiaogang Zhu
Xuequan Lu
Zhiyong Wang
Kun Hu
3DPC
27
2
0
31 Aug 2024
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Chengzhuo Tong
Peng Gao
Chunyuan Li
Pheng-Ann Heng
VGen
3DPC
42
9
0
29 Aug 2024
Positional Prompt Tuning for Efficient 3D Representation Learning
Shaochen Zhang
Zekun Qi
Runpei Dong
Xiuxiu Bai
Xing Wei
31
2
0
21 Aug 2024
PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders
Xiangdong Zhang
Shaofeng Zhang
Junchi Yan
3DPC
30
2
0
16 Aug 2024
DC3DO: Diffusion Classifier for 3D Objects
Nursena Koprucu
Meher Shashwat Nigam
Shicheng Xu
Biruk Abere
Gabriele Dominici
Andrew Rodriguez
Sharvaree Vadgam
Berfin Inal
Alberto Tono
DiffM
23
0
0
13 Aug 2024
Masked Image Modeling: A Survey
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
N. Sebe
53
6
0
13 Aug 2024
Past Movements-Guided Motion Representation Learning for Human Motion Prediction
Junyu Shi
Baoxuan Wang
3DH
29
0
0
04 Aug 2024
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers
Longkun Zou
Wanru Zhu
Ke Chen
Lihua Guo
K. Guo
Kui Jia
Yaowei Wang
3DPC
ViT
22
0
0
26 Jul 2024
Multi-modal Relation Distillation for Unified 3D Representation Learning
Huiqun Wang
Yiping Bao
Panwang Pan
Zeming Li
Xiao Liu
Ruijie Yang
Di Huang
45
0
0
19 Jul 2024
Adapt PointFormer: 3D Point Cloud Analysis via Adapting 2D Visual Transformers
Mengke Li
Da Li
Guoqing Yang
Yiu-ming Cheung
Hui Huang
3DPC
35
0
0
18 Jul 2024
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance
Xiaoxu Xu
Yitian Yuan
Jinlong Li
Qiudan Zhang
Zequn Jie
Lin Ma
Hao Tang
N. Sebe
Xu Wang
38
2
0
13 Jul 2024
Pre-training Point Cloud Compact Model with Partial-aware Reconstruction
Yaohua Zha
Yanzi Wang
Tao Dai
Shu-Tao Xia
40
0
0
12 Jul 2024
PointViG: A Lightweight GNN-based Model for Efficient Point Cloud Analysis
Qiang Zheng
Yafei Qi
Chen Wang
Chao Zhang
Jian Sun
3DPC
27
2
0
01 Jul 2024
Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds
Hongliang Zeng
Ping Zhang
Fang Li
Jiahua Wang
Tingyu Ye
Pengteng Guo
3DPC
26
0
0
25 Jun 2024
Lightweight Model Pre-training via Language Guided Knowledge Distillation
Mingsheng Li
Lin Zhang
Mingzhen Zhu
Zilong Huang
Gang Yu
Jiayuan Fan
Tao Chen
29
0
0
17 Jun 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
Yaohua Zha
Naiqi Li
Yanzi Wang
Tao Dai
Hang Guo
Bin Chen
Zhi Wang
Zhihao Ouyang
Shu-Tao Xia
Mamba
37
8
0
27 May 2024
GeoMask3D: Geometrically Informed Mask Selection for Self-Supervised Point Cloud Learning in 3D
Ali Bahri
Moslem Yazdanpanah
Mehrdad Noori
Milad Cheraghalikhani
G. A. V. Hakim
David Osowiechi
Farzad Beizaee
Ismail Ben Ayed
Christian Desrosiers
3DPC
53
1
0
20 May 2024
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
Xue-Qiu Jiang
Sheng Jin
Xiaoqin Zhang
Ling Shao
Shijian Lu
MDE
39
6
0
13 May 2024
ESP-Zero: Unsupervised enhancement of zero-shot classification for Extremely Sparse Point cloud
Jiayi Han
Zidi Cao
Weibo Zheng
Xiangguo Zhou
Xiangjian He
Yuanfang Zhang
Daisen Wei
3DPC
33
0
0
30 Apr 2024
Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Ayumu Saito
Prachi Kudeshia
Jiju Poovvancheri
3DPC
22
7
0
25 Apr 2024
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Yiwen Tang
Ray Zhang
Jiaming Liu
Zoey Guo
Dong Wang
...
Bin Zhao
Shanghang Zhang
Peng Gao
Hongsheng Li
Xuelong Li
30
10
0
11 Apr 2024
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
Xiangyang Zhu
Renrui Zhang
Bowei He
Ziyu Guo
Jiaming Liu
Han Xiao
Chaoyou Fu
Hao Dong
Peng Gao
3DPC
23
8
0
05 Apr 2024
iSeg: Interactive 3D Segmentation via Interactive Attention
Itai Lang
Fei Xu
Dale Decatur
Sudarshan Babu
Rana Hanocka
30
4
0
04 Apr 2024
SUGAR: Pre-training 3D Visual Representations for Robotics
Shizhe Chen
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
34
13
0
01 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakharov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
32
12
0
01 Apr 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Renrui Zhang
Dongzhi Jiang
Yichi Zhang
Haokun Lin
Ziyu Guo
...
Aojun Zhou
Pan Lu
Kai-Wei Chang
Peng Gao
Hongsheng Li
27
165
0
21 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
30
10
0
12 Mar 2024
MaskLRF: Self-supervised Pretraining via Masked Autoencoding of Local Reference Frames for Rotation-invariant 3D Point Set Analysis
Takahiko Furuya
3DPC
30
2
0
01 Mar 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
33
49
0
27 Feb 2024
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Hongyu Sun
Yongcai Wang
Wang Chen
Haoran Deng
Deying Li
VPVLM
36
5
0
24 Feb 2024
Attention-Guided Masked Autoencoders For Learning Image Representations
Leon Sick
Dominik Engel
Pedro Hermosilla
Timo Ropinski
30
1
0
23 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
43
41
0
19 Feb 2024
PointMamba: A Simple State Space Model for Point Cloud Analysis
Dingkang Liang
Xin Zhou
Wei Xu
Xingkui Zhu
Zhikang Zou
Xiaoqing Ye
Xinyu Wang
Xiang Bai
79
87
0
16 Feb 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun-Xiong Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
29
13
0
31 Dec 2023