Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.01055
Cited By
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
3 October 2022
Tianyu Huang
Bowen Dong
Yunhan Yang
Xiaoshui Huang
Rynson W. H. Lau
Wanli Ouyang
W. Zuo
VLM
3DPC
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training"
50 / 109 papers shown
Title
SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
Zhongrui Yu
Haoran Wang
Jinze Yang
Hanzhang Wang
Zeke Xie
Yunfeng Cai
Jiale Cao
Zhong Ji
Mingming Sun
3DGS
52
19
0
29 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
136
301
0
21 Mar 2024
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
Djamahl Etchegaray
Zi Huang
Tatsuya Harada
Yadan Luo
21
9
0
20 Mar 2024
UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All
Yuanhuiyi Lyu
Xueye Zheng
Jiazhou Zhou
Lin Wang
30
14
0
19 Mar 2024
Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples
Ziqi Zhou
Minghui Li
Wei Liu
Shengshan Hu
Yechao Zhang
Wei Wan
Lulu Xue
Leo Yu Zhang
Dezhong Yao
Hai Jin
SILM
AAML
40
9
0
16 Mar 2024
A
3
^{3}
3
lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Zeng Tao
Yan Wang
Junxiong Lin
Haoran Wang
Xinji Mai
...
Ziheng Zhou
Shaoqi Yan
Qing Zhao
Liyuan Han
Wenqiang Zhang
30
11
0
07 Mar 2024
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang
Shengcao Cao
Yu-Xiong Wang
30
16
0
28 Feb 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
39
49
0
27 Feb 2024
GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Haoyuan Li
Yanpeng Zhou
Yihan Zeng
Hang Xu
Xiaodan Liang
3DGS
CLIP
11
0
0
09 Feb 2024
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
Qingdong He
Jinlong Peng
Zhengkai Jiang
Kai Wu
Xiaozhong Ji
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Mingang Chen
Yunsheng Wu
3DPC
13
7
0
21 Jan 2024
Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding
Qi Sun
Xiao Cui
Wen-gang Zhou
Houqiang Li
3DPC
16
1
0
15 Jan 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Xinpeng Ding
Jinahua Han
Hang Xu
Xiaodan Liang
Wei Zhang
Xiaomeng Li
18
38
0
02 Jan 2024
Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels
Rui Huang
Songyou Peng
Ayca Takmaz
Federico Tombari
Marc Pollefeys
Shiji Song
Gao Huang
Francis Engelmann
VLM
13
37
0
28 Dec 2023
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models
Wan Xu
Tianyu Huang
Tianyu Qu
Guanglei Yang
Yiwen Guo
Wangmeng Zuo
13
0
0
28 Dec 2023
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
Huan Liu
Zichang Tan
Chuangchuang Tan
Yunchao Wei
Yao-Min Zhao
Jingdong Wang
ViT
26
40
0
27 Dec 2023
Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges
Weiguang Zhao
Guanyu Yang
Rui Zhang
Chenru Jiang
Chaolong Yang
Yuyao Yan
Amir Hussain
Kaizhu Huang
VLM
31
3
0
12 Dec 2023
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
Tianyu Huang
Yihan Zeng
Zhilu Zhang
Wan Xu
Hang Xu
Songcen Xu
Rynson W. H. Lau
Wangmeng Zuo
23
25
0
11 Dec 2023
Cross-BERT for Point Cloud Pretraining
Xin Li
Peng Li
Zeyong Wei
Zhe Zhu
Mingqiang Wei
Junhui Hou
Liangliang Nan
J. Qin
H. Xie
F. Wang
SSL
3DPC
15
0
0
08 Dec 2023
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Zhangyang Qi
Ye Fang
Zeyi Sun
Xiaoyang Wu
Tong Wu
Jiaqi Wang
Dahua Lin
Hengshuang Zhao
MLLM
71
35
0
05 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
26
3
0
05 Dec 2023
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model
Guozhang Li
Xinpeng Ding
De-Chun Cheng
Jie Li
Nannan Wang
Xinbo Gao
25
1
0
05 Dec 2023
Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
3DPC
11
6
0
04 Dec 2023
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Jin-Chuan Shi
Miao Wang
Hao-Bin Duan
Shao-Hua Guan
3DGS
25
83
0
30 Nov 2023
MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition
Dan Song
Xinwei Fu
Weizhi Nie
Wenhui Li
Lanjun Wang
You Yang
Anan Liu
VLM
19
6
0
30 Nov 2023
Point Cloud Pre-training with Diffusion Models
Xiao Zheng
Xiaoshui Huang
Guofeng Mei
Yuenan Hou
Zhaoyang Lyu
Bo Dai
Wanli Ouyang
Yongshun Gong
17
18
0
25 Nov 2023
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
Yipeng Gao
Zeyu Wang
Wei-Shi Zheng
Cihang Xie
Yuyin Zhou
3DPC
19
8
0
03 Nov 2023
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Chunlei Wang
Wenquan Feng
Xiangtai Li
Guangliang Cheng
Shuchang Lyu
Binghao Liu
Lijiang Chen
Qi Zhao
ObjD
VLM
21
9
0
22 Oct 2023
Uni3D: Exploring Unified 3D Representation at Scale
Junsheng Zhou
Jinsheng Wang
Baorui Ma
Yu-Shen Liu
Tiejun Huang
Xinlong Wang
26
86
0
10 Oct 2023
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Tianyu Huang
Yihan Zeng
Bowen Dong
Hang Xu
Songcen Xu
Rynson W. H. Lau
Wangmeng Zuo
DiffM
18
9
0
29 Sep 2023
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection
Chenming Zhu
Wenwei Zhang
Tai Wang
Xihui Liu
Kai-xiang Chen
3DPC
37
18
0
18 Sep 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
39
46
0
01 Sep 2023
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Yiwen Tang
Xianzheng Ma
...
Ke Chen
Peng Gao
Xianzhi Li
Hongsheng Li
Pheng-Ann Heng
MLLM
17
123
0
01 Sep 2023
PointLLM: Empowering Large Language Models to Understand Point Clouds
Runsen Xu
Xiaolong Wang
Tai Wang
Yilun Chen
Jiangmiao Pang
Dahua Lin
MLLM
51
146
0
31 Aug 2023
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
3DV
VLM
16
28
0
01 Aug 2023
LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
Hao-Liang Yang
Liyuan Pan
Yan Yang
Richard Hartley
Miaomiao Liu
VLM
27
9
0
19 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Bernard Ghanem
Dacheng Tao
ObjD
VLM
27
134
0
28 Jun 2023
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
Aoran Xiao
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DPC
27
18
0
31 May 2023
Point Cloud Completion Guided by Prior Knowledge via Causal Inference
Songxue Gao
Chuanqi Jiao
Ruidong Chen
Weijie Wang
Weizhi Nie
3DPC
8
0
0
28 May 2023
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Sitian Shen
Zilin Zhu
Linqian Fan
Harry Zhang
Xinxiao Wu
DiffM
24
26
0
25 May 2023
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding
Minghua Liu
Ruoxi Shi
Kaiming Kuang
Yinhao Zhu
Xuanlin Li
Shizhong Han
H. Cai
Fatih Porikli
Hao Su
3DPC
22
115
0
18 May 2023
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding
Zhang Tao
Su He
D. Tao
Bin Chen
Zhi Wang
Shutao Xia
VLM
27
21
0
18 May 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
Le Xue
Ning Yu
Shu Zhen Zhang
Artemis Panagopoulou
Junnan Li
...
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
13
113
0
14 May 2023
Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models
Yaohua Zha
Jinpeng Wang
Tao Dai
Bin Chen
Zhi Wang
Shutao Xia
VLM
40
45
0
14 Apr 2023
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding
Yu-Qi Yang
Yu-Xiao Guo
Jiangfeng Xiong
Yang Liu
Hao Pan
Peng-Shuai Wang
Xin Tong
B. Guo
ViT
28
75
0
14 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
10
61
0
03 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
34
451
0
03 Apr 2023
CLIP
2
^2
2
: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data
Yi Zeng
Chenhan Jiang
Jiageng Mao
Jianhua Han
Chao Ye
Qingqiu Huang
Dit-Yan Yeung
Zhen Yang
Xiaodan Liang
Hang Xu
3DPC
VLM
CLIP
14
68
0
22 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Peng Gao
Hongsheng Li
Jianbo Shi
3DPC
14
50
0
14 Mar 2023
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
Junbo Zhang
Runpei Dong
Kaisheng Ma
CLIP
VLM
11
77
0
08 Mar 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
28
117
0
05 Feb 2023
Previous
1
2
3
Next