Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.02624
Cited By
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
6 October 2021
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation"
34 / 234 papers shown
Title
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
Anuj Diwan
Puyuan Peng
Raymond J. Mooney
VLM
18
2
0
03 Nov 2022
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Aditya Sanghi
Rao Fu
Vivian Liu
Karl Willis
Hooman Shayani
Amir Hosein Khasahmadi
Srinath Sridhar
Daniel E. Ritchie
13
51
0
02 Nov 2022
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training
Junfan Lin
Jianlong Chang
Lingbo Liu
Guanbin Li
Liang Lin
Qi Tian
Changan Chen
VGen
38
26
0
28 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
17
116
0
20 Oct 2022
TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition
Y. Chen
Rui Chen
Jiabao Lei
Yabin Zhang
K. Jia
CLIP
11
81
0
20 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Xiaohui Zeng
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
27
478
0
12 Oct 2022
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization
Tanvir Mahmud
Diana Marculescu
CLIP
11
29
0
11 Oct 2022
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Han-Hung Lee
Angel X. Chang
11
63
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
24
2,302
0
29 Sep 2022
GAMA: Generative Adversarial Multi-Object Scene Attacks
Abhishek Aich
Calvin-Khang Ta
Akash Gupta
Chengyu Song
S. Krishnamurthy
M. Salman Asif
A. Roy-Chowdhury
AAML
30
17
0
20 Sep 2022
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
165
25
0
09 Sep 2022
Prompt Tuning with Soft Context Sharing for Vision-Language Models
Kun Ding
Ying Wang
Pengzhang Liu
Qiang Yu
Hao Zhang
Shiming Xiang
Chunhong Pan
VPVLM
VLM
17
14
0
29 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
4
17
0
03 Aug 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
19
78
0
19 Jul 2022
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge J. Belongie
Sagie Benaim
VGen
DiffM
12
16
0
24 Jun 2022
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
41
518
0
13 Jun 2022
Volumetric Disentanglement for 3D Scene Manipulation
Sagie Benaim
Frederik Warburg
Peter Ebert Christensen
Serge J. Belongie
11
15
0
06 Jun 2022
CyCLIP: Cyclic Contrastive Language-Image Pretraining
Shashank Goel
Hritik Bansal
S. Bhatia
Ryan A. Rossi
Vishwa Vinay
Aditya Grover
CLIP
VLM
166
131
0
28 May 2022
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong
Mingyuan Zhang
Liang Pan
Zhongang Cai
Lei Yang
Ziwei Liu
CLIP
94
55
0
17 May 2022
Language-Grounded Indoor 3D Semantic Segmentation in the Wild
Dávid Rozenberszki
Or Litany
Angela Dai
3DV
VLM
8
182
0
16 Apr 2022
CLIP-Mesh: Generating textured meshes from text using pretrained image-text models
N. Khalid
Tianhao Xie
Eugene Belilovsky
Tiberiu Popa
CLIP
6
291
0
24 Mar 2022
MotionCLIP: Exposing Human Motion Generation to CLIP Space
Guy Tevet
Brian Gordon
Amir Hertz
Amit H. Bermano
Daniel Cohen-Or
CLIP
15
317
0
15 Mar 2022
Text and Image Guided 3D Avatar Generation and Manipulation
Zehranaz Canfes
M. Atasoy
Alara Dirik
Pinar Yanardag
3DH
24
42
0
12 Feb 2022
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas J. Guibas
Minhyuk Sung
3DPC
28
25
0
13 Dec 2021
Text2Mesh: Text-Driven Neural Stylization for Meshes
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
175
350
0
06 Dec 2021
Zero-Shot Text-Guided Object Generation with Dream Fields
Ajay Jain
B. Mildenhall
Jonathan T. Barron
Pieter Abbeel
Ben Poole
24
560
0
02 Dec 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
30
191
0
29 Nov 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
322
2,108
0
02 Sep 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
182
342
0
13 Jul 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Convolutional Occupancy Networks
Songyou Peng
Michael Niemeyer
L. Mescheder
Marc Pollefeys
Andreas Geiger
3DV
AI4CE
209
860
0
10 Mar 2020
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
210
13,886
0
02 Dec 2016
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling
Jiajun Wu
Chengkai Zhang
Tianfan Xue
Bill Freeman
J. Tenenbaum
GAN
161
1,926
0
24 Oct 2016
Previous
1
2
3
4
5