2110.02624
Cited By
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
6 October 2021
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
CLIP
Papers citing
"CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation"
50 / 234 papers shown
Title
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
Zilong Chen
Yikai Wang
Wenqiang Sun
Feng Wang
Yiwen Chen
Huaping Liu
24
0
0
07 May 2025
Enhancing Target-unspecific Tasks through a Features Matrix
Fangming Cui
Yonggang Zhang
Xuan Wang
Xinmei Tian
Jun Yu
AAML
33
0
0
06 May 2025
mrCAD: Multimodal Refinement of Computer-aided Designs
William P. McCarthy
Saujas Vaduguru
K. Willis
Justin Matejka
Judith E. Fan
Daniel Fried
Yewen Pu
33
0
0
28 Apr 2025
Recent Advance in 3D Object and Scene Generation: A Survey
Xiang Tang
Ruotong Li
Xiaopeng Fan
75
0
0
16 Apr 2025
DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction
Sicong Pan
Liren Jin
Xuying Huang
C. Stachniss
Marija Popović
Maren Bennewitz
34
0
0
16 Apr 2025
ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting
Huiqi Wu
Jianbo Mei
Yingjie Huang
Yining Xu
Jingjiao You
Yilong Liu
Li Yao
3DGS
22
0
0
14 Apr 2025
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving
Lucas Nunes
Rodrigo Marcuzzi
Jens Behley
C. Stachniss
3DPC
76
0
0
27 Mar 2025
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
Zichen Tang
Yuan Yao
Miaomiao Cui
Liefeng Bo
Hongyu Yang
3DGS
DiffM
44
0
0
14 Mar 2025
On the Limitations of Vision-Language Models in Understanding Image Transforms
Ahmad Mustafa Anis
Hasnain Ali
Saquib Sarfraz
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
131
0
0
12 Mar 2025
M³amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification
Mingxiang Cao
Weiying Xie
Xin Zhang
Jiaqing Zhang
Kai Jiang
Jie Lei
Yunsong Li
Mamba
41
0
0
09 Mar 2025
Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion
Yuan Bian
Min Liu
Yunqi Yi
Xueping Wang
Yaonan Wang
AAML
40
0
0
27 Feb 2025
Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting
Hengyu Meng
D. B. Wang
Zhijing Shao
Ligang Liu
Z. Wang
43
1
0
27 Feb 2025
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim
Sangmin Lee
Jaegul Choo
3DGS
61
0
0
17 Feb 2025
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability
Zhiyu Zhu
Zhibo Jin
Jiayu Zhang
Nan Yang
Jiahao Huang
Jianlong Zhou
Fang Chen
34
0
0
16 Feb 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jingwei Huang
Chunchao Guo
Jie Jiang
92
19
0
21 Jan 2025
Text2Data: Low-Resource Data Generation with Textual Control
Shiyu Wang
Yihao Feng
Tian Lan
Ning Yu
Yu Bai
R. Xu
H. Wang
Caiming Xiong
S.
DiffM
80
0
0
03 Jan 2025
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Xiaokun Sun
Zeyu Cai
Zhenyu Zhang
Ying Tai
Jian Yang
69
0
0
16 Dec 2024
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
Yayuan Li
Jintao Guo
Lei Qi
Wenbin Li
Yinghuan Shi
VLM
CLIP
74
0
0
16 Dec 2024
PaintScene4D: Consistent 4D Scene Generation from Text Prompts
Vinayak Gupta
Yunze Man
Yu-Xiong Wang
VGen
79
0
0
05 Dec 2024
Fixing the Perspective: A Critical Examination of Zero-1-to-3
Jack Yu
Xueying Jia
Charlie Sun
Prince Wang
DiffM
67
0
0
24 Nov 2024
Don't Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM
Maximilian Mews
Ansar Aynetdinov
Vivian Schiller
Peter Eisert
Alan Akbik
3DV
AI4CE
94
0
0
22 Nov 2024
Identity Preserving 3D Head Stylization with Multiview Score Distillation
Bahri Batuhan Bilecen
Ahmet Berke Gokmen
Furkan Guzelant
Aysegül Dündar
65
0
0
20 Nov 2024
Towards motion from video diffusion models
Paul Janson
Tiberiu Popa
Eugene Belilovsky
DiffM
VGen
62
0
0
19 Nov 2024
Wavelet Latent Diffusion (WaLa): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings
Aditya Sanghi
Aliasghar Khani
Pradyumna Reddy
Arianna Rampini
Derek Cheung
Kamal Rahimi Malekshan
Kanika Madan
Hooman Shayani
29
3
0
12 Nov 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
W. Cheng
Juncheng Mu
Xianfang Zeng
Xin Chen
Anqi Pang
...
Zhibin Wang
Bin-Bin Fu
Gang Yu
Z. Liu
Liang Pan
34
8
0
04 Nov 2024
EEG-Driven 3D Object Reconstruction with Style Consistency and Diffusion Prior
Xin Xiang
Wenhui Zhou
Guojun Dai
DiffM
21
0
0
28 Oct 2024
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrieval
Bin Kang
Bin Chen
J. T. Wang
Yong Xu
14
0
0
26 Oct 2024
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection
Andrea Appiani
Cigdem Beyan
CLIP
VLM
13
0
0
18 Oct 2024
DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model
Jingxiang Sun
Cheng Peng
Ruizhi Shao
Y. Guo
Xiaochen Zhao
Yangguang Li
Yanpei Cao
Bo Zhang
Yebin Liu
24
2
0
16 Oct 2024
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models
Rohit Jena
Ali Taghibakhshi
Sahil Jain
Gerald Shen
Nima Tajbakhsh
Arash Vahdat
33
3
0
09 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
16
0
0
01 Sep 2024
GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars
Keqiang Sun
Amin Jourabloo
Riddhish Bhalodia
Moustafa Meshry
Yu Rong
...
Christian Haene
Jiu Xu
Sam Johnson
Hongsheng Li
Sofien Bouaziz
DiffM
29
0
0
24 Aug 2024
HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model
Yi Wang
Jian Ma
Ruizhi Shao
Qiao Feng
Yu-Kun Lai
Kun Li
19
5
0
21 Aug 2024
Barbie: Text to Barbie-Style 3D Avatars
Xiaokun Sun
Zhenyu Zhang
Ying Tai
Qian Wang
Hao Tang
Zili Yi
Jian Yang
LM&Ro
36
2
0
17 Aug 2024
Localized Gaussian Splatting Editing with Contextual Awareness
Hanyuan Xiao
Yingshu Chen
Huajian Huang
Haolin Xiong
Jing Yang
P. Prasad
Yajie Zhao
3DGS
DiffM
20
4
0
31 Jul 2024
Advancing Prompt Learning through an External Layer
Fangming Cui
Xun Yang
Chao Wu
Liang Xiao
Xinmei Tian
VLM
29
1
0
29 Jul 2024
Magic3DSketch: Create Colorful 3D Models From Sketch-Based 3D Modeling Guided by Text and Language-Image Pre-Training
Ying-Dong Zang
Yidong Han
Chao Ding
Jianqi Zhang
Tianrun Chen
DiffM
44
2
0
27 Jul 2024
HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation
Zezeng Li
Weimin Wang
WenHai Li
Na Lei
Xianfeng Gu
OT
DiffM
20
0
0
19 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
67
41
0
17 Jul 2024
VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Wenjie Zhuo
Fan Ma
Hehe Fan
Yi Yang
DiffM
32
8
0
13 Jul 2024
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu
Chaohui Yu
Chenjie Cao
Wen Qian
Fan Wang
DiffM
24
3
0
05 Jul 2024
Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing
Anushrut Jignasu
Kelly O. Marshall
Ankush Kumar Mishra
Lucas Nerone Rillo
Baskar Ganapathysubramanian
Aditya Balu
Chinmay Hegde
Adarsh Krishnamurthy
24
0
0
04 Jul 2024
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality
Taoran Yi
Jiemin Fang
Zanwei Zhou
Junjie Wang
Guanjun Wu
Lingxi Xie
Xiaopeng Zhang
Wenyu Liu
Xinggang Wang
Qi Tian
3DGS
31
8
0
26 Jun 2024
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao
Junshu Tang
Jiangning Zhang
Ran Yi
Yijia Hong
Moran Li
Weijian Cao
Yating Wang
Lizhuang Ma
DiffM
33
0
0
24 Jun 2024
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev
Nina Konovalova
Daniil Selikhanovych
Nikolay Patakin
...
Anton Konushin
Peter Wonka
Alexander Filippov
Evgeny Burnaev
DiffM
49
0
0
21 Jun 2024
GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors
Xiqian Yu
Hanxin Zhu
Tianyu He
Zhibo Chen
3DGS
DiffM
23
2
0
14 Jun 2024
VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Jianmeng Liu
Yichen Liu
Yuyao Zhang
Zeyuan Meng
Yu-Wing Tai
Chi-Keung Tang
36
0
0
08 Jun 2024
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
F. Babiloni
Alexandros Lattas
Jiankang Deng
S. Zafeiriou
DiffM
25
4
0
26 May 2024
Challenges and Opportunities in 3D Content Generation
Ke Zhao
Andreas Larsen
17
0
0
24 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
11
0
16 May 2024