ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

IEEE Transactions on Multimedia (IEEE TMM), 2023
29 November 2023
Fukun Yin
Xin Chen
C. Zhang
Biao Jiang
Zibo Zhao
Jiayuan Fan
Gang Yu
Taihao Li
Tao Chen

Papers citing "ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model"

23 / 23 papers shown
LATTICE: Democratize High-Fidelity 3D Generation at Scale
Zeqiang Lai
Yunfei Zhao
Zibo Zhao
Haolin Liu
Qingxiang Lin
Jingwei Huang
Chunchao Guo
Xiangyu Yue
68
3
0
24 Nov 2025
Ref-SAM3D: Bridging SAM3D with Text for Reference 3D Reconstruction
Yun Zhou
Yaoting Wang
Guangquan Jie
Jinyu Liu
Henghui Ding
83
0
0
24 Nov 2025
SpatialGeo: Boosting Spatial Reasoning in Multimodal LLMs via Geometry-Semantics Fusion
Jiajie Guo
Qingpeng Zhu
Jin Zeng
Xiaolong Wu
Changyong He
Weida Wang
LRM
244
0
0
21 Nov 2025
Spatial Reasoning in Multimodal Large Language Models: A Survey of Tasks, Benchmarks and Methods
Weichen Liu
Qiyao Xue
Haoming Wang
Xiangyu Yin
Boyuan Yang
Wei Gao
122
3
0
14 Nov 2025
3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs
Noor Ahmed
Cameron Braunstein
Steffen Eger
Eddy Ilg
116
1
0
12 Aug 2025
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
Shuangkang Fang
I-Chao Shen
Yufeng Wang
Yi-Hsuan Tsai
Y. Yang
Shuchang Zhou
Wenrui Ding
Takeo Igarashi
M. Yang
AI4CE
267
5
0
02 Aug 2025
BANG: Dividing 3D Assets via Generative Exploded Dynamics
ACM Transactions on Graphics (TOG), 2025
Longwen Zhang
Qixuan Zhang
Haoran Jiang
Yinuo Bai
Wei Yang
Lan Xu
Jingyi Yu
228
15
0
29 Jul 2025
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details
Zeqiang Lai
Yunfei Zhao
Haolin Liu
Zibo Zhao
Qingxiang Lin
...
Yuhong Liu
Jie Jiang
Linus
J. Huang
Chunchao Guo
227
42
0
19 Jun 2025
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Yiying Yang
Wei Cheng
Sijin Chen
Xianfang Zeng
Jiaxu Zhang
Liao Wang
Gang Yu
Jiabo He
Xingjun Ma
Yu Jiang
VLM
551
27
0
08 Apr 2025
Distilling Multi-view Diffusion Models into 3D Generators
Hao Qin
Luyuan Chen
Ming Kong
Mengxu Lu
Qiang Zhu
3DGS
554
2
0
01 Apr 2025
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding
Jiahe Zhao
Ruibing Hou
Zejie Tian
Hong Chang
Shiguang Shan
402
2
0
17 Mar 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jingwei Huang
Chunchao Guo
Jie Jiang
778
207
0
21 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Jiayi Zhang
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
475
34
0
06 Jan 2025
LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression
Yuqi Ye
Wei Gao
202
2
0
16 Aug 2024
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE
Yiying Yang
Fukun Yin
Jiayuan Fan
Xin Chen
Wanzhang Li
Gang Yu
VGen
270
4
0
10 Aug 2024
Component Selection for Craft Assembly Tasks
V. H. Isume
Takuya Kiyokawa
N. Yamanobe
Y. Domae
Weiwei Wan
Kensuke Harada
275
1
0
19 Jul 2024
Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing
Anushrut Jignasu
Kelly O. Marshall
Ankush Kumar Mishra
Lucas Nerone Rillo
Baskar Ganapathysubramanian
Aditya Balu
Chinmay Hegde
Adarsh Krishnamurthy
269
3
0
04 Jul 2024
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Sandeep Mishra
Oindrila Saha
A. Bovik
222
0
0
24 Jun 2024
VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Jianmeng Liu
Yichen Liu
Yuyao Zhang
Zeyuan Meng
Yu-Wing Tai
Chi-Keung Tang
262
0
0
08 Jun 2024
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models
Sijin Chen
Xin Chen
Anqi Pang
Xianfang Zeng
Wei Cheng
...
C. Zhang
Jingyi Yu
Gang Yu
Bin-Bin Fu
Tao Chen
AI4CE
322
83
0
31 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
409
33
0
16 May 2024
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
European Conference on Computer Vision (ECCV), 2024
Kailin Li
Jingbo Wang
Lixin Yang
Cewu Lu
Bo Dai
263
34
0
04 Apr 2024
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Computer Vision and Pattern Recognition (CVPR), 2023
Sijin Chen
Xin Chen
C. Zhang
Mingsheng Li
Gang Yu
Hao Fei
Erik Cambria
Jiayuan Fan
Tao Chen
MLLM
365
184
0
30 Nov 2023