Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.16217
Cited By
ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
24 December 2023
Xiaoqi Li
Mingxu Zhang
Yiran Geng
Haoran Geng
Yuxing Long
Yan Shen
Renrui Zhang
Jiaming Liu
Hao Dong
LM&Ro
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation"
17 / 67 papers shown
Title
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
11
0
16 May 2024
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Yiwen Tang
Ray Zhang
Jiaming Liu
Zoey Guo
Dong Wang
...
Bin Zhao
Shanghang Zhang
Peng Gao
Hongsheng Li
Xuelong Li
25
9
0
11 Apr 2024
Unified Language-driven Zero-shot Domain Adaptation
Senqiao Yang
Zhuotao Tian
Li Jiang
Jiaya Jia
23
7
0
10 Apr 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Jing Shao
Cewu Lu
Cewu Lu
31
5
0
28 Mar 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Peng Gao
Hongsheng Li
Hao Dong
LM&Ro
32
16
0
17 Mar 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
33
49
0
27 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
37
20
0
25 Feb 2024
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu
Ran Xu
Senqiao Yang
Renrui Zhang
Qizhe Zhang
Zehui Chen
Yandong Guo
Shanghang Zhang
TTA
22
10
0
19 Dec 2023
Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation
Jiayin Ni
Senqiao Yang
Ran Xu
Jiaming Liu
Xiaoqi Li
Wenyu Jiao
Zehui Chen
Yi Liu
Shanghang Zhang
TTA
14
7
0
24 Sep 2023
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu
Senqiao Yang
Peidong Jia
Renrui Zhang
Ming Lu
Yandong Guo
Wei Xue
Shanghang Zhang
TTA
OOD
VLM
15
36
0
07 Jun 2023
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy
Yinzhen Xu
Weikang Wan
Jialiang Zhang
Haoran Liu
Zikang Shan
...
Yijia Weng
Jiayi Chen
Tengyu Liu
Li Yi
He-Nan Wang
62
114
0
02 Mar 2023
End-to-End Affordance Learning for Robotic Manipulation
Yiran Geng
Boshi An
Haoran Geng
Yuanpei Chen
Yaodong Yang
Hao Dong
54
59
0
26 Sep 2022
Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer
Jiaming Liu
Qizhe Zhang
Jianing Li
Ming Lu
Tiejun Huang
Shanghang Zhang
8
10
0
26 Aug 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
SAPIEN: A SimulAted Part-based Interactive ENvironment
Fanbo Xiang
Yuzhe Qin
Kaichun Mo
Yikuan Xia
Hao Zhu
...
He-Nan Wang
Li Yi
Angel X. Chang
Leonidas J. Guibas
Hao Su
195
482
0
19 Mar 2020
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
136
1,458
0
06 Jun 2016
Previous
1
2