ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.16217
  4. Cited By
ManipLLM: Embodied Multimodal Large Language Model for Object-Centric
  Robotic Manipulation

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

24 December 2023
Xiaoqi Li
Mingxu Zhang
Yiran Geng
Haoran Geng
Yuxing Long
Yan Shen
Renrui Zhang
Jiaming Liu
Hao Dong
    LM&Ro
    LRM
ArXivPDFHTML

Papers citing "ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation"

50 / 67 papers shown
Title
CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
Xiaoqi Li
Lingyun Xu
M. Zhang
Jiaming Liu
Yan Shen
...
Jiahui Xu
Liang Heng
Siyuan Huang
S. Zhang
Hao Dong
LM&Ro
31
0
0
04 May 2025
3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment
3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment
X. Li
J. H. Liu
Nuowei Han
Liang Heng
Y. Guo
Hao Dong
Yang Liu
39
0
0
03 May 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Y. Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
73
2
0
26 Apr 2025
Few-Shot Vision-Language Action-Incremental Policy Learning
Few-Shot Vision-Language Action-Incremental Policy Learning
Mingchen Song
Xiang Deng
Guoqiang Zhong
Qi Lv
Jia Wan
Yinchuan Li
Jianye Hao
Weili Guan
20
0
0
22 Apr 2025
Joint Action Language Modelling for Transparent Policy Execution
Joint Action Language Modelling for Transparent Policy Execution
Theodor Wulff
R. S. Maharjan
Xinyun Chi
Angelo Cangelosi
22
0
0
14 Apr 2025
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Jianhua Sun
Jiude Wei
Y. Li
Cewu Lu
LM&Ro
54
1
0
30 Mar 2025
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
Ziyu Guo
Young Yoon Lee
Joseph Liu
Yizhak Ben-Shabat
Victor Zordan
Mubbasir Kapadia
DiffM
VGen
66
0
0
27 Mar 2025
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Kexian Tang
Junyao Gao
Yanhong Zeng
Haodong Duan
Yanan Sun
Zhening Xing
Wenran Liu
Kaifeng Lyu
Kai-xiang Chen
ELM
LRM
48
1
0
25 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
58
0
0
19 Mar 2025
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?
Xinyan Chen
Jiaxin Ge
Hongming Dai
Qiang Zhou
Qiuxuan Feng
Jingtong Hu
Y. Wang
Jiaming Liu
Shanghang Zhang
LM&Ro
60
0
0
19 Mar 2025
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu
Z. Wang
C. Zhang
Peng Li
Yang Liu
CoGe
VLM
63
0
0
18 Mar 2025
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
Zhenyu Wu
Yuheng Zhou
Xiuwei Xu
Z. Wang
Haibin Yan
41
2
0
17 Mar 2025
Dense Policy: Bidirectional Autoregressive Learning of Actions
Dense Policy: Bidirectional Autoregressive Learning of Actions
Yue Su
Xinyu Zhan
Hongjie Fang
Han Xue
Hao-Shu Fang
Y. Li
Cewu Lu
Lixin Yang
VGen
44
2
0
17 Mar 2025
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Y. Wang
Shengqiong Wu
Y. Zhang
William Yang Wang
Ziwei Liu
Jiebo Luo
Hao Fei
LRM
69
7
0
16 Mar 2025
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu
Hao Chen
Pengju An
Zhuoyang Liu
Renrui Zhang
...
Chengkai Hou
Mengdi Zhao
KC alex Zhou
Pheng-Ann Heng
S. Zhang
58
5
0
13 Mar 2025
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
Jian-Jian Jiang
Xiao-Ming Wu
Yi-Xiang He
Ling-an Zeng
Yi-Lin Wei
Dandan Zhang
Wei-Shi Zheng
35
2
0
13 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
53
0
0
11 Mar 2025
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
Junlong Ren
Hao Wu
Hui Xiong
H. Wang
60
0
0
26 Feb 2025
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Zekun Qi
Wenyao Zhang
Yufei Ding
Runpei Dong
Xinqiang Yu
...
Xin Jin
Kaisheng Ma
Zhizheng Zhang
He Wang
Li Yi
LM&Ro
125
3
0
18 Feb 2025
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation
Kun Wu
Chengkai Hou
Jiaming Liu
Zhengping Che
Xiaozhu Ju
...
Zhenyu Wang
Pengju An
Siyuan Qian
S. Zhang
Jian Tang
LM&Ro
103
15
0
17 Feb 2025
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning
Yuhang Dong
Haizhou Ge
Yupei Zeng
J. Zhang
Beiwen Tian
...
Yufei Jia
Ruixiang Wang
Ran Yi
Guyue Zhou
Longhua Ma
48
0
0
11 Feb 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
83
10
0
06 Jan 2025
Diving into Self-Evolving Training for Multimodal Reasoning
Diving into Self-Evolving Training for Multimodal Reasoning
Wei Liu
Junlong Li
Xiwen Zhang
Fan Zhou
Yu Cheng
Junxian He
ReLM
LRM
32
3
0
23 Dec 2024
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
Yawen Shao
Wei-dong Zhai
Yuhang Yang
Hongchen Luo
Yang Cao
Zheng-jun Zha
83
1
0
29 Nov 2024
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World
Weixin Mao
Weiheng Zhong
Zhou Jiang
Dong Fang
Zhongyue Zhang
...
Fan Jia
Tiancai Wang
Haoqiang Fan
Osamu Yoshie
Osamu Yoshie
114
4
0
29 Nov 2024
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for
  Robust 3D Robotic Manipulation
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Yueru Jia
Jiaming Liu
Sixiang Chen
Chenyang Gu
Z. Wang
...
Lily Lee
Pengwei Wang
Zhongyuan Wang
Renrui Zhang
Shanghang Zhang
79
11
0
27 Nov 2024
EfficientEQA: An Efficient Approach for Open Vocabulary Embodied
  Question Answering
EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Kai Cheng
Zhengyuan Li
Xingpeng Sun
Byung-Cheol Min
Amrit Singh Bedi
Aniket Bera
27
2
0
26 Oct 2024
WorldSimBench: Towards Video Generation Models as World Simulators
WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin
Zhelun Shi
Jiwen Yu
Xijun Wang
Enshen Zhou
...
Lu Sheng
Jing Shao
Lei Bai
Wanli Ouyang
Ruimao Zhang
EGVM
VGen
113
364
0
23 Oct 2024
Visual-Geometric Collaborative Guidance for Affordance Learning
Visual-Geometric Collaborative Guidance for Affordance Learning
Hongchen Luo
Wei-dong Zhai
J. Wang
Yang Cao
Zheng-jun Zha
15
0
0
15 Oct 2024
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic
  Manipulation
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation
K. Zhang
Pengzhen Ren
Bingqian Lin
Junfan Lin
Shikui Ma
Hang Xu
Xiaodan Liang
11
0
0
14 Oct 2024
SELU: Self-Learning Embodied MLLMs in Unknown Environments
SELU: Self-Learning Embodied MLLMs in Unknown Environments
Boyu Li
Haobin Jiang
Ziluo Ding
Xinrun Xu
Haoran Li
Dongbin Zhao
Zongqing Lu
LRM
19
2
0
04 Oct 2024
Autoregressive Action Sequence Learning for Robotic Manipulation
Autoregressive Action Sequence Learning for Robotic Manipulation
Xinyu Zhang
Yuhan Liu
Haonan Chang
Liam Schramm
Abdeslam Boularias
23
6
0
04 Oct 2024
Open-World Reinforcement Learning over Long Short-Term Imagination
Open-World Reinforcement Learning over Long Short-Term Imagination
Jiajian Li
Q. Wang
Yunbo Wang
Xin Jin
Yang Li
Wenjun Zeng
Xiaokang Yang
OCL
VLM
42
1
0
04 Oct 2024
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Qiaojun Yu
Siyuan Huang
Xibin Yuan
Zhengkai Jiang
Ce Hao
...
Junbo Wang
Liu Liu
Hongsheng Li
Peng Gao
Cewu Lu
55
3
0
30 Sep 2024
AIR-Embodied: An Efficient Active 3DGS-based Interaction and
  Reconstruction Framework with Embodied Large Language Model
AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model
Zhenghao Qi
Shenghai Yuan
Fen Liu
Haozhi Cao
Tianchen Deng
Jianfei Yang
Lihua Xie
LM&Ro
DiffM
37
3
0
24 Sep 2024
Robot Manipulation in Salient Vision through Referring Image
  Segmentation and Geometric Constraints
Robot Manipulation in Salient Vision through Referring Image Segmentation and Geometric Constraints
Chen Jiang
Allie Luo
Martin Jägersand
13
0
0
17 Sep 2024
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for
  Robotic Manipulation
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Wenlong Huang
Chen Wang
Y. Li
Ruohan Zhang
Li Fei-Fei
34
81
0
03 Sep 2024
PhysPart: Physically Plausible Part Completion for Interactable Objects
PhysPart: Physically Plausible Part Completion for Interactable Objects
Rundong Luo
Haoran Geng
Congyue Deng
Puhao Li
Zan Wang
Baoxiong Jia
Leonidas J. Guibas
Siyuan Huang
19
6
0
25 Aug 2024
MAVIS: Mathematical Visual Instruction Tuning
MAVIS: Mathematical Visual Instruction Tuning
Renrui Zhang
Xinyu Wei
Dongzhi Jiang
Yichi Zhang
Ziyu Guo
...
Aojun Zhou
Bin Wei
Shanghang Zhang
Peng Gao
Hongsheng Li
MLLM
22
24
0
11 Jul 2024
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot
  Robotic Manipulation
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation
Yuxuan Kuang
Junjie Ye
Haoran Geng
Jiageng Mao
Congyue Deng
Leonidas J. Guibas
He Wang
Yue Wang
LM&Ro
24
20
0
05 Jul 2024
Human-centered In-building Embodied Delivery Benchmark
Human-centered In-building Embodied Delivery Benchmark
Zhuoqun Xu
Yang Liu
Xiaoqi Li
Jiyao Zhang
Hao Dong
25
0
0
25 Jun 2024
SpatialBot: Precise Spatial Understanding with Vision Language Models
SpatialBot: Precise Spatial Understanding with Vision Language Models
Wenxiao Cai
Yaroslav Ponomarenko
Jianhao Yuan
Xiaoqi Li
Wankou Yang
Hao Dong
Bo-Lu Zhao
VLM
32
24
0
19 Jun 2024
AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic
  Manipulation
AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation
Chuyan Xiong
Chengyu Shen
Xiaoqi Li
Kaichen Zhou
Jiaming Liu
Ruiping Wang
Hao Dong
LRM
30
10
0
17 Jun 2024
Language-Guided Manipulation with Diffusion Policies and Constrained
  Inpainting
Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting
Ce Hao
Kelvin Lin
Siyuan Luo
Harold Soh
28
4
0
14 Jun 2024
A3VLM: Actionable Articulation-Aware Vision Language Model
A3VLM: Actionable Articulation-Aware Vision Language Model
Siyuan Huang
Haonan Chang
Yuhan Liu
Yimeng Zhu
Hao Dong
Peng Gao
Abdeslam Boularias
Hongsheng Li
31
6
0
11 Jun 2024
RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning
  and Manipulation
RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation
Jiaming Liu
Mengzhen Liu
Zhenyu Wang
Lily Lee
Kaichen Zhou
Pengju An
Senqiao Yang
Renrui Zhang
Yandong Guo
Shanghang Zhang
LM&Ro
LRM
Mamba
27
5
0
06 Jun 2024
PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle
  Motion Planning
PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Yupeng Zheng
Zebin Xing
Qichao Zhang
Bu Jin
Pengfei Li
...
Zhongpu Xia
Kun Zhan
Xianpeng Lang
Yaran Chen
Dongbin Zhao
LM&Ro
LRM
LLMAG
33
14
0
03 Jun 2024
Learning Manipulation by Predicting Interaction
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
45
19
0
01 Jun 2024
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
Weitai Kang
Mengxue Qu
Jyoti Kini
Yunchao Wei
Mubarak Shah
Yan Yan
LM&Ro
3DPC
28
9
0
28 May 2024
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for
  Continual Test Time Adaptation
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Rongyu Zhang
Aosong Cheng
Yulin Luo
Gaole Dai
Huanrui Yang
...
Ran Xu
Li Du
Yuan Du
Yanbing Jiang
Shanghang Zhang
MoE
TTA
29
6
0
26 May 2024
12
Next