3D Affordance Keypoint Detection for Robotic Manipulation

27 November 2025

Zhiyang Liu

Ruiteng Zhao

Lei Zhou

Chengran Yuan

Yuwei Wu

Sheng Guo

Zhengshen Zhang

Chenchen Liu

Marcelo H Ang Jr

3DPC

ArXiv (abs)PDF HTML

Main:6 Pages

5 Figures

Bibliography:1 Pages

Abstract

This paper presents a novel approach for affordance-informed robotic manipulation by introducing 3D keypoints to enhance the understanding of object parts' functionality. The proposed approach provides direct information about what the potential use of objects is, as well as guidance on where and how a manipulator should engage, whereas conventional methods treat affordance detection as a semantic segmentation task, focusing solely on answering the what question. To address this gap, we propose a Fusion-based Affordance Keypoint Network (FAKP-Net) by introducing 3D keypoint quadruplet that harnesses the synergistic potential of RGB and Depth image to provide information on execution position, direction, and extent. Benchmark testing demonstrates that FAKP-Net outperforms existing models by significant margins in affordance segmentation task and keypoint detection task. Real-world experiments also showcase the reliability of our method in accomplishing manipulation tasks with previously unseen objects.

View on arXiv

Comments on this paper