Agentic AI for Robot Control: Flexible but still Fragile

13 February 2026

Oscar Lima

Marc Vinci

Martin Günther

Marian Renz

Alexander Sung

Sebastian Stock

Johannes Brust

Lennart Niecksch

Zongyao Yi

Felix Igelbrink

Benjamin Kisliuk

Martin Atzmueller

Joachim Hertzberg

LM&Ro

ArXiv (abs)PDF HTML Github

Main:3 Pages

2 Figures

1 Tables

Appendix:6 Pages

Abstract

Recent work leverages the capabilities and commonsense priors of generative models for robot control. In this paper, we present an agentic control system in which a reasoning-capable language model plans and executes tasks by selecting and invoking robot skills within an iterative planner and executor loop. We deploy the system on two physical robot platforms in two settings: (i) tabletop grasping, placement, and box insertion in indoor mobile manipulation (Mobipick) and (ii) autonomous agricultural navigation and sensing (Valdemar). Both settings involve uncertainty, partial observability, sensor noise, and ambiguous natural-language commands. The system exposes structured introspection of its planning and decision process, reacts to exogenous events via explicit event checks, and supports operator interventions that modify or redirect ongoing execution. Across both platforms, our proof-of-concept experiments reveal substantial fragility, including non-deterministic suboptimal behavior, instruction-following errors, and high sensitivity to prompt specification. At the same time, the architecture is flexible: transfer to a different robot and task domain largely required updating the system prompt (domain model, affordances, and action catalogue) and re-binding the same tool interface to the platform-specific skill API.

View on arXiv

Comments on this paper