Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models. IEEE International Conference on Robotics and Automation (ICRA), 2025
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability. Computer Vision and Pattern Recognition (CVPR), 2025
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models. IEEE International Conference on Robotics and Automation (ICRA), 2024