BIG-Bench Extra HardAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall SpacesComputer Vision and Pattern Recognition (CVPR), 2024 |
Evaluating Vision-Language Models as Evaluators in Path PlanningComputer Vision and Pattern Recognition (CVPR), 2024 |
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for RoboticsComputer Vision and Pattern Recognition (CVPR), 2024 |
BALROG: Benchmarking Agentic LLM and VLM Reasoning On GamesInternational Conference on Learning Representations (ICLR), 2024 |
Scaffolding Language Learning via Multi-modal Tutoring Systems with
Pedagogical InstructionsConference on Algebraic Informatics (CAI), 2024 |