AmbiK: Dataset of Ambiguous Tasks in Kitchen EnvironmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Repairs in a Block World: A New Benchmark for Handling User Corrections
with Multi-Modal Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |