39
0

MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps

Main:4 Pages
3 Figures
Bibliography:2 Pages
8 Tables
Appendix:1 Pages
Abstract

In this paper we expose our approach to solve the \textit{SemEval 2025 Task 8: Question-Answering over Tabular Data} challenge. Our strategy leverages Python code generation with LLMs to interact with the table and get the answer to the questions. The process is composed of multiple steps: understanding the content of the table, generating natural language instructions in the form of steps to follow in order to get the answer, translating these instructions to code, running it and handling potential errors or exceptions. These steps use open source LLMs and fine grained optimized prompts for each task (step). With this approach, we achieved a score of 70.50%70.50\% for subtask 1.

View on arXiv
@article{lagos2025_2505.22264,
  title={ MRT at SemEval-2025 Task 8: Maximizing Recovery from Tables with Multiple Steps },
  author={ Maximiliano Hormazábal Lagos and Álvaro Bueno Saez and Héctor Cerezo-Costas and Pedro Alonso Doval and Jorge Alcalde Vesteiro },
  journal={arXiv preprint arXiv:2505.22264},
  year={ 2025 }
}
Comments on this paper