Meta-Reasoning Improves Tool Use in Large Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

7 November 2024

Lisa Alazraki

Marek Rei

LRM

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Main:9 Pages

3 Figures

Bibliography:1 Pages

7 Tables

Appendix:3 Pages

Abstract

External tools help large language models (LLMs) succeed at tasks where they would otherwise typically fail. In existing frameworks, LLMs learn tool use either by in-context demonstrations or via full model fine-tuning on annotated data. As these approaches do not easily scale, a recent trend is to abandon them in favor of lightweight, parameter-efficient tuning paradigms. These methods allow quickly alternating between the frozen LLM and its specialised fine-tuned version, by switching on or off a handful of additional custom parameters. Hence, we postulate that the generalization ability of the frozen model can be leveraged to improve tool selection. We present Tool selECTion via meta-reasONing (TECTON), a two-phase system that first reasons over a task using a custom fine-tuned LM head and outputs candidate tools. Then, with the custom head disabled, it meta-reasons (i.e., it reasons over the previous reasoning process) to make a final choice. We show that TECTON results in substantial gains - both in-distribution and out-of-distribution - on a range of math reasoning datasets.

View on arXiv

Comments on this paper