Learning Multi-step Reasoning from Arithmetic Task

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

2 June 2023

Tianduo Wang

Wei Lu

ReLM

LRM

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Abstract

Mathematical reasoning is regarded as a necessary ability for Language Models (LMs). Recent works demonstrate large LMs' impressive performance in solving math problems. The success is attributed to their Chain-of-Thought (CoT) reasoning abilities, i.e., the ability to decompose complex questions into step-by-step reasoning chains, but such ability seems only to emerge from models with abundant parameters. This work investigates how to incorporate relatively small LMs with the capabilities of multi-step reasoning. We propose to inject such abilities by continually pre-training LMs on a synthetic dataset MsAT, which stands for Multi-step Arithmetic Task. Our experiments on four math word problem datasets show the effectiveness of the proposed method in enhancing LMs' math reasoning abilities.

View on arXiv

Comments on this paper