v1v2 (latest)

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

6 October 2023

Wanyun Cui

Qianle Wang

LRM

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)

Abstract

Generating diverse and sophisticated instructions for downstream tasks by Large Language Models (LLMs) is pivotal for advancing the effect. Current approaches leverage closed-source LLMs, employing in-context prompting for instruction generation. However, in this paper, we found that in-context prompting cannot generate complex instructions with length $\ge 100$ for tasks like code completion. To solve this problem, we introduce Ada-Instruct, an adaptive instruction generator developed by fine-tuning open-source LLMs. Our pivotal finding illustrates that fine-tuning open-source LLMs with a mere ten samples generates long instructions that maintain distributional consistency for complex reasoning tasks. We empirically validated Ada-Instruct's efficacy across different applications, including code completion, mathematical reasoning, and commonsense reasoning. The results underscore Ada-Instruct's superiority, evidencing its improvements over its base models, current self-instruct methods, and other state-of-the-art models.

View on arXiv

Comments on this paper