228

Eliciting Reasoning in Language Models with Cognitive Tools

Main:9 Pages
3 Figures
Bibliography:3 Pages
8 Tables
Appendix:13 Pages
Abstract

The recent advent of reasoning models like OpenAI's o1 was met with excited speculation by the AI community about the mechanisms underlying these capabilities in closed models, followed by a rush of replication efforts, particularly from the open source community. These speculations were largely settled by the demonstration from DeepSeek-R1 that chains-of-thought and reinforcement learning (RL) can effectively replicate reasoning on top of base LLMs. However, it remains valuable to explore alternative methods for theoretically eliciting reasoning that could help elucidate the underlying mechanisms, as well as providing additional methods that may offer complementary benefits.

View on arXiv
Comments on this paper