How cost-effectively can strong reasoning abilities be achieved in language models? Driven by this fundamental question, we present Tina, a family of tiny reasoning models achieved with high cost-efficiency. Notably, Tina demonstrates that substantial reasoning performance can be developed using only minimal resources, by applying parameter-efficient updates during reinforcement learning (RL), using low-rank adaptation (LoRA), to an already tiny 1.5B parameter base model. This minimalist approach produces models that achieve reasoning performance competitive with, and sometimes surpassing, SOTA RL reasoning models built upon the same base model. Crucially, this is achieved at a tiny fraction of the computational post-training cost employed by existing SOTA models. In fact, the best Tina model achieves a >20% reasoning performance increase and 43.33% Pass@1 accuracy on AIME24, at only $9 USD post-training and evaluation cost (i.e., an estimated 260x cost reduction). Our work reveals the surprising effectiveness of efficient RL reasoning via LoRA. We validate this across multiple open-source reasoning datasets and various ablation settings, starting with a single, fixed set of hyperparameters. Furthermore, we hypothesize that this effectiveness and efficiency stem from LoRA rapidly adapting the model to the structural format of reasoning rewarded by RL, while largely preserving the base model's underlying knowledge. In service of accessibility and open research, we fully open-source all code, training logs, and model weights & checkpoints.
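To make the recipe concrete, below is a minimal sketch of LoRA-based RL post-training in the spirit of Tina, assuming HuggingFace PEFT and TRL's GRPOTrainer as the training stack. The 1.5B distilled reasoning model is used as an example base; the LoRA rank, dataset, and reward function shown are illustrative placeholders, not the paper's exact configuration.

# Minimal sketch: RL post-training with LoRA adapters on a 1.5B base model.
# Assumptions: HuggingFace `trl` and `peft`; the dataset, reward, and
# hyperparameters below are placeholders, not Tina's reported setup.
from datasets import load_dataset
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer

# LoRA updates only a small fraction of the 1.5B parameters.
peft_config = LoraConfig(
    r=16,                          # illustrative rank, not the paper's value
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

def format_reward(completions, **kwargs):
    # Toy reward: favor completions that follow a <think>...</think>
    # reasoning format. A real setup would also score answer correctness.
    return [1.0 if "<think>" in c and "</think>" in c else 0.0
            for c in completions]

# Placeholder dataset; GRPOTrainer expects a "prompt" column.
dataset = load_dataset("openai/gsm8k", "main", split="train")
dataset = dataset.map(lambda x: {"prompt": x["question"]})

trainer = GRPOTrainer(
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",  # tiny 1.5B base model
    reward_funcs=format_reward,
    args=GRPOConfig(output_dir="tina-lora-grpo"),
    train_dataset=dataset,
    peft_config=peft_config,       # routes gradient updates through the adapters
)
trainer.train()

Because only the low-rank adapter weights receive gradients, post-training cost scales with the adapter size rather than the full model, which is consistent with the cost savings and the format-adaptation hypothesis described above.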
@article{wang2025_2504.15777,
  title={Tina: Tiny Reasoning Models via LoRA},
  author={Shangshang Wang and Julian Asilis and Ömer Faruk Akgül and Enes Burak Bilgin and Ollie Liu and Willie Neiswanger},
  journal={arXiv preprint arXiv:2504.15777},
  year={2025}
}