RDBLearn: Simple In-Context Prediction Over Relational Databases

14 February 2026

Yanlin Zhang

Linjie Xu

Quan Gan

David Wipf

Minjie Wang



Main: 9 pages; 7 figures; Bibliography: 1 page; 8 tables; Appendix: 3 pages

Abstract

Recent advances in tabular in-context learning (ICL) show that a single pretrained model can adapt to new prediction tasks from a small set of labeled examples, avoiding per-task training and heavy tuning. However, many real-world tasks live in relational databases, where predictive signal is spread across multiple linked tables rather than a single flat table. We show that tabular ICL can be extended to relational prediction with a simple recipe: automatically featurize each target row using relational aggregations over its linked records, materialize the resulting augmented table, and run an off-the-shelf tabular foundation model on it. We package this approach in RDBLearn (this https URL), an easy-to-use toolkit with a scikit-learn-style estimator interface that makes it straightforward to swap different tabular ICL backends; a complementary agent-specific interface is provided as well. Across a broad collection of RelBench and 4DBInfer datasets, RDBLearn is the best-performing foundation model approach we evaluate, at times even outperforming strong supervised baselines trained or fine-tuned on each dataset.
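The three-step recipe described in the abstract (aggregate linked records, materialize an augmented table, hand it to a fit/predict-style model) can be illustrated with a small dependency-free sketch. This is not RDBLearn's API: the helper names `aggregate_children` and `materialize`, the toy `customers` and `orders` tables, the choice of count/mean/max aggregations, and the `NearestNeighborStandIn` class are all hypothetical. The nearest-neighbor class only stands in for a tabular ICL backend to show the estimator-style interface; a real tabular foundation model would be swapped in at that point.

```python
from collections import defaultdict
from statistics import mean


def aggregate_children(child_rows, fk, value_col):
    """Group child rows by foreign key; compute count/mean/max of one numeric column."""
    groups = defaultdict(list)
    for row in child_rows:
        groups[row[fk]].append(row[value_col])
    return {
        key: {
            f"{value_col}_count": len(vals),
            f"{value_col}_mean": mean(vals),
            f"{value_col}_max": max(vals),
        }
        for key, vals in groups.items()
    }


def materialize(target_rows, pk, aggregates, value_col):
    """Left-join the aggregates onto the target rows; rows without children get zeros."""
    empty = {
        f"{value_col}_count": 0,
        f"{value_col}_mean": 0.0,
        f"{value_col}_max": 0.0,
    }
    return [{**row, **aggregates.get(row[pk], empty)} for row in target_rows]


class NearestNeighborStandIn:
    """Hypothetical placeholder for a tabular ICL backend (fit stores the context rows)."""

    def fit(self, X, y):
        self.X_, self.y_ = list(X), list(y)
        return self

    def predict(self, X):
        def sq_dist(a, b):
            return sum((u - v) ** 2 for u, v in zip(a, b))

        return [
            self.y_[min(range(len(self.X_)), key=lambda i: sq_dist(x, self.X_[i]))]
            for x in X
        ]


# Toy relational database: a target table and one linked child table (assumed data).
customers = [
    {"customer_id": 1, "age": 30},
    {"customer_id": 2, "age": 45},
    {"customer_id": 3, "age": 31},
    {"customer_id": 4, "age": 44},  # has no orders
]
orders = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 1, "amount": 40.0},
    {"customer_id": 2, "amount": 5.0},
    {"customer_id": 3, "amount": 35.0},
    {"customer_id": 3, "amount": 25.0},
]

# Step 1: relational aggregation over linked records.
aggs = aggregate_children(orders, "customer_id", "amount")
# Step 2: materialize the augmented single table.
table = materialize(customers, "customer_id", aggs, "amount")
# Step 3: run a fit/predict model on the flat features.
feature_cols = ["age", "amount_count", "amount_mean", "amount_max"]
X = [[row[c] for c in feature_cols] for row in table]
context_labels = [0, 1]  # labels for the first two customers (the in-context examples)
model = NearestNeighborStandIn().fit(X[:2], context_labels)
preds = model.predict(X[2:])  # predictions for customers 3 and 4
```

Keeping the featurization separate from the model is what makes the backend swappable in this sketch: any object exposing `fit` and `predict` can replace the stand-in without touching the aggregation step.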
