ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.03970
96
0

Is Deep Learning finally better than Decision Trees on Tabular Data?

17 February 2025
Guri Zabergja
Arlind Kadra
Christian M. M. Frey
Josif Grabocka
    LMTD
ArXivPDFHTML
Abstract

Tabular data is a ubiquitous data modality due to its versatility and ease of use in many real-world applications. The predominant heuristics for handling classification tasks on tabular data rely on classical machine learning techniques, as the superiority of deep learning models has not yet been demonstrated. This raises the question of whether new deep learning paradigms can surpass classical approaches. Recent studies on tabular data offer a unique perspective on the limitations of neural networks in this domain and highlight the superiority of gradient boosted decision trees (GBDTs) in terms of scalability and robustness across various datasets. However, novel foundation models have not been thoroughly assessed regarding quality or fairly compared to existing methods for tabular classification. Our study categorizes ten state-of-the-art neural models based on their underlying learning paradigm, demonstrating specifically that meta-learned foundation models outperform GBDTs in small data regimes. Although dataset-specific neural networks generally outperform LLM-based tabular classifiers, they are surpassed by an AutoML library which exhibits the best performance but at the cost of higher computational demands.

View on arXiv
@article{zabërgja2025_2402.03970,
  title={ Is Deep Learning finally better than Decision Trees on Tabular Data? },
  author={ Guri Zabërgja and Arlind Kadra and Christian M. M. Frey and Josif Grabocka },
  journal={arXiv preprint arXiv:2402.03970},
  year={ 2025 }
}
Comments on this paper