ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.09516
74
90
v1v2v3v4v5 (latest)

CatBoost: unbiased boosting with categorical features

28 June 2017
Anna Veronika Dorogush
Gleb Gusev
A. Vorobev
Nikita Kazeev
Andrey Gulin
ArXiv (abs)PDFHTML
Abstract

This paper presents the key algorithmic techniques behind CatBoost, a state-of-the-art open-source gradient boosting toolkit. Their combination leads to CatBoost outperforming other publicly available boosting implementations in terms of quality on a variety of datasets. Two critical algorithmic advances introduced in CatBoost are the implementation of ordered boosting, a permutation-driven alternative to the classic algorithm, and an innovative algorithm for processing categorical features. Both techniques were created to fight a prediction shift caused by a special kind of target leakage present in all currently existing implementations of gradient boosting algorithms. In this paper, we provide a detailed analysis of this problem and demonstrate that proposed algorithms solve it effectively, leading to excellent empirical results.

View on arXiv
Comments on this paper