ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.02756
67
8
v1v2v3v4 (latest)

Pointwise adaptation via stagewise aggregation of local estimates for multiclass classification

8 April 2018
Nikita Puchkin
V. Spokoiny
ArXiv (abs)PDFHTML
Abstract

We consider a problem of multiclass classification, where the training sample Sn={(Xi,Yi)}i=1nS_n = \{(X_i, Y_i)\}_{i=1}^nSn​={(Xi​,Yi​)}i=1n​ is generated from the model p(Y=m∣X=x)=θm(x)\mathbb p(Y = m | X = x) = \theta_m(x)p(Y=m∣X=x)=θm​(x), 1≤m≤M1 \leq m \leq M1≤m≤M, and θ1(x),…,θM(x)\theta_1(x), \dots, \theta_M(x)θ1​(x),…,θM​(x) are unknown Lipschitz functions. Given a test point XXX, our goal is to estimate θ1(X),…,θM(X)\theta_1(X), \dots, \theta_M(X)θ1​(X),…,θM​(X). An approach based on nonparametric smoothing uses a localization technique, i.e. the weight of observation (Xi,Yi)(X_i, Y_i)(Xi​,Yi​) depends on the distance between XiX_iXi​ and XXX. However, local estimates strongly depend on localizing scheme. In our solution we fix several schemes W1,…,WKW_1, \dots, W_KW1​,…,WK​, compute corresponding local estimates θ~(1),…,θ~(K)\widetilde\theta^{(1)}, \dots, \widetilde\theta^{(K)}θ(1),…,θ(K) for each of them and apply an aggregation procedure. We propose an algorithm, which constructs a convex combination of the estimates θ~(1),…,θ~(K)\widetilde\theta^{(1)}, \dots, \widetilde\theta^{(K)}θ(1),…,θ(K) such that the aggregated estimate behaves approximately as well as the best one from the collection θ~(1),…,θ~(K)\widetilde\theta^{(1)}, \dots, \widetilde\theta^{(K)}θ(1),…,θ(K). We also study theoretical properties of the procedure, prove oracle results and establish rates of convergence under mild assumptions.

View on arXiv
Comments on this paper