ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 0903.2003
171
144
v1v2v3v4 (latest)

Feature selection in "omics" prediction problems using cat scores and false non-discovery rate control

11 March 2009
M. Ahdesmaki
K. Strimmer
ArXiv (abs)PDFHTML
Abstract

We propose an effective framework for high-dimensional linear discriminant analysis (LDA) based on three key elements: James-Stein shrinkage for learning prediction rules, feature ranking by correlation-adjusted t-scores (cat scores), and feature selection by thresholding and controlling false non-discovery rates (FNDR). Relative to competing LDA approaches our algorithm is computationally inexpensive and makes practical high-dimensional LDA analysis. Furthermore, we show on four experimental data sets and by comparing with the "higher criticism" approach that feature selection by FNDR control is very effective not only for LDA but also for diagonal discriminant analysis. The proposed shrinkage discriminant and variable selection procedure is implemented in the R package "sda" available from the R repository CRAN.

View on arXiv
Comments on this paper