91

Optimizing Sparse Generalized Singular Vectors for Feature Selection in Proximal Support Vector Machines with Application to Breast and Ovarian Cancer Detection

Main:14 Pages
9 Figures
Bibliography:2 Pages
6 Tables
Abstract

This paper presents approaches to compute sparse solutions of Generalized Singular Value Problem (GSVP). The GSVP is regularized by 1\ell_1-norm and q\ell_q-penalty for 0<q<10<q<1, resulting in the 1\ell_1-GSVP and q\ell_q-GSVP formulations. The solutions of these problems are determined by applying the proximal gradient descent algorithm with a fixed step size. The inherent sparsity levels within the computed solutions are exploited for feature selection, and subsequently, binary classification with non-parallel Support Vector Machines (SVM). For our feature selection task, SVM is integrated into the 1\ell_1-GSVP and q\ell_q-GSVP frameworks to derive the 1\ell_1-GSVPSVM and q\ell_q-GSVPSVM variants. Machine learning applications to cancer detection are considered. We remarkably report near-to-perfect balanced accuracy across breast and ovarian cancer datasets using a few selected features.

View on arXiv
Comments on this paper