Evolutionary Search of Thresholds for Robust Feature Set Selection; Application to the Analysis of Microarray Data

C. Cotta, C. Sloper, P. Moscato

Applications of Evolutionary Computation, G. Raidl et al.(eds.), Lecture Notes in Computer Science 3005, pp. 21-30, Springer-Verlag Berlin, 2004

© Springer-Verlag Berlin Heidelberg 2004. All rights reserved.


We deal with two important problems in pattern recognition that arise in the analysis of large datasets. While most feature subset selection methods use statistical techniques to preprocess the labeled datasets, these methods are generally not linked with the combinatorial properties of the final solutions. We prove that it is $NP-$hard to obtain an appropriate set of thresholds that will transform a given dataset into a binary instance of a robust feature subset selection problem. We address this problem using an evolutionary algorithm that learns the appropriate value of the thresholds. The empirical evaluation shows that robust subset of genes can be obtained. This evaluation is done using real data corresponding to the gene expression of lymphomas.

Download BibTEX entry
Download PDF version (217K)
[Back to publications page].