The analysis of the three-fold cross validation performance of the algorithm on the Leukemia dataset. The dotted lines indicate the performance values in individual tests. The solid lines indicate the average values; and the dotted lines indicate one standard deviation from the averages. The X-axis represents the number of genes in Si. (a) The classification accuracy θtest on the test samples. (b) The classification accuracy θtrain on the training samples. (c) The p-values ρS(i) based on . (d) The p-values ρS(i) based on . (e) The p-value ρall(i) based on (Ai). (f) The average Silhouette width (Ai) of active clusters in Ai.
Xu et al. BMC Genomics 2008 9(Suppl 2):S18 doi:10.1186/1471-2164-9-S2-S18