Table 3

Rates of correct identifications of the informative variable for the power case

Degree of dependence

Method

Replacement

0.05

0.1

0.15

0.2


Scaled

randomForest

true

0.234

0.497

0.770

0.956

false

0.237

0.489

0.760

0.949

cforest

true

0.338

0.672

0.923

0.991

false

0.365

0.728

0.943

0.994


Unscaled

randomForest

true

0.194

0.413

0.701

0.928

false

0.186

0.400

0.710

0.919

cforest

true

0.324

0.648

0.910

0.989

false

0.370

0.729

0.943

0.994


Rates of correct identifications of the informative variable with the scaled and unscaled permutation importance of the randomForest method, applied with sampling with and without replacement, as compared to those of the cforest method, applied with sampling with and without replacement, as a function of the degree of dependence (indicated by the relevance parameter, cf. Table 2) between the informative variable X2 and the response. (Standard errors of the rates of correct identifications r over 1000 iterations can easily be computet by se =

<a onClick="popup('http://www.biomedcentral.com/1471-2105/8/25/mathml/M1','MathML',630,470);return false;" target="_blank" href="http://www.biomedcentral.com/1471-2105/8/25/mathml/M1">View MathML</a>

.)

Strobl et al. BMC Bioinformatics 2007 8:25   doi:10.1186/1471-2105-8-25

Open Data