Table 2

Results with a naïve Bayes classifier.

Cross-validation

Trained on:

All

All

NoS

All

NoE

All

key

Tested on:

All

NoS

NoS

NoE

NoE

key

key


mixed

AUC

0.83 ± 0.01

0.70 ± 0.02

0.70 ± 0.02

0.81 ± 0.02

0.81 ± 0.02

0.80 ± 0.02

0.79 ± 0.01

MCC

0.44 ± 0.04

0.27 ± 0.03

0.27 ± 0.03

0.43 ± 0.03

0.43 ± 0.03

0.41 ± 0.02

0.35 ± 0.06

Overall error rate

0.19 ± 0.01

0.24 ± 0.01

0.24 ± 0.01

0.18 ± 0.01

0.18 ± 0.01

0.18 ± 0.01

0.21 ± 0.00

Effect error rate

0.35 ± 0.05

0.52 ± 0.03

0.52 ± 0.03

0.26 ± 0.07

0.26 ± 0.07

0.24 ± 0.07

0.41 ± 0.04

No effect error rate

0.15 ± 0.02

0.18 ± 0.01

0.18 ± 0.01

0.17 ± 0.01

0.17 ± 0.01

0.17 ± 0.02

0.17 ± 0.03

sensitivity

0.47 ± 0.12

0.37 ± 0.06

0.37 ± 0.06

0.37 ± 0.06

0.37 ± 0.06

0.36 ± 0.09

0.38 ± 0.16

specificity

0.92 ± 0.03

0.88 ± 0.02

0.88 ± 0.02

0.96 ± 0.02

0.96 ± 0.02

0.96 ± 0.03

0.92 ± 0.05


lac rep

AUC

0.84 ± 0.02

0.74 ± 0.02

0.74 ± 0.02

0.82 ± 0.02

0.82 ± 0.02

0.80 ± 0.02

0.80 ± 0.02

MCC

0.47 ± 0.03

0.33 ± 0.06

0.33 ± 0.06

0.46 ± 0.04

0.46 ± 0.04

0.44 ± 0.03

0.39 ± 0.05

Overall error rate

0.18 ± 0.01

0.23 ± 0.01

0.23 ± 0.01

0.18 ± 0.01

0.18 ± 0.01

0.19 ± 0.01

0.21 ± 0.00

Effect error rate

0.27 ± 0.05

0.40 ± 0.04

0.40 ± 0.04

0.20 ± 0.06

0.20 ± 0.06

0.18 ± 0.09

0.36 ± 0.05

No effect error rate

0.16 ± 0.02

0.19 ± 0.02

0.19 ± 0.02

0.18 ± 0.02

0.18 ± 0.02

0.19 ± 0.03

0.18 ± 0.03

sensitivity

0.47 ± 0.10

0.36 ± 0.12

0.36 ± 0.12

0.37 ± 0.08

0.38 ± 0.08

0.34 ± 0.12

0.41 ± 0.13

specificity

0.93 ± 0.03

0.92 ± 0.04

0.92 ± 0.04

0.96 ± 0.02

0.96 ± 0.02

0.97 ± 0.04

0.92 ± 0.04


lysozyme

AUC

0.83 ± 0.02

0.68 ± 0.04

0.68 ± 0.05

0.81 ± 0.04

0.81 ± 0.04

0.78 ± 0.04

0.77 ± 0.04

MCC

0.40 ± 0.05

0.23 ± 0.06

0.23 ± 0.06

0.38 ± 0.08

0.38 ± 0.08

0.36 ± 0.11

0.28 ± 0.09

Overall error rate

0.17 ± 0.02

0.24 ± 0.01

0.24 ± 0.02

0.17 ± 0.03

0.17 ± 0.03

0.16 ± 0.02

0.21 ± 0.03

Effect error rate

0.40 ± 0.05

0.63 ± 0.05

0.63 ± 0.05

0.39 ± 0.12

0.39 ± 0.12

0.33 ± 0.13

0.54 ± 0.09

No effect error rate

0.13 ± 0.02

0.15 ± 0.01

0.15 ± 0.01

0.13 ± 0.03

0.13 ± 0.03

0.15 ± 0.02

0.14 ± 0.02

Sensitivity

0.43 ± 0.11

0.39 ± 0.07

0.39 ± 0.07

0.38 ± 0.17

0.38 ± 0.17

0.28 ± 0.09

0.36 ± 0.11

Specificity

0.93 ± 0.03

0.84 ± 0.02

0.84 ± 0.02

0.93 ± 0.07

0.93 ± 0.07

0.97 ± 0.01

0.89 ± 0.04


Train: lac rep

AUC

0.80

0.66

0.67

0.78

0.78

0.77

0.77

MCC

0.40

0.23

0.23

0.35

0.35

0.35

0.35

Overall error rate

0.20

0.27

0.24

0.17

0.17

0.16

0.16

Test: lysozyme

Effect error rate

0.52

0.65

0.63

0.41

0.41

0.32

0.32

No effect error rate

0.10

0.14

0.15

0.14

0.14

0.15

0.16

Sensitivity

0.58

0.46

0.39

0.33

0.33

0.26

0.26

Specificity

0.85

0.80

0.84

0.95

0.95

0.97

0.97


Train: lysozyme

AUC

0.81

0.71

0.71

0.80

0.80

0.79

0.79

MCC

0.43

0.37

0.37

0.41

0.41

0.42

0.42

Overall error rate

0.20

0.22

0.22

0.20

0.20

0.19

0.19

Test: lac rep

Effect error rate

0.34

0.43

0.43

0.25

0.25

0.18

0.18

No effect error rate

0.17

0.17

0.17

0.19

0.19

0.20

0.20

Sensitivity

0.45

0.46

0.46

0.33

0.33

0.30

0.30

Specificity

0.92

0.88

0.88

0.96

0.96

0.98

0.98


Column: (1) trained on all variables, tested with all variables observed; (2) trained on all variables, tested without any structural information (NoS) – only evolutionary variables observed; (3) trained and tested using only five evolutionary nodes; (4) trained on all variables, tested without any evolutionary information (NoE) – only structural variables observed; (5) trained and tested using only eight structural nodes; (6) trained on all variables, tested with only key variables observed (see later section); (7) trained and tested using only the three key variables.

Needham et al. BMC Bioinformatics 2006 7:405   doi:10.1186/1471-2105-7-405

Open Data