Table 1

MetaPhyler performance using fewer and fewer training dataset

Exclude Training

60bp

300bp


Genus

Family

Order

Class

Phylum

Genus

Family

Order

Class

Phylum


Genome

90.72

33.45

97.18

54.22

98.10

59.59

99.11

70.72

99.56

75.30

97.90

52.39

99.14

70.17

99.15

78.09

99.34

84.52

99.64

91.18

Genus

77.15

16.47

86.32

23.16

94.92

34.60

96.72

43.48

92.55

31.06

95.71

48.63

98.23

64.22

98.84

77.35

Family

63.62

13.19

90.31

24.64

94.65

34.99

85.25

26.65

96.78

53.15

97.66

69.42

Order

80.04

17.73

90.29

27.80

93.69

39.97

96.26

58.86

Class

78.16

16.59

90.94

42.62


MetaPhyler phylogenetic classification performance on 60bp and 300bp simulated metagenomic reads. For each prediction, the top and bottom numbers are precision and sensitivity in percentage, respectively. Different taxonomic levels are excluded when evaluating the classification, e.g., ’Genus’ means genes that have the same genus label as the query read are excluded from the reference training dataset.

Liu et al. BMC Genomics 2011 12(Suppl 2):S4   doi:10.1186/1471-2164-12-S2-S4

Open Data