Figure 1.

simMC (≥ 8 kbp): unsupervised setting. Comparison of OFDEG, OFDEG+GC and tetranucleotide frequency (TF) features using unsupervised methods, for the Phrap (A) and Arachne (B) assemblers. OFDEG features greatly improve on the TF feature space, particularly in the sensitivity measure of binning performance. Adding GC content yields only a minimal further improvement, which suggests that OFDEG alone carries most of the capacity for binning.

