Additional file 5: Figure S4.

Phylogenetic diversity of large reference databases is weakly inversely correlated with topological error. The phylogenetic diversity of each reference database was determined by summing all branch lengths in a phylogenetic tree inferred via RAxML from the sequences in that database. Due to their construction (see Methods), our simulated reference databases all have greater diversity than is likely to be present in real reference databases. Each point is the mean nRF error over 10 simulations, with a mean read length of 400 bp and 200 reads. The shaded region represents the 95% confidence interval.
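
As an illustration of the diversity measure described above, the following minimal sketch sums the branch lengths of a tree given as a Newick string. It is not the authors' code: the function name `phylogenetic_diversity` and the example tree are hypothetical, and the sketch assumes the tree is available in Newick format with every branch length written after a colon and no colons inside taxon labels.

```python
import re


def phylogenetic_diversity(newick: str) -> float:
    """Return the sum of all branch lengths in a Newick tree string.

    Assumption: branch lengths are non-negative numbers that follow a
    colon (e.g. "A:0.1"), and taxon labels contain no colons.
    """
    # Capture the number after each colon, allowing scientific notation.
    pattern = r":\s*([0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)"
    lengths = re.findall(pattern, newick)
    return sum(float(value) for value in lengths)


# Hypothetical four-taxon tree, used only to show the calculation.
example_tree = "((A:0.1,B:0.2):0.05,(C:0.3,D:0.4):0.15);"
pd_total = phylogenetic_diversity(example_tree)
print(f"Total branch length: {pd_total:.3f}")
```

A regular expression keeps the sketch dependency-free. For trees with quoted labels or other annotations, a dedicated Newick parser would be the safer choice.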

Riesenfeld and Pollard BMC Genomics 2013 14:419   doi:10.1186/1471-2164-14-419