Cumulative distribution of orthologous protein sequence identity values. The minimum sequence identity values from 945 Prodigal ortholog sets were plotted, illustrating the impact of gene start site revisions over the range of observed protein conservation. The main plot shows the majority of the data, between 50% and 90% protein sequence identity, while the inset shows the entire range of values.
Dunbar et al. BMC Genomics 2011 12:125 doi:10.1186/1471-2164-12-125