Pan-genomic distribution. Distribution of CDS based on a homology measure of 60% amino acid identity and 80% coverage. Chromosomes were added 10000 times without replacement in a randomized order and the number of core (mutually conserved) and accessory (found in at least one but not all strains) genes was noted. Since mean and median values for each step showed only little variation the mean numbers of gene classes were plotted. In order to predict a possible future pan-genomic distribution for this species we performed a power law fitting. A) Pan-genomic CDS after each consecutive addition of a strain, B) mutually conserved CDS, C) conservation of CDS and homology clusters.
Kuenne et al. BMC Genomics 2013 14:47 doi:10.1186/1471-2164-14-47