Distribution of fraction of Dissimilar function (Ordinate: fraction) versus sequence identity (X-axis in bins of 10%). The top of each box is the upper 75th percentile, the bottom is the lower 25th percentile. The median of each box is also shown but is superimposed on the 25th percentile. The circles are single extreme cases. The line joins the mean fraction of Dissimilar function at each level of sequence identity. The mean is well above the median due to the extreme skewness of the distribution towards mostly similar function.
Sangar et al. BMC Bioinformatics 2007 8:294 doi:10.1186/1471-2105-8-294