Excess or lack in number of sites per profile and per clade for the four major clades. The difference of distribution of the clade-specific profile affiliations is shown for real (top) and the average of ten simulated (bottom) data from the nuc80 dataset; the difference is measured based on the average of sites affiliated to each profile over the four clades. The clades of interest are Arthropoda (blue), Nematoda (purple), Platyhelminthes (green) and Lophotrochozoa (red). Boxes group profiles by similar physico-chemical properties: on the top, the profile name is defined according to the following rules: (i) only the amino acids with a stationary frequency of 0.1 or more are present, (ii) the amino acid is written in uppercase if its stationary frequency is of 0.4 or more; on the bottom, in the profile description, the amino acid is defined by the one-letter code and the height of the letter is proportional to its stationary frequency in the profile.
Roure and Philippe BMC Evolutionary Biology 2011 11:17 doi:10.1186/1471-2148-11-17