|
Figure 3.
Sequence logos for |C| = 20, 40, and 80, showing several features of IB discretization. First, the number of clusters assigned to each amino acid varies with its overall frequency: A and L are more common, while C is the least common. Second, clusters capture strongly and weakly conserved variants, as well as some chemical similarities: I, V, L, and M are all hydrophobic.
O'Rourke et al. BMC Bioinformatics 2006 7(Suppl 1):S8 doi:10.1186/1471-2105-7-S1-S8 |