Presentation of the MW-pI plots for the C. elegans II data sets in three phases. These plots represent the distribution of the Phase I, II, III data sets for C. elegans II for the purpose of demonstrating the applicability of the proposed data set generation method in the absence of a priori parameter configuration scheme. We suggest the user to divide the entire ranges of the parameters (e.g. MW and pI) into subgroups and use the proposed algorithm to handle each subgroup individually. Ultimately a representative data set is obtained by combining all of the subgroups together. From the plots we can observe that eventually (in Phase III) the most populated regions converge to some certain ranges, i.e., MW in between 20–60 KDa and pI in between 4 to 10. A bimodal pattern can also be observed.
Zeng et al. BMC Bioinformatics 2009 10:206 doi:10.1186/1471-2105-10-206