Clustering evaluation on the simulated data set. The results are based on simulated data with 4 dimensions. Figure 3(a) shows the number of clusters k over 2000 MCMC iterations. Figure 3(b) shows overall precision vs. overall recall across the 2000 DPBMM clustering results; the overall precision almost always stays at 1. Figure 3(c) shows the F metric vs. recall curve. Figure 3(d) shows the histogram of the F metric over the 2000 DPBMM clustering results.
Zhang et al. BMC Genomics 2012 13(Suppl 6):S20 doi:10.1186/1471-2164-13-S6-S20
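As a reading aid for panels (b) to (d), the sketch below computes precision, recall, and an F metric for a single clustering against known labels. The caption does not define these quantities, so the pair-counting definitions used here are an assumption and may differ from the paper's; the function name `pairwise_scores` and the toy labels are hypothetical.

```python
from itertools import combinations


def pairwise_scores(true_labels, pred_labels):
    """Pair-counting precision, recall, and F for one clustering.

    Assumption: a pair of points is a true positive when both the
    reference labels and the predicted labels put it in the same cluster.
    """
    tp = fp = fn = 0
    for i, j in combinations(range(len(true_labels)), 2):
        same_true = true_labels[i] == true_labels[j]
        same_pred = pred_labels[i] == pred_labels[j]
        if same_pred and same_true:
            tp += 1
        elif same_pred:
            fp += 1  # grouped together by the clustering, but not in truth
        elif same_true:
            fn += 1  # together in truth, but split by the clustering
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 1.0
    denom = precision + recall
    f = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f


# Toy example (made-up labels): one true cluster is split into two.
truth = [0, 0, 0, 0, 1, 1]
pred = [0, 0, 2, 2, 1, 1]
precision, recall, f = pairwise_scores(truth, pred)
print(precision, recall, f)
```

In this toy case no predicted cluster mixes points from different true clusters, so precision is 1 while recall falls below 1. This only illustrates how the two measures can diverge under the assumed definitions; it is not a claim about the simulated data in Figure 3.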