Additional File 3.

Training results obtained using various combinations of 'cumulative sequence count' (20K, 30K, 40K, 50K, 60K, 70K, 80K) and 'overlap percentage' (20%, 30%, 40%, 50%, 60%, 70%, 80%). Tables 3A-D (in this file) show the results obtained with the Sanger, 454-400, 454-250 and 454-100 training data sets, respectively.

Mohammed et al. BMC Genomics 2011, 12(Suppl 3):S12. doi:10.1186/1471-2164-12-S3-S12