Table 1

Megabases remaining after each filtering and masking step
| | Hard filters | Length >1 kbp | 0% simple repeats | Far (bp) from genes | Med/high recombination | Far (bp) from genes, med/high recombination | High BG selection coefficient |
|---|---|---|---|---|---|---|---|
| A | 1921.62 (65.0%) | 676.70 (35.2%) | 522.88 (77.3%) | 267.28 (51.1%) | 120.17 (23.0%) | 54.59 (10.4%) | 395.32 (75.6%) |
| X | 97.99 (63.3%) | 43.30 (44.2%) | 20.56 (47.5%) | 14.63 (71.1%) | 3.08 (15.0%) | 2.15 (10.5%) | 10.31 (50.2%) |

The first three filters, starting with the leftmost column, were applied sequentially, yielding the “genome-wide” set on which all additional analyses are based, for both the X chromosome (X) and the autosomes (A). Each subsequent filter selects a subset of this genome-wide set. Percentages are relative to the reference step: the preceding filtering step for the three sequential filters, and the third column (the genome-wide set) for all later columns.
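The percentage convention described in the caption can be checked with a few lines of arithmetic. The following is a minimal Python sketch, not code from the paper: the names `MEGABASES`, `PUBLISHED`, `GENOME_WIDE`, and `percent_of_reference` are hypothetical, and the values are copied from the table. The first column's percentage is not recomputed because the table does not give the starting total.

```python
# Megabases remaining per column, copied from Table 1
# (A = autosomes, X = X chromosome).
MEGABASES = {
    "A": [1921.62, 676.70, 522.88, 267.28, 120.17, 54.59, 395.32],
    "X": [97.99, 43.30, 20.56, 14.63, 3.08, 2.15, 10.31],
}

# Published percentages for columns 2 to 7, copied from Table 1.
PUBLISHED = {
    "A": [35.2, 77.3, 51.1, 23.0, 10.4, 75.6],
    "X": [44.2, 47.5, 71.1, 15.0, 10.5, 50.2],
}

GENOME_WIDE = 2  # zero-based index of the third column (the "genome-wide" set)


def percent_of_reference(mb):
    """Return percentages for columns 2..7 relative to their reference column.

    Columns up to and including the third are compared with the column
    immediately before them; all later columns are compared with the third.
    """
    result = []
    for i in range(1, len(mb)):
        reference = mb[i - 1] if i <= GENOME_WIDE else mb[GENOME_WIDE]
        result.append(100.0 * mb[i] / reference)
    return result


for label, mb in MEGABASES.items():
    for computed, published in zip(percent_of_reference(mb), PUBLISHED[label]):
        print(f"{label}: computed {computed:.2f}%, published {published:.1f}%")
```

Recomputed this way, every value agrees with the published percentage to within about 0.1 percentage point. The small residual differences would be expected if the published percentages were derived from unrounded megabase counts, although that is an assumption rather than something the table states.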

Arbiza et al. BMC Bioinformatics 2012 13:301   doi:10.1186/1471-2105-13-301