Table 1

Summary of the sequencing data and assembly statistics


                Reads Used              Total Length    Scaffold N50#
male reads      24.9 Gb (124.5-fold)    234 Mb          9.9 kb
female reads    5.7 Gb (28.5-fold)      170 Mb          12.6 kb
                14.0 Gb (98.9-fold)     183 Mb          31.9 kb → 49.0 kb

* There are three versions of the initial assembly. One was produced using only male reads and one using only female reads; both of these assemblies were used to extract candidate B- or neo-Y linked sequences. The third, the reference assembly, was produced using all the female reads and part of the male reads, with parameters chosen to maximize the N50 statistic. The reference assembly was then further optimized (designated by '→' in Table 1) by comparative analysis with other Drosophila genomes and gene annotations.

# N50: the length L such that 50% of all nucleotides in the assembly are contained in contigs/scaffolds of size ≥ L.
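
The N50 definition in the footnote can be illustrated with a short Python sketch. It takes L to be the largest length that satisfies the definition, which is the usual convention. This is only an illustration: the function name `n50` and the toy lengths are hypothetical and are not taken from the paper or from its assembly pipeline.

```python
def n50(lengths):
    """Return the largest L such that sequences of length >= L
    contain at least half of all nucleotides in the assembly."""
    if not lengths:
        raise ValueError("need at least one contig/scaffold length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating bases until
    # the running sum reaches half of the assembly.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths (arbitrary units), not data from Table 1.
toy_lengths = [2, 3, 4, 5, 6]
print(n50(toy_lengths))  # prints 5: scaffolds of length >= 5 hold 11 of 20 bases
```

Because N50 depends only on the distribution of sequence lengths, joining scaffolds raises it without adding sequence, which is consistent with the reference assembly's N50 increasing after the further optimization step.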

Zhou et al. BMC Genomics 2012 13:109   doi:10.1186/1471-2164-13-109
