Table 1

Summary statistics for the distributions of contig sequence properties (a) after the first run of assembly, (b) after the second run of assembly, and (c) in the final set after filtering for minimum sequence length and average quality.
640,040 reads                           Min     1st Q   Median  Mean    3rd Q   Max
(a) 61,838 contigs    Length            40      301     413     460.4   551     2590
                      Number of reads   1       2       4       10.4    10      2611
                      Average coverage  1       1.9     3.1     5.3     5.7     945.4
                      Average quality   14.8    34.9    39.4    45.1    55.2    88.7
                      GC content        16.1    37.1    41.9    42.4    47.3    73.9
(b) 52,125 contigs    Length            40      299     413     466.5   564     2590
                      Average quality   14.8    35      39.8    46.1    56.7    90
                      GC content        16.1    37.1    41.8    42.3    47.2    73.9
(c) 44,896 contigs    Length            200     348     441     511     606     2590
    after filtering   Average quality   30      36      41.6    47.9    58.6    90
                      GC content        22      37.3    41.9    42.4    47.1    68.9
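
The filtering step and the six summary columns of the table can be illustrated with a minimal sketch. Everything here is hypothetical: the function names, the dictionary layout of a contig, and the toy values are illustrative only. The cut-offs of 200 for length and 30 for average quality are assumptions read off the minimum values in part (c); the table itself does not state the thresholds. Quartiles are computed by linear interpolation (the "inclusive" method of Python's standard library), which may differ from the method used to produce the table.

```python
import statistics


def six_number_summary(values):
    """Return (min, 1st quartile, median, mean, 3rd quartile, max)."""
    ordered = sorted(values)
    # Quartiles by linear interpolation between data points.
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return (ordered[0], q1, median, statistics.mean(ordered), q3, ordered[-1])


def filter_contigs(contigs, min_length=200, min_quality=30.0):
    """Keep contigs that meet both thresholds (assumed values, see lead-in)."""
    return [
        c for c in contigs
        if c["length"] >= min_length and c["avg_quality"] >= min_quality
    ]


# Toy contigs, not taken from the study's data.
contigs = [
    {"length": 40, "avg_quality": 14.8},
    {"length": 301, "avg_quality": 34.9},
    {"length": 413, "avg_quality": 39.4},
    {"length": 551, "avg_quality": 55.2},
    {"length": 2590, "avg_quality": 88.7},
]

kept = filter_contigs(contigs)
length_summary = six_number_summary([c["length"] for c in kept])
print(len(kept), length_summary)
```

Applying `six_number_summary` to each property before and after `filter_contigs` would give rows laid out like those in parts (a) to (c).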

Pujolar et al. BMC Genomics 2012 13:507   doi:10.1186/1471-2164-13-507
