Table 1

Summary of DNA sequence filtering results.

Data set        Filter applied¹     Pass filter    Pass filter (%)    GATC start (%)

Pre-selection   l32 n.              107888201      94.48              24.78
Assembly        l32 n. q20 o230     27979963       24.50              46.64
SNP             l32 n. q10 o230     32941906       28.84              40.23

¹ Sequences were filtered for a length of 32 bases and for the absence of base-call errors (n or .). Singly represented reads were required to have a per-base quality score of at least 20 (assembly data set) or 10 (SNP data set). Sequences more than four times overrepresented, based on the expected 56× coverage, were discarded.
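
The filtering steps described in the footnote can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names `filter_reads` and `gatc_start_percent`, the in-memory representation of reads as (sequence, quality list) pairs, and the way the cut-offs are passed in are all assumptions made for illustration. In particular, the default `max_count=230` is only a guess at what the filter code `o230` denotes, and the sketch counts reads rather than unique sequences, which the table does not specify.

```python
from collections import Counter


def filter_reads(reads, length=32, min_quality=20, max_count=230):
    """Filter reads given as (sequence, qualities) pairs.

    All parameter names and defaults are illustrative assumptions,
    not values confirmed by the source beyond the footnote text.
    """
    # Keep reads of the required length without uncalled bases (n or .).
    clean = [
        (seq, quals)
        for seq, quals in reads
        if len(seq) == length and "n" not in seq.lower() and "." not in seq
    ]

    # Count how often each distinct sequence occurs among the clean reads.
    counts = Counter(seq for seq, _ in clean)

    kept = []
    for seq, quals in clean:
        n = counts[seq]
        # Discard overrepresented sequences.
        if n > max_count:
            continue
        # Apply the per-base quality threshold to singly represented reads only.
        if n == 1 and min(quals) < min_quality:
            continue
        kept.append((seq, quals))
    return kept


def gatc_start_percent(reads):
    """Percentage of reads whose sequence starts with GATC."""
    if not reads:
        return 0.0
    hits = sum(seq.upper().startswith("GATC") for seq, _ in reads)
    return 100.0 * hits / len(reads)
```

Under these assumptions, the assembly and SNP data sets would differ only in the `min_quality` argument (20 versus 10), while the pre-selection row would correspond to the length and base-call checks alone.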

Kerstens et al. BMC Genomics 2009 10:479   doi:10.1186/1471-2164-10-479
