Table 5

Characteristics of the NN269 and DGSplicer data sets containing true and decoy acceptor and donor splice sites derived from the human genome.

NN269

DGSplicer

Acceptor

Donor

Acceptor

Donor


Sequence length

90

15

36

18

Consensus positions

AG at 69

GT at 8

AG at 26

GT at 10

Train total

5788

5256

322156

228268

Fraction positives

19.3%

21.2%

0.6%

0.8%

Test total

1087

990

80539

57067

Fraction positives

19.4%

21.0%

0.6%

0.8%


Sonnenburg et al. BMC Bioinformatics 2007 8(Suppl 10):S7   doi:10.1186/1471-2105-8-S10-S7

Open Data