Table 1

Regression models of genomic di-, tetra- and hexanucleotide frequencies and AT content

DNA word size

Regression equations

Coefficient of determination

Significance


Dinucleotides

Y2 = exp(-6.42-8.64XAT + 6.59X2AT)

R2 = 0.17

p < 0.001

Tetranucleotides

Y4 = exp(-8.85-14.73XAT + 12.39X2AT)

R2 = 0.33

p < 0.001

Hexanucleotides

Y6 = exp(-11.74-21.94XAT + 19.40X2AT)

R2 = 0.46

p < 0.001


Bohlin et al. BMC Genomics 2009 10:487   doi:10.1186/1471-2164-10-487

Open Data