Table 1

Motif factors and p values of TATAAAAG and TATATAAG and their extension sequences on S1000 and S2000.

Sequences
S1000
S2000
p

TATAAAAG
19
12.3
<1e-16
GTATAAAAG
18
8.5
<1e-16
GGTATAAAAG
8
8
2.3e-10
CTATAAAAG
11
10
<1e-16
TCTATAAAAG
8
6
1.6e-11
TATAAAAGC
26
21
<1e-16
TATAAAAGCA
7
9
8.6e-8
TATAAAAGG
8.3
7.2
1.9e-15
TATAAAAGGC
8
12
1.4e-12
TATAAAAGGG
8
9
2.0e-12
TATATAAG
9
7.3
<1e-16
GTATATAAG
17
15
<1e-16
GGTATATAAG
6
8
<1e-16
CTATATAAG
9
8
8.3e-14
TATATAAGG
13.5
11
<1e-16
TATAAAAAGG
8
8
4.0e-12

TATA extension sequences which are statistically significant mainly extend from two TATA elements: TATAAAAG and TATATAAG. Table 1 gives the motif factors and p values for these two TATA elements and fourteen TATA extension sequences. P values are calculated based on the human promoters of length 1000 bp. In Table 1, bases of italic bold font are the extension bases.

Shi and Zhou BMC Bioinformatics 2006 7(Suppl 4):S2   doi:10.1186/1471-2105-7-S4-S2