Table 2

Summary of statistical analysis on TATAAAAG and TATATAAG and their extension sequences on gene promoters of four organisms.


Human
Mice
Zebrafish
Drosophila



MF
%
MF
%
MF
%
MF
%

TATAAAAG
19
2.5
12
3.8
8
6.4
37.4
5.9
GTATAAAAG
18
0.5
6
0.8
13
1.4
40.3
1.5
TATAAAAGG
8.3
0.7
5
1.0
9
1.1
42
1.0
CTATAAAAG
11
0.5
9
1.2
9
1.0
34.5
1.4
TATAAAAGC
26
0.6
9
1.1
6.5
1.8
30.8
1.9
TATATAAG
9
1.2
5
2.3
4.3
4.0
8.7
3.8
GTATATAAG
17
0.3
3
0.6
8
0.6
7
0.6
TATATAAGG
13.5
0.3
5
0.8
2
0.4
6.5
0.6
CTATATAAG
9
0.3
7
0.8
6
0.6
15
0.8

TATAAAAG and TATATAAG and seven of their extension sequences are found to be statistically significant in all four organisms including human, mice, zebrafish and drosophila melanogaster (their motif factors are equal to or greater than 2 in each organism). These TATA elements and TATA extension sequences are believed to be conserved in the course of evolution. Seven TATA extension sequences in this table are also included in Table 1. In this table, bases of bold italic font denote the extension bases, MF denotes Motif factor and % denotes the percentage of promoters which contain the corresponding sequence in the first column.

Shi and Zhou BMC Bioinformatics 2006 7(Suppl 4):S2   doi:10.1186/1471-2105-7-S4-S2