Table 2

Distribution of poly (dA.dT) repeats of various lengths in the 5' upstream regions of housekeeping and tissue specific genes.

Poly (dA.dT) stretch (bp)
No. of repeat stretches in the two classes
No. of genes with repeats in 5' region (%)





Hkg#
Tsg*
Hkg#
Tsg*
§P-value

>10
443
345
268 (51.0)
240 (43.0)
1.31E-04
>11
381
297
243 (46.3)
214 (38.4)
3.25E-04
>12
339
248
226 (43.1)
184 (33.0)
6.10E-05
>13
295
207
209 (39.8)
156 (28.0)
4.29E-05
>14
251
168
188 (35.8)
128 (22.9)
2.77E-05
>15
209
140
164 (31.2)
111 (19.9)
8.83E-05
>16
180
116
146 (27.8)
99 (17.7)
7.58E-05
>17
155
103
134 (25.5)
88 (15.8)
2.42E-04
>18
138
79
120 (22.9)
71 (12.7)
2.23E-05
>19
112
66
101 (19.2)
59 (10.6)
2.61E-04
>20
100
58
92 (17.5)
53 (9.5)
5.32E-04

#Housekeeping genes, *Tissue specific genes. A total of 525 housekeeping and 558 tissue specific genes were analysed. The numbers in parentheses (4th & 5th columns) represent the percentage of genes containing the repeat stretch.

§Difference in the distribution of poly (dA.dT) stretches in Hkg and Tsg analysed by applying t-test (for normalizing the difference in sample size). The repeat lengths from >12 to >18 bp are showing very significantly different distributions between Hkg and Tsg. The distributions were examined in 2000 bp upstream region from the gene start site.

Ganapathi et al. BMC Bioinformatics 2005 6:126   doi:10.1186/1471-2105-6-126