Table 2

Top five motifs in 5' intergenic regions of T. parva and T. annulata


Consensus
Width
Bits
E-value
Sites

T. parva Motif 1
HDWTYCCCCATVVR
14
16.0
5.7e-472
639
2
WAATGTGTARR
11
13.6
5.2e-414
1199
3
BDRGATTCCAY
11
16.0
8.5e-108
298
4
CCBCBRSARGGAGCY
15
24.2
3.7e-048
39
5
GCTCCCKCYWGGSG
14
26.2
1.2e-019
19

T. annulata Motif 1
WTTCCCCATNND
12
15.5
4.1e-388
615
2
WAATGTGTAAW
11
13.3
1.1e-348
1173
3
TGTTTGTTCTGATGG
15
27.2
7.1e-140
56
4
BDRGATTCCAY
11
15.8
8.8e-103
302
5
GTAATATTTCTGTAA
15
23.7
5.0e-096
62

Motifs are identified by MEME and their consensus sequences are shown here using IUPAC (International union of pure and applied chemistry) nucleic acid codes. Information content in bits is calculated based on the first-order Markov background model. The E-value estimates the number of motifs with the same width and number of occurrence that would have equal or higher likelihood in the same number of random sequences generated by the background model. Sites refer to the number of sequences that MEME used to build each motif model.

Guo and Silva BMC Genomics 2008 9:582   doi:10.1186/1471-2164-9-582