Table 3

An example of protein location prediction. The host YGL115W has four guest proteins that share four statistically significant signatures. The host and all its guests with known location were found in cytoplasm. Thus the location of YGL208W was predicted as cytoplasm. The prediction was then confirmed with the ontology annotation in SGD database. The p-value of the occurrence is the probability that a single random subsequence of the length of the motif matches the motif.

Guest
Motif ID
P-value
Guest location

YER027C
YGL115W_1
3.17E-76
cytoplasm
YGL208W
YGL115W_1
7.48E-75

YDR422C
YGL115W_1
4.78E-48
cytoplasm
YER027C
YGL115W_2
3.87E-56
cytoplasm
YGL208W
YGL115W_2
8.48E-57

YDR422C
YGL115W_2
3.64E-37
cytoplasm
YER027C
YGL115W_3
6.83E-77
cytoplasm
YGL208W
YGL115W_3
6.37E-71

YDR028C
YGL115W_3
9.81E-38
cytoplasm
YER027C
YGL115W_4
5.62E-22
cytoplasm
YGL208W
YGL115W_4
7.23E-24

YDR477W
YGL115W_4
1.89E-14
cytoplasm

Fang et al. BMC Bioinformatics 2005 6:277   doi:10.1186/1471-2105-6-277