|
Resolution: standard / high Figure 1.
Flow chart of the method used for subcellular localizations of Leptospira interrogans serovar Lai genome. Protein sequences of Leptospira interrogans serovar Lai genome (4,727 ORFs) were analyzed for subcellular localization using PSORTb,
ProtCompB, and Proteome analyst (PA) prediction. (a) The consensus vote was obtained
from the majority vote type procedure to obtain the result with high prediction accuracy.
If all 3 methods agree for localization it was assigned as a consensus vote. The remaining
(1 or 2 out of 3 predicted result) was assigned as non-consensus vote. The consensus
vote of CP and CM was used as a training set for the development of an LDA-based classifier
for CP and CM in the next step. (b) The non-consensus vote results of OM, PP, and
EX were further analyzed for sequence and structure homology by DBsubloc and GTD prediction.
The non-consensus vote of EX, OM, and PP with significant homology or/and structure
information were identified by DBsubloc and GTD prediction. (c) Non-consensus votes
of CP, CM and the non predicted data from DBsubloc and GTD predictions were further
analyzed for subcellular localization using LDA-based classifier for CP and CM. Significantly
predicted results were proteins classified with more than 0.90 probability for CP
and CM proteins. The remaining queries that could not be identified in this step were
classified as "unknown" results.
Viratyosin et al. BMC Genomics 2008 9:181 doi:10.1186/1471-2164-9-181 |