Metabolic classification of microbial genomes using functional probes
- Equal contributors
1 Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan
2 Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
3 Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
4 Department of Computer Science and Computer Engineering, La Trobe University, Melbourne, Australia
5 Department of Computer Science and Information Engineering, Providence University, Taichung, Taiwan
6 Department of Medical Science, National Tsing Hua University, Hsinchu, Taiwan
7 Graduate Institute of Molecular Systems Biomedicine, China Medical University, Taichung, Taiwan
BMC Genomics 2012, 13:157 doi:10.1186/1471-2164-13-157Published: 27 April 2012
Microorganisms able to grow under artificial culture conditions comprise only a small proportion of the biosphere's total microbial community. Until recently, scientists have been unable to perform thorough analyses of difficult-to-culture microorganisms due to limitations in sequencing technology. As modern techniques have dramatically increased sequencing rates and rapidly expanded the number of sequenced genomes, in addition to traditional taxonomic classifications which focus on the evolutionary relationships of organisms, classifications of the genomes based on alternative points of view may help advance our understanding of the delicate relationships of organisms.
We have developed a proteome-based method for classifying microbial species. This classification method uses a set of probes comprising short, highly conserved amino acid sequences. For each genome, in silico translation is performed to obtained its proteome, based on which a probe-set frequency pattern is generated. Then, the probe-set frequency patterns are used to cluster the proteomes/genomes.
Features of the proposed method include a high running speed in challenge of a large number of genomes, and high applicability for classifying organisms with incomplete genome sequences. Moreover, the probe-set clustering method is sensitive to the metabolic phenotypic similarities/differences among species and is thus supposed potential for the classification or differentiation of closely-related organisms.