This article is part of the supplement: UT-ORNL-KBRIN Bioinformatics Summit 2010

Open Access Poster presentation

Developing measures for microbial genome assembly quality control

Rachel M Adams1, Jason B Harris1, Jeremy J Jay2*, Beth G Johnson3, Miriam L Land4 and Loren J Hauser4

Author Affiliations

1 Genome Science and Technology Graduate School, University of Tennessee, Knoxville, TN 37996, USA

2 Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN 37996, USA

3 Department of Mathematics, University of Tennessee, Knoxville, TN 37996, USA

4 Computational Biology and Bioinformatics Group, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA

BMC Bioinformatics 2010, 11(Suppl 4):P14 doi:10.1186/1471-2105-11-S4-P14

The electronic version of this article is the complete one and can be found online at:

Published: 23 July 2010

© 2010 Jay et al; licensee BioMed Central Ltd.


Advances in sequencing technologies are outpacing the rate at which genomes can be thoroughly finished and analyzed. Over the next year, the volume of genome sequencing will increase many-fold, but high-quality, high-throughput annotation methods have yet to be developed to meet the need. As more microbial genomes are sequenced, whole-genome annotation methods identify many putative genes that need further verification. By analyzing a broad range of annotated genomes, we can identify patterns and statistics that are useful for assessing annotation quality and for detecting spurious gene outliers. Our work aims to identify quality control measures based on a full inter-genomic comparison rather than on individual sequence-level or database-specific statistics. Comparing and filtering with these methods makes it possible to narrow the scope of manual gene curation and to scrutinize putative genes more closely before publication, which in turn makes higher-quality genome annotation possible. Our results clearly show the quality of well-studied genomes, expose the weaknesses of draft genome builds, and illustrate the need for further high-throughput quality control measures.
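
As a rough illustration of what an inter-genomic quality check could look like, the sketch below flags genomes whose summary statistic deviates strongly from the rest of a collection. It is not the authors' method: the abstract does not name its statistics, so the choice of coding density as the statistic, the modified z-score (median and median absolute deviation), the 3.5 threshold, the function names, and the toy values are all assumptions made for this example.

```python
from statistics import median


def robust_z_scores(values):
    """Modified z-scores based on the median and the median absolute deviation (MAD)."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # All values (or most of them) are identical, so nothing stands out.
        return [0.0 for _ in values]
    # 0.6745 scales the MAD so scores are comparable to standard z-scores.
    return [0.6745 * (v - med) / mad for v in values]


def flag_outlier_genomes(stats, threshold=3.5):
    """Return names of genomes whose statistic is an outlier across the collection.

    `stats` maps a genome name to one per-genome summary value
    (hypothetical here: fraction of the genome annotated as coding).
    """
    names = list(stats)
    scores = robust_z_scores([stats[name] for name in names])
    return [name for name, z in zip(names, scores) if abs(z) > threshold]


# Toy values invented for illustration only; these are not results from the poster.
toy_coding_density = {
    "genome_A": 0.88,
    "genome_B": 0.87,
    "genome_C": 0.89,
    "genome_D": 0.86,
    "draft_E": 0.62,
}

print(flag_outlier_genomes(toy_coding_density))  # ['draft_E']
```

A median-based score is used here because a single poor draft build would inflate an ordinary mean and standard deviation and could mask itself. The same pattern could be applied per gene rather than per genome, for example to gene lengths within an ortholog group, to shortlist candidates for manual curation.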