BMC Bioinformatics

official impact factor 3.03

Open Access Database

MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level

Hélène Chiapello1*, Annie Gendrault1, Christophe Caron1, Jérome Blum1, Marie-Agnès Petit2 and Meriem El Karoui2*

Author Affiliations

1 INRA UR1077, Unité Mathématique, Informatique & Génome, Domaine de Vilvert, 78352, Jouy-en-Josas, France

2 INRA UR888, Unité des Bactéries Lactiques et Pathogènes Opportunistes, Domaine de Vilvert, 78352, Jouy-en-Josas, France

For all author emails, please log on.

BMC Bioinformatics 2008, 9:498 doi:10.1186/1471-2105-9-498

Published: 27 November 2008

Abstract

Background

The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons.

Description

Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons.

Conclusion

The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic webcite.