Open Access Research article

Phylogenetic and genomic diversity of human bacteremic Escherichia coli strains

Françoise Jaureguy12, Luce Landraud1, Virginie Passet3, Laure Diancourt3, Eric Frapy1, Ghislaine Guigon4, Etienne Carbonnelle1, Olivier Lortholary56, Olivier Clermont7, Erick Denamur7, Bertrand Picard2, Xavier Nassif1 and Sylvain Brisse34*

Author Affiliations

1 INSERM UMR570, Faculté de Médecine, Université Paris Descartes, Paris, France

2 Hôpital Avicenne, AP-HP; UFR Santé, Médecine, Biologie Humaine, Université Paris 13, Bobigny, France

3 Biodiversity of Emerging Bacterial Pathogens, Institut Pasteur, 28 rue du Dr Roux, 75724 Paris, France

4 Genotyping of Pathogens and Public Health, Institut Pasteur, 28 rue du Dr Roux, 75724 Paris, France

5 Faculté de Médecine, Université René Descartes, Hôpital Necker-Enfants Malades, AP-HP, Paris, France

6 Institut Pasteur, CNRS URA3012, Paris, France

7 INSERM U722, Faculté de Médecine, Université Paris Diderot, Paris, France

For all author emails, please log on.

BMC Genomics 2008, 9:560  doi:10.1186/1471-2164-9-560

Published: 26 November 2008



Extraintestinal pathogenic Escherichia coli (ExPEC) strains represent a huge public health burden. Knowledge of their clonal diversity and of the association of clones with genomic content and clinical features is a prerequisite to recognize strains with a high invasive potential. In order to provide an unbiased view of the diversity of E. coli strains responsible for bacteremia, we studied 161 consecutive isolates from patients with positive blood culture obtained during one year in two French university hospitals. We collected precise clinical information, multilocus sequence typing (MLST) data and virulence gene content for all isolates. A subset representative of the clonal diversity was subjected to comparative genomic hybridization (CGH) using 2,324 amplicons from the flexible gene pool of E. coli.


Recombination-insensitive phylogenetic analysis of MLST data in combination with the ECOR collection revealed that bacteremic E. coli isolates were highly diverse and distributed into five major lineages, corresponding to the classical E. coli phylogroups (A+B1, B2, D and E) and group F, which comprises strains previously assigned to D. Compared to other strains of phylogenetic group B2, strains belonging to MLST-derived clonal complexes (CCs) CC1 and CC4 were associated (P < 0.05) with a urinary origin. In contrast, no CC appeared associated with severe sepsis or unfavorable outcome of the bacteremia. CGH analysis revealed genomic characteristics of the distinct CCs and identified genomic regions associated with CC1 and/or CC4.


Our results demonstrate that human bacteremia strains distribute over the entire span of E. coli phylogenetic diversity and that CCs represent important phylogenetic units for pathogenesis and comparative genomics.