<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>1471-2164-12-576</ui>
<ji>1471-2164</ji>
<fm>
<dochead>Research article</dochead>
<bibl>
<title><p>Comparative genomics of the type VI secretion systems of <it>Pantoea </it>and <it>Erwinia </it>species reveals the presence of putative effector islands that may be translocated by the VgrG and Hcp proteins</p></title>
<aug>
<au ca="yes" id="A1"><snm>De Maayer</snm><fnm>Pieter</fnm><insr iid="I1"/><email>pieter.demaayer@fabi.up.ac.za</email></au>
<au id="A2"><snm>Venter</snm><mi>N</mi><fnm>Stephanus</fnm><insr iid="I1"/><email>fanus.venter@up.ac.za</email></au>
<au id="A3"><snm>Kamber</snm><fnm>Tim</fnm><insr iid="I2"/><email>tim.kamber@acw.admin.ch</email></au>
<au id="A4"><snm>Duffy</snm><fnm>Brion</fnm><insr iid="I2"/><email>duffy@acw.admin.ch</email></au>
<au id="A5"><snm>Coutinho</snm><mi>A</mi><fnm>Teresa</fnm><insr iid="I1"/><email>teresa.coutinho@up.ac.za</email></au>
<au id="A6"><snm>Smits</snm><mi>HM</mi><fnm>Theo</fnm><insr iid="I2"/><email>theo.smits@acw.admin.ch</email></au>
</aug>
<insg>
<ins id="I1"><p>Forestry and Agricultural Biotechnology Institute, University of Pretoria, South Africa</p></ins>
<ins id="I2"><p>Agroscope Changins-W&#228;denswil ACW, Division of Plant Protection, W&#228;denswil, Switzerland</p></ins>
</insg>
<source>BMC Genomics</source>
<issn>1471-2164</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>1</issue>
<fpage>576</fpage>
<url>http://www.biomedcentral.com/1471-2164/12/576</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-12-576</pubid><pubid idtype="pmpid">22115407</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>8</day><month>8</month><year>2011</year></date></rec><acc><date><day>24</day><month>11</month><year>2011</year></date></acc><pub><date><day>24</day><month>11</month><year>2011</year></date></pub></history><cpyrt><year>2011</year><collab>De Maayer et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>The Type VI secretion apparatus is assembled by a conserved set of proteins encoded within a distinct locus. The putative effector proteins Hcp and VgrG are also encoded within these loci. We have identified numerous distinct Type VI secretion system (T6SS) loci in the genomes of several ecologically diverse <it>Pantoea </it>and <it>Erwinia </it>species and detected the presence of putative effector islands associated with the <it>hcp </it>and <it>vgrG </it>genes.</p>
</sec>
<sec><st><p>Results</p></st>
<p>Between two and four T6SS loci occur among the <it>Pantoea </it>and <it>Erwinia </it>species. While two of the loci (T6SS-1 and T6SS-2) are well conserved among the various strains, the third (T6SS-3) locus is not universally distributed. Additional orthologous loci are present in <it>Pantoea </it>sp. aB-valens and <it>Erwinia billingiae </it>Eb661. Comparative analysis of the T6SS-1 and T6SS-3 loci showed non-conserved islands associated with the <it>vgrG </it>and <it>hcp</it>, and <it>vgrG </it>genes, respectively. These regions had a G+C content far lower than the conserved portions of the loci. Many of the proteins encoded within the <it>hcp </it>and <it>vgrG </it>islands carry conserved domains, which suggests they may serve as effector proteins for the T6SS. A number of the proteins also show homology to the C-terminal extensions of evolved VgrG proteins.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Extensive diversity was observed in the number and content of the T6SS loci among the <it>Pantoea </it>and <it>Erwinia </it>species. Genomic islands could be observed within some of T6SS loci, which are associated with the <it>hcp </it>and <it>vgrG </it>proteins and carry putative effector domain proteins. We propose new hypotheses concerning a role for these islands in the acquisition of T6SS effectors and the development of novel evolved VgrG and Hcp proteins.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>The Type VI secretion system (T6SS) was first identified in the human pathogens <it>Pseudomonas aeruginosa </it>and <it>Vibrio cholerae </it>and was shown in the latter to function as an injectisome for the delivery of pathogenicity effector proteins into the host cell <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. T6SS have since been identified in genome sequences of many Gram negative bacteria, including those of several animal and plant pathogens, but also in symbiotic and free-living bacteria <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. This has led to the speculation that the T6SS is involved in other non-pathogenic functions including inter-bacterial communication, regulating biofilm formation and environmental stress response <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. In the animal pathogens <it>Salmonella enterica </it>subsp. <it>enterica </it>serovar Typhimurium and <it>Helicobacter hepaticus </it>mutational analysis indicated that the T6SS suppresses virulence and promotes replication in host cells, thus contributing to long-term colonization <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. In other bacteria, including the animal pathogen <it>Yersinia pestis </it>and the phytopathogen <it>Pectobacterium atrosepticum</it>, this increased proliferation has been hypothesized to result in increased fitness and, subsequently, increased virulence <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The T6SS of the plant symbiont <it>Rhizobium leguminosarum </it>has been linked to host-specificity, where a T6SS deletion mutant of a strain restricted to <it>Trifolium subterraneum </it>(clover cv. Woogenellup) acquired the ability to form nitrogen-fixing nodules on the non-host <it>Pisum sativum </it>(pea) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Several authors have identified an antibacterial role for the T6SS. A secreted effector, Tse2, in <it>P. aeruginosa </it>has been shown to have toxic effects on other bacteria <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Furthermore, the T6SS effectors Tse1 and Tse3 represent lytic enzymes that degrade peptidoglycan in the cell walls of closely related Gram negative bacteria <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Similarly, bactericidal functions have been ascribed for T6SSs in <it>V. cholerae </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and <it>S. enterica </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The availability of more genome sequences and more experimental data will facilitate a greater understanding of new and known functions of the T6SS. Several bacteria encode multiple T6SS loci on their genomes. For example, the genomes of <it>P. aeruginosa</it>, <it>Y. pestis </it>and <it>Burkholderia pseudomallei </it>encode three, four and six T6SS loci, respectively <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. These seemingly non-paralogous loci, together with the divergent functions of the T6SS suggests that this secretory system could influence a variety of interactions within a single species, with various hosts, and/or with other bacteria occupying the same niche <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
<p>While the biological roles of the T6SS in many Gram negative bacteria still need to be elucidated, other aspects of this secretory system, including evolutionary, genetic, structural and regulatory facets are becoming better understood. T6SS loci are generally comprised of 15-25 genes and include a core of 16 conserved proteins, which assemble the T6SS apparatus in the cell membrane <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. These conserved proteins include IcmF (COG3523) and DotU (COG3455), which have homology to proteins associated with the type IV secretion system and are thought to stabilize the T6SS apparatus in the cytoplasmic membrane <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. An AAA+ ATPase, ClpV (COG0542) may energize the assembly of the apparatus. Other conserved proteins include the DUF770 (COG3516) and DUF877 (COG3517) proteins, which are predicted to function as chaperones and the COG3521 outer membrane lipoprotein <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The outer component of the T6SS apparatus is comprised of two proteins, VgrG (COG3501) and Hcp (COG3157), which have also been identified as secreted effectors of the T6SS <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. These proteins resemble both structurally and genetically the components of the T4 bacteriophage tail spike, suggesting that the VgrG-Hcp combination forms a similar membrane penetrating structure and are evolutionarily related <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Hcp proteins form hexameric rings that are conceivably stacked to form a tube penetrating the outer membrane through which proteins are transported into the extracellular space <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The VgrG protein is postulated to puncture the outer cell membrane to allow extrusion of the Hcp tube and subsequently breach the host cell membrane <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
<p>VgrG proteins have been demonstrated to be secreted by the T6SS and are themselves required for the secretion of other effector proteins, such as the Tse3 effector in <it>P. aeruginosa </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. However, a VgrG protein in this organism has been shown to be secreted in a T6SS-independent manner <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. This indicates that the VgrG proteins are distinct and may perform diverse functions. Some VgrG proteins have been found to carry C-terminal extensions, which carry conserved domains of various predicted functions. The N-terminal of these "evolved VgrGs" may serve in assembly of the T6SS machinery, while the C-terminal portion functions as effector <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. One such evolved VgrG in <it>V. cholerae </it>(VgrG-1) carries an actin-binding domain in its C-terminal extension which has been shown to be involved in host cell toxicity <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The C-terminal extension of a VgrG in <it>S. enterica </it>subsp. <it>arizonae </it>(IIIa) carries an S-type pyocin domain which has been proposed to function in bacterial cell killing <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Over 500 VgrG orthologs have been identified in bacterial genome sequences with many of these representing evolved VgrGs. In a number of organisms for which genome sequences are available, additional Hcp and VgrG paralogs with a function in T6SS, termed "orphan" Hcps and VgrGs are encoded by genes located on the genome separately from the T6SS loci <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B21">21</abbr></abbrgrp>.</p>
<p>The genera <it>Pantoea </it>and <it>Erwinia </it>include ecologically diverse species, including the plant pathogens <it>Erwinia amylovora </it>and <it>Pantoea stewartii </it>subsp. <it>stewartii</it>, which are linked to devastating diseases of rosaceous plants and maize, respectively <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. The opportunistic plant pathogen <it>Pantoea ananatis </it>causes disease on a broad range of host plants and has been associated with human bacteremia <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Several <it>Erwinia </it>species, including <it>Erwinia billingiae </it>and <it>Erwinia tasmaniensis</it>, are found as part of the normal epiphytic microflora of plants <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Strains of <it>Pantoea agglomerans </it>and <it>Pantoea vagans </it>have recently received attention as potent biological control agents <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Strains of both <it>Pantoea </it>and <it>Erwinia </it>have also been isolated from insects <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. The complete and draft genomes of several <it>Pantoea </it>and <it>Erwinia </it>species have recently been sequenced, which provide an extensive base to identify genomic differences that may be linked to the divergent biological and ecological phenotypes of these species. We used the genome sequences to identify the T6SS in these organisms. An in depth analysis was undertaken which showed that not all T6SS loci are universal to the sequenced <it>Pantoea </it>and <it>Erwinia </it>species and that there are notable anomalies between some of the loci that are conserved among the compared organisms. We show that these differences occur in putative genomic islands associated with the Hcp and VgrG proteins and postulate that these islands represent regions of rapid and extensive evolution, which may drive functional diversification of the T6SS.</p>
</sec>
<sec><st><p>Results and Discussion</p></st>
<sec><st><p>Several orthologous T6SS loci occur in <it>Pantoea </it>and <it>Erwinia </it>species</p></st>
<p>By means of baiting with the conserved proteins from several characterized T6SS in other bacteria, between two and four T6SS loci were identified in the complete or draft genome sequences of five <it>Pantoea </it>and four <it>Erwinia </it>strains (Figure <figr fid="F1">1</figr>). These strains were the <it>Eucalyptus </it>pathogen <it>P. ananatis </it>LMG 20103 <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, the biological control strains <it>P. vagans </it>C9-1 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and <it>P. agglomerans </it>E325 (Smits and Duffy, unpublished results), the insect-associated strains <it>Pantoea </it>sp. aB-valens <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and <it>Pantoea </it>sp. At-9b <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> which represent novel species of the genus <it>Pantoea</it>, the fire blight pathogens <it>E. amylovora </it>CFBP 1430 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and <it>E. pyrifoliae </it>DSM 12163<sup>T </sup><abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and the apple epiphytes <it>E. tasmaniensis </it>Et1/99 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and <it>E. billingiae </it>Eb661 <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Two orthologous loci are found in all <it>Pantoea </it>and <it>Erwinia </it>strains, while a more restricted distribution could be observed for the T6SS-3 locus (Figure <figr fid="F1">1</figr>). Additional T6SS loci were identified in the genome sequences of <it>E. billingiae </it>Eb661 and <it>Pantoea </it>sp. aB-valens, respectively.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Distribution of T6SS loci among <it>Pantoea </it>and <it>Erwinia </it>species</p></caption><text>
   <p><b>Distribution of T6SS loci among <it>Pantoea </it>and <it>Erwinia </it>species</b>. The presence (+) or absence (-) of ortholgous T6SS loci for the various <it>Pantoea </it>and <it>Erwinia </it>strains are shown. A neighbor-joining tree (bootstrap n = 1,000) based on a ClustalW alignment of GyrB amino acid sequences indicates the phylogenetic relationship of the strains.</p>
</text><graphic file="1471-2164-12-576-1"/></fig>
<p>The T6SS-1 loci range in size from 28.8 and 40.0 kb and are built around a well-conserved and syntenous core which includes orthologs of all sixteen conserved T6SS proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> (Figure <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). Additional genes encode a serine/threonine protein kinase (PpkA) and serine/threonine phosphatase (PppA). These proteins interact with the Fork Head-Associated (FHA) domain protein (COG3456) for the post-translational regulation of the T6SS <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. In both <it>Pantoea </it>and <it>Erwinia </it>strains the conserved core T6SS proteins are arranged in two syntenous clusters (block I and III - Figure <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). These are separated by a variable region (block II), which is linked to the <it>hcp </it>gene (COG3157). A further variable region occurs next to the <it>vgrG </it>gene (block IV). In all sequenced <it>Pantoea </it>and <it>Erwinia </it>genomes, with the exception of <it>E. pyrifoliae </it>DSM 12163<sup>T</sup>, this variable region is flanked at the 3' end by a gene encoding a re-arrangement hot spot (<it>rhs</it>) element <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In <it>P. agglomerans </it>E325, <it>E. amylovora </it>CFBP 1430, <it>E. pyrifoliae </it>DSM 12163<sup>T </sup>and <it>E. tasmaniensis </it>Et1/99, an additional copy of the <it>vgrG </it>gene occurs within this variable region. A non-conserved region harboring two hypothetical genes, which is not associated with a <it>vgrG </it>or <it>hcp </it>gene could be observed between COG3520 and <it>clpV </it>in <it>E. tasmaniensis </it>Et1/99 and <it>E. pyrifoliae </it>DSM 12163<sup>T </sup>(Figure <figr fid="F3">3</figr>).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>The T6SS-1 loci of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430.</p></caption><text>
   <p><b>The T6SS-1 loci of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430</b>. The conserved regions (block I and III) are shaded in gray, while the non-conserved <it>hcp </it>and <it>vgrG </it>islands are not shaded. Genes encoding conserved domain proteins are represented by green arrows while the grey arrows indicate other genes conserved among the <it>Pantoea </it>and <it>Erwinia </it>T6SS-1 loci which were not identified as part of the conserved core described by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Red arrows represent the <it>hcp </it>and <it>vgrG </it>genes while genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species are colored in white. Graphs show the G+C content (%) (window size = 50 bp, step = 10 bp) in the respective T6SS-1 loci.</p>
</text><graphic file="1471-2164-12-576-2"/></fig>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>The orthologous T6SS-1 loci in <it>Pantoea </it>and <it>Erwinia </it>species</p></caption><text>
   <p><b>The orthologous T6SS-1 loci in <it>Pantoea </it>and <it>Erwinia </it>species</b>. The conserved regions (block I and III) are shaded in gray, while the non-conserved <it>hcp </it>and <it>vgrG </it>islands are not shaded. Genes encoding conserved domain proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> are represented by green arrows, and grey arrows indicate other genes conserved among the <it>Pantoea </it>and <it>Erwinia </it>T6SS-1 loci which were not identified as part of the conserved core described by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Red arrows represent the <it>hcp </it>and <it>vgrG </it>genes while genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species are colored in white.</p>
</text><graphic file="1471-2164-12-576-3"/></fig>
<p>The T6SS-2 loci of the <it>Pantoea </it>strains and <it>E. billingiae </it>Eb661 are structurally and genetically conserved and consist of eight syntenous genes which encode proteins that are highly conserved among the sequenced strains (Figure <figr fid="F4">4</figr>). Only four of these belong to conserved T6SS proteins outlined by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, suggesting the T6SS-2 loci encodes a partial and potentially non-functional T6SS. An average amino acid identity of 52% could be observed between the proteins encoded in the T6SS-2 locus and their respective paralogs in the T6SS-1 locus for each of the strains. This suggests that the T6SS-2 locus could have arisen through a partial duplication of the T6SS-1 locus. This is supported by phylogenetic analysis with the IcmF (COG3523) protein sequences (Figure <figr fid="F5">5</figr>), where the T6SS-2 loci branch from the T6SS-1 loci. The <it>E. amylovora </it>CFBP 1430 and <it>E. pyrifoliae </it>DSM12613<sup>T </sup>T6SS-2 loci encode only four proteins, lacking the genes encoding the serine/threonine kinase and phosphatase required for posttranslational regulation, while an FHA domain (COG3456) protein is encoded in these loci. The <it>icmF </it>gene in the <it>E. pyrifoliae </it>DSM12613<sup>T </sup>T6SS-2 locus is inactivated by a frameshift <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, while the <it>E. tasmaniensis </it>Et1/99 T6SS-2 locus is missing the FHA domain gene, indicating further gene loss in these T6SS loci (Figure <figr fid="F4">4</figr>).</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>The orthologous T6SS-2 loci in <it>Pantoea </it>and <it>Erwinia </it>species</p></caption><text>
   <p><b>The orthologous T6SS-2 loci in <it>Pantoea </it>and <it>Erwinia </it>species</b>. Genes encoding proteins with the conserved domains identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> are represented by green arrows while the grey arrows indicate other genes conserved among the <it>Pantoea </it>and <it>Erwinia </it>T6SS-2 loci which were not identified as part of the conserved core. White arrows represent the genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species.</p>
</text><graphic file="1471-2164-12-576-4"/></fig>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Phylogenetic relationships between the T6SS loci among the <it>Pantoea and Erwinia </it>species</p></caption><text>
   <p><b>Phylogenetic relationships between the T6SS loci among the <it>Pantoea and Erwinia </it>species</b>. A neighbor-joining tree (bootstrap n = 1,000; Poisson correction; Complete gap deletion) was constructed based on a ClustalW alignment of the IcmF (COG3523) amino acid sequences from the <it>Pantoea </it>and <it>Erwinia </it>T6SS loci and orthologous loci in several closely related <it>Enterobacteriaceae</it>.</p>
</text><graphic file="1471-2164-12-576-5"/></fig>
<p>The <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 loci are between 18.8 and 34.6 kb in size and encompass 19 to 32 protein coding sequences. In contrast to the T6SS-1 and -2 loci, the T6SS-3 locus is not universal to all <it>Pantoea </it>and <it>Erwinia </it>strains (Figure <figr fid="F1">1</figr>). The phylogeny based on the IcmF protein shows a greater evolutionary distance between the <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 loci, which are interspersed with those of other <it>Enterobacteriaceae </it>(Figure <figr fid="F5">5</figr>), indicating that this locus may have been acquired through horizontal gene transfer. This can also be correlated with greater diversity in gene content and order between the various <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 loci (Figure <figr fid="F6">6</figr> and <figr fid="F7">7</figr>). Thirteen of the T6SS conserved core proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> are encoded within the T6SS loci. Genes encoding orthologs of the COG3913, COG4455 and FHA domain (COG3456) proteins are absent. No PpkA or PppA orthologs are encoded in the T6SS-3 loci either, suggesting that this locus is not post-translationally regulated. As in the case of the T6SS-1 loci, syntenous blocks of core genes can be observed in the T6SS-3 loci (block I, III and V - Figure <figr fid="F6">6</figr> and <figr fid="F7">7</figr>). Here, the <it>hcp </it>gene is included in a conserved block (Figure <figr fid="F6">6</figr> and <figr fid="F7">7</figr>). The non-conserved regions in the T6SS-3 loci are generally associated with a <it>vgrG </it>gene. An additional complete and two partial <it>vgrG </it>genes can be found in the T6SS-3 locus of <it>E. amylovora </it>CFBP 1430. An additional non-conserved region, encoding two predicted proteins, PANA_4135 and PANA_4136 can be observed in the <it>P. ananatis </it>LMG 20103 T6SS-3 locus (Figure <figr fid="F7">7</figr>).</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>The T6SS-3 loci of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430</p></caption><text>
   <p><b>The T6SS-3 loci of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430</b>. The conserved regions (block I, III and V) are shaded in gray, while non-conserved regions are not shaded. Genes encoding conserved domain proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> represented by green arrows while the grey arrow indicates a gene conserved among the <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 loci which was not identified as part of the conserved core. Red arrows represent the <it>hcp </it>and <it>vgrG </it>genes while genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species are colored in white. The graphs show the G+C content (%) (window size = 50 bp, step = 10 bp) in the respective T6SS-3 loci.</p>
</text><graphic file="1471-2164-12-576-6"/></fig>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>The orthologous T6SS-3 loci in <it>Pantoea </it>and <it>Erwinia </it>species</p></caption><text>
   <p><b>The orthologous T6SS-3 loci in <it>Pantoea </it>and <it>Erwinia </it>species</b>. The conserved regions (block I, III and V) are shaded in gray, while non-conserved regions are not shaded. Genes encoding conserved core proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> are represented by green arrows while the grey arrows indicate other genes conserved among the <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 loci which were not identified as part of the conserved core. Red arrows represent the <it>hcp </it>and <it>vgrG </it>genes while genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species are colored in white.</p>
</text><graphic file="1471-2164-12-576-7"/></fig>
<p>A further T6SS locus was identified in <it>Pantoea </it>sp. aB-valens (Figure <figr fid="F8">8</figr>). This 19.6 kb locus encodes thirteen conserved core proteins. The IcmF phylogeny shows this T6SS locus to be distantly related to all other <it>Pantoea </it>and <it>Erwinia </it>T6SS loci. Similarly, an additional partial T6SS locus is encoded on the <it>E. billingiae </it>Eb661 genome, which contains four T6SS core proteins. A non-conserved region associated with the <it>vgrG </it>gene is likewise present. This <it>E. billingiae </it>Eb661 T6SS locus is flanked at both the 5' (EbC_39320 - phage integrase) and 3' end (EbC_393430-39600 - hypothetical phage genes) by genes of an integrated phage element. A phage origin has previously been hypothesized for the T6SS <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Additional T6SS loci in <it>Pantoea </it>sp. aB-valens and <it>Erwinia billingiae </it>Eb661</p></caption><text>
   <p><b>Additional T6SS loci in <it>Pantoea </it>sp. aB-valens and <it>Erwinia billingiae </it>Eb661</b>. Genes encoding proteins with the conserved domains identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> are indicated by green arrows. Red arrows represent the <it>hcp </it>and <it>vgrG </it>genes while genes not conserved among the <it>Pantoea </it>and <it>Erwinia </it>species are colored in white.</p>
</text><graphic file="1471-2164-12-576-8"/></fig>
</sec>
<sec><st><p>Different VgrG proteins are encoded in the T6SS-1, T6SS-3 loci and additional T6SS loci</p></st>
<p>The amino acid sequences of all VgrG proteins encoded in the T6SS-1, T6SS-3 and additional T6SS loci in <it>E. billingiae </it>and <it>Pantoea </it>sp. aB-valens were analyzed for the presence of conserved and evolved domains. The N-terminal region of all T6SS-1, T6SS-3 and additional T6SS VgrG proteins is conserved among all <it>Pantoea </it>and <it>Erwinia </it>species and contains a conserved VgrG (TIGR3361 <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>) and phage Gp5 (pfam04717 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>) domain (Figure <figr fid="F9">9</figr>). However, as has been observed in other bacteria, the VgrG proteins differ considerably in length between the various <it>Pantoea </it>and <it>Erwinia </it>species. This can be attributed to variable C-terminal regions, which suggest that they represent evolved VgrG proteins. Seven of the thirteen <it>Pantoea </it>and <it>Erwinia </it>T6SS-1 VgrG proteins contain such C-terminal extensions (Figure <figr fid="F10">10</figr>). Analysis of these C-terminal extensions showed that several of them contain conserved domains. The <it>P. agglomerans </it>E325 T6SS-1 VgrG2 protein (Pagg_1105) contains a peptidoglycan (PG)-binding domain (Pfam09374 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> - Cdsearch score: 45) as well as COG3926 and COG5526 lysozyme domains <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, which were also found in the lytic bacteriophage &#966;8, suggesting a bacteriolytic function for this VgrG protein. A similar function can be expected for the <it>Pantoea </it>sp. aB-valens T6SS-1 VgrG (PANABDRAFT_2668) which contains a &#946;-N-acetyl-glucosaminidase domain (COG4193 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> - Cdsearch score: 58.5) in its C-terminal extension. The <it>E. tasmaniensis </it>Et1/99 T6SS-1 VgrG2 (ETA_06370) C-terminal extension shows structural homology to the <it>Streptomyces </it>sp. N174 chitosanase (1_chk_A - Hhpred score: 190) <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
<fig id="F9"><title><p>Figure 9</p></title><caption><p>Conserved and non-conserved domains in the VgrG proteins of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430</p></caption><text>
   <p><b>Conserved and non-conserved domains in the VgrG proteins of <it>P. ananatis </it>LMG 20103 and <it>E. amylovora </it>CFBP 1430</b>. The conserved VgrG domains are colored in blue, Gp5 domains are in black, COG4253 domains in the T6SS-3 VgrG proteins in yellow and the non-conserved C-terminal extensions are colored in red. The amino acid locations of each domain are shown. The graphs show G+C contents (window size = 50 bp, step = 10 bp) in the respective <it>vgrG </it>genes.</p>
</text><graphic file="1471-2164-12-576-9"/></fig>
<fig id="F10"><title><p>Figure 10</p></title><caption><p>Phylogeny of the <it>Pantoea </it>and <it>Erwinia </it>VgrG proteins</p></caption><text>
   <p><b>Phylogeny of the <it>Pantoea </it>and <it>Erwinia </it>VgrG proteins</b>. Neighbor-joining tree showing the phylogenetic relationship between the VgrG proteins among the various <it>Pantoea and Erwinia </it>species. The tree (bootstrap n = 1,000; Poisson correction; Complete gap deletion) was constructed based on a ClustalW alignment of the VgrG amino acid sequences. "Evolved" VgrG proteins are indicated in brackets.</p>
</text><graphic file="1471-2164-12-576-10"/></fig>
<p>The C-terminal extensions of VgrG proteins encoded in the T6SS-3 loci of all <it>Pantoea </it>and <it>Erwinia </it>species contain a conserved domain of unknown function (COG4253), which is absent in the T6SS-1 loci VgrG proteins (Figure <figr fid="F9">9</figr>). This domain has also been identified flanking the N-terminal region of the <it>S. enterica </it>subsp. <it>arizonae </it>IIIa VgrG proteins, while it is not present in the VgrG proteins of other <it>Salmonella </it>serovars <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. A role for the COG4253 domain, which links the N-terminal VgrG transporter and C-terminal extension, in modulating the function of the VgrG between secretion and virulence has been suggested <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. However, with the exception of the <it>P. agglomerans </it>E325 T6SS-3 VgrG protein, which carries a C-terminal extension subsequent to the COG4253 region, no further C-terminal extensions are present in the <it>Pantoea </it>and <it>Erwinia </it>T6SS-3 VgrG proteins. An alternative function in host cell adhesion has also been suggested <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The latter function would imply that the COG4253-positive T6SS-3 and COG4253-negative T6SS-1 VgrG proteins could have different targets and serve different functions. A phylogeny based on the T6SS-1 and T6SS-3 VgrG proteins shows that they branch into two distinct clades suggesting distinct evolutionary backgrounds for these paralogous proteins (Figure <figr fid="F10">10</figr>). The VgrG protein found in the additional T6SS locus in <it>E. billingiae </it>Eb661 also contains a conserved COG4253 domain and clusters with the T6SS-3 VgrG proteins, while the additional <it>Pantoea </it>sp. aB-valens VgrG paralog clusters with the other VgrG proteins lacking this domain.</p>
</sec>
<sec><st><p>Non-conserved <it>hcp</it>/<it>vgrG </it>regions represent islands within the T6SS loci</p></st>
<p>While the T6SS-1 and T6SS-3 loci share conserved and syntenous cores among the various <it>Pantoea </it>and <it>Erwinia </it>strains, considerable variability in the <it>vgrG </it>and <it>hcp </it>regions can be observed. The G+C deviation across these T6SS loci was determined. This showed that for the T6SS-1 and T6SS-3 loci, there is a much lower G+C content in the variable regions associated with the <it>hcp </it>and <it>vgrG </it>genes compared to the conserved core regions (Figures <figr fid="F2">2</figr> and <figr fid="F4">4</figr>). In the T6SS-1 loci, the conserved core regions had an average G+C content of 60.17% across all strains, while the <it>hcp </it>regions (average G+C% = 42.30) and <it>vgrG </it>regions (average G+C% = 47.91) had a much lower G+C content. The substantial G+C deviations, variability in the gene content of the <it>hcp </it>and <it>vgrG </it>regions and the differential homology of proteins encoded in these regions to proteins in bacteria in other genera indicates that these regions represent variable islands within the T6SS loci. Similar G+C deviations could be observed for the <it>vgrG </it>regions in the additional <it>Pantoea </it>sp. aB-valens and <it>E. billingiae </it>Eb661 T6SS loci, which further supports that these regions serve as "hot spots" for rearrangement <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. These hot spots are frequently associated with Rhs proteins which are capable of displacing its C-terminal tip and replacing it with a non-homologous alternative. By this means, the Rhs protein can drive the sequential insertion of heterogeneous C-terminal sequences into the hot spot <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. As the additional <it>E. billingiae </it>Eb661 T6SS locus occurs within an integrated phage element, we postulate that transducing phages may play a role in the horizontal acquisition of non-conserved genes in the <it>vgrG </it>and <it>hcp </it>regions. Similar <it>vgrG</it>/<it>hcp </it>islands have also been identified in a number of <it>Pseudomonas </it>species <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. These islands are associated with "orphan" <it>vgrG </it>and <it>hcp </it>paralogs separately located from the remainder of the T6SS loci. In contrast, this is the first observation of <it>hcp </it>and <it>vgrG </it>islands associated with the conserved core T6SS loci. Analysis of the T6SS loci of other bacteria (data not shown), however, shows that this phenomenon is not restricted to <it>Pantoea </it>and <it>Erwinia</it>.</p>
</sec>
<sec><st><p>Hcp/VgrG islands harbor putative effector proteins</p></st>
<p>The proteins encoded in the variable <it>hcp </it>and <it>vgrG </it>islands in the T6SS-1 and T6SS-3 loci in the different <it>Pantoea </it>and <it>Erwinia </it>species were analyzed for sequence similarity and structural homology to known proteins and the presence of conserved domains. The majority of proteins encoded on the islands showed homology to proteins of unknown function. However, a number of island proteins share high sequence identity and contain conserved domains which suggest they may represent T6SS effectors with putative functions in host-microbe and inter-bacterial interactions (Additional File <supplr sid="S1">1</supplr> Table S1).</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Table S1 Non-conserved proteins encoded in the T6SS <it>hcp </it>and <it>vgrG </it>islands in <it>Pantoea </it>and <it>Erwinia </it>species</b>. Table showing the Blast analysis of the proteins encoded in the <it>Pantoea </it>and <it>Erwinia hcp </it>and <it>vgrG </it>islands against the NCBI protein database <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, Conserved Domain Database using the CD-search server <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and the Protein Database (PDB) using the Hhpred server <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. For the BlastP analysis against the NCBI protein database, the highest informative Blast result outside of the <it>Pantoea/Erwinia </it>group was determined. The amino acid positions showing sequence identity, percentage amino acid (aa) identity, bit scores and BLAST E-values are shown. For the Blast analysis for conserved domains <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, the protein families and conserved orthologous groups are shown, as well as the bit scores and E-values. For the Hhpred analysis of the amino acid sequences for structural homology the Hhpred scores and E-values are shown.</p></text>
<file name="1471-2164-12-576-S1.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>The amino acid sequence of ETA_06430 encoded in the <it>E. tasmaniensis </it>Et1/99 T6SS-1 <it>vgrG </it>island shares homology with a phospholipase in <it>V. cholerae </it>HE48 (VCHE48_2681 - 40% aa identity) while that of E. <it>billingiae </it>Eb661 EbC_4130 encoded on the T6SS-3 <it>vgrG </it>island shows homology to a lipase/esterase in <it>Yersinia bercovieri </it>ATCC 43970 (Yberc0001_35290 -64% aa identity). The former also shows structural homology to a <it>P. aeruginosa </it>lipase (1ex9_A - Hhpred score: 115) while the latter shows structural homology to a lipase in <it>Penicillium expansum </it>(3g7n_A: Hhpred score: 157), supporting that these two proteins represent lipases. Similarly, EbC_39360 encoded in the additional T6SS of <it>E. billingiae </it>Eb661 shares weak structural homology with a lipase in <it>Archaeoglobus fulgidus </it>(2zyr_A - Hhpred score: 81.1). Lipases hydrolyze long-chain triglycerides into fatty acids and glycerol and have been shown to represent major virulence factor in both animal and plant pathogens <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Genes encoding lipases were also identified in the <it>vgrG </it>islands of <it>P. aeruginosa </it>and transcriptome analysis showed that their expression is co-regulated with that of the T6SS <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. These lipases may therefore represent T6SS-secreted effectors in <it>P. aeruginosa </it>as well as the <it>Erwinia </it>species. Several genes in the <it>vgrG </it>regions of both <it>Pantoea </it>and <it>Erwinia </it>T6SS-1 and T6SS-3 loci may encode proteases. The <it>P. agglomerans </it>E325 Pagg2200 and <it>E. tasmaniensis </it>Et1/99 ETA_6240 proteins share 28% and 32% aa identity respectively with a putative zinc protease in <it>Acinetobacter baumanii </it>(AbauB_010100012633) and show weak structural homology to a zinc metalloproteases in <it>Geobacter sulfurreducens </it>(3c37_A - Hhpred score: 47.3 and 51.3, respectively). Furthermore, the <it>P. agglomerans </it>E325 Pagg_1085, <it>E. tasmaniensis </it>Et1/99 ETA_6210 and <it>E. amylovora </it>CFBP 1430 EAMY_3018 protein sequences show weak structural homology to the secreted cysteine protease stathopain (1cv8_A - Hhpred score: 29.8 34.2 and 34.2, respectively), which plays a role in skin infection by <it>Staphylococcus aureus </it><abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. EbC_39340 and EbC_39350 localized in the <it>vgrG </it>island in the additional T6SS locus of <it>E. billingiae </it>Eb661 encode proteins with weak structural homology to the secreted <it>Clostridium perfringens </it>sialidase NanI (2bf6_A - Hhpred score: 46 and 44.3, respectively) which is involved in the removal of sialic acids from host glycoconjugates with an important role in bacterial nutrition and pathogenesis <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. PANA_4143 associated with the T6SS-3 in <it>P. ananatis </it>LMG 20103, shares high sequence homology with the M23 peptidase of <it>Klebsiella </it>sp. 92-3 (HMPREF9538_05689: 78% aa identity) and extensive structural homology to the secreted chitinase G of <it>Streptomyces coelicolor </it>of (1chk_A - Hhpred score: 203). Chitinases degrade chitin, the carbohydrate polymer found in insect shells and the cell walls of fungi, suggesting the T6SS of <it>P. ananatis </it>LMG 20103 may secrete an effector with an antifungal or insecticidal function <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
<p>Two proteins, PANA_2363 and Pagg_2194, encoded in the T6SS-1 and T6SS-3 <it>hcp </it>islands of <it>P. ananatis </it>LMG20103 and <it>P. agglomerans </it>E325, respectively contain conserved peptidoglycan binding (Pfam01474 - PG_binding_1) and LysM (Pfam01476) domains which are typically associated with proteins involved in bacterial cell wall degradation. The latter also shows sequence homology to a putative lytic enzyme in <it>Acinetobacter calcoaceticus </it>RUH2202 (HMPREF0012_02474 - 34% aa identity). Similarly, <it>E. pyrifoliae </it>DSM 12163<sup>T </sup>Epyr_0675 shares sequence homology with a <it>Edwardsiella tarda </it>FL6-60 lysozyme (ETAF_ple052 - 32% aa identity) and also contains a conserved peptidoglycan binding domain (Pfam09374 - PG_binding_3) and a domain of unknown function (DUF847) frequently observed in lysozymes. The <it>E. billingiae </it>Eb661 EbC_05851 protein also shares sequence homology with the <it>Pseudomonas </it>phage PaP1 endolysin (PaP1_gp072 - 46% aa identity). The presence of domains conserved in bacterial cell wall degrading enzymes and the sequence homology to these enzymes indicate that these proteins may play a similar role to the Tse1 and Tse3 lytic enzymes in <it>P. aeruginosa </it>in the degradation of the bacterial cell wall and may thus have a bactericidal function <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B48">48</abbr></abbrgrp>. Furthermore, PANA_4136 encoded in the additional non-conserved region in the <it>P. ananatis </it>LMG 20103 T6SS-3 locus shows extensive sequence homology to S-pyocin domain containing proteins in <it>E. coli </it>3030-1 (EC30301_3278 - 58% aa identity) and <it>Yersinia pseudotuberculosis </it>IP 31758 (YpsIP31758_0897 - 53% aa identity) and shows weak structural homology to the colicin S4 of <it>Escherichia coli </it>(3few_X - Hhpred score: 45.5). Similarly, weak structural homology to colicin S4 can be observed for the T6SS <it>hcp </it>island <it>P. vagans </it>C9-1 Pvag_1032, <it>Pantoea </it>sp. At-9b Pat9b_2515 and <it>Pantoea </it>sp. aB-valens PANABDRAFT_2446 proteins (3few_X - Hhpred score: 60, 58.5 and 58.4, respectively). Colicins and pyocins are bacteriocins that are involved in killing closely related bacterial species <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. The presence of bacteriocin-like proteins in the T6SS loci of these <it>Pantoea </it>species agrees with the finding of a potential function for the T6SS in antibiosis and competition <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
</sec>
<sec><st><p><it>Pantoea </it>and <it>Erwinia </it>VgrG and Hcp proteins may carry effector proteins encoded in the <it>vgrG </it>and <it>hcp </it>islands</p></st>
<p>Analysis of the G+C contents of the <it>Pantoea </it>and <it>Erwinia </it>T6SS-1 <it>vgrG </it>genes shows that the N-terminal regions, which contain the conserved VgrG and Gp5 domains, have an average G+C content of 63.68%, which is similar to the conserved core of the T6SS-1 loci. By contrast, the C-terminal extensions have an average G+C content of 46.59%. This is similar to the non-conserved <it>vgrG </it>island, suggesting that the C-terminal extensions form part of the <it>vgrG </it>islands. In the T6SS-3 loci, the conserved N-terminal region of the <it>vgrG </it>genes has an average G+C content of 59.01% while the <it>vgrG </it>islands have an average G+C content of 48.38%.</p>
<p>Some of the proteins encoded in the <it>vgrG </it>islands show sequence homology and contain conserved domains found in the C-terminal extensions of evolved VgrG proteins. The <it>E. pyrifoliae </it>DSM 12163<sup>T </sup>T6SS-1 Epyr_00675 amino acid sequence shares 75% aa identity with the C-terminal region of the <it>P. agglomerans </it>E325 T6SS-1 evolved VgrG (Pagg_1105). The <it>P. ananatis </it>LMG 20103 T6SS-1 PANA_2363 and the C-terminal region of the evolved T6SS-1 VgrG of <it>P. agglomerans </it>E325 (Pagg_1105) also each contain a PG-binding domain and a LysM domain (Pfam01476) found in lytic enzymes, and show structural homology a cell wall degrading lysozyme in <it>Pseudomonas </it>phage Phikz (2kbh_A - Hhpred score: 128 and 44.5, respectively). BlastP analysis with the proteins encoded in the <it>vgrG </it>islands against the NCBI protein database showed that proteins encoded in the <it>vgrG </it>islands of several <it>Pantoea </it>and <it>Erwinia </it>species show substantial sequence identity to the C-terminal extensions in VgrG in bacteria belonging to other genera (Additional File <supplr sid="S1">1</supplr> Table S1). The putative lytic enzymes encoded in the T6SS-1 <it>vgrG </it>islands of <it>P. ananatis </it>LMG 20103 (PANA_2363) and <it>E. billingiae </it>Eb661 (EbC_05851) share homology with the C-terminal extensions in the VgrG proteins of <it>V. cholerae </it>TMA21 (VCB_002278 - 41% aa identity) and <it>V. cholerae </it>CT 5369-63 (VIH_000452 - 49% aa identity), respectively. The putative zinc proteases encoded in the <it>P. agglomerans </it>E325 T6SS-3 <it>vgrG </it>island (Ppag_2200) and the <it>E. tasmaniensis </it>Et1/99 T6SS-1 <it>vgrG </it>island (ETA_06240) share 43 and 46% aa identity with the C-terminal extension of a <it>Burkholderia </it>sp. 383 VgrG protein (Bcep18194_C7612). Similarly, the putative <it>E. tasmaniensis </it>Et1/99 lipase (ETA_06430) shows 42% aa identity to the <it>Burkholderia glumae </it>BGR1 VgrG (Bglu_2g02560). The sequence homology and the presence of shared conserved domains between the C-terminal extensions of evolved VgrG proteins and putative effector proteins encoded in the <it>vgrG </it>islands, suggest that VgrG proteins may carry effector proteins encoded in the <it>vgrG </it>islands. The COG4253 conserved domain found at the C-terminal end of all the T6SS-3 VgrG proteins may be involved in the anchorage of the VgrG transporter to the effector proteins, thereby modulating the function of the VgrG protein as a structural component of the T6SS apparatus and the transport of effector proteins <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. This domain is absent in the T6SS-1 loci, and the means by which effectors become associated with the VgrG transport region would need to be determined.</p>
<p>In the fish pathogen <it>Edwardsiella tarda</it>, the secreted effector protein EvpP has been shown to interact with the Hcp protein <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Furthermore, Hcp orthologs with C-terminal extensions have been identified in <it>S. enterica</it>, which may represent evolved Hcp proteins <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. It is therefore plausible that, as is the case for the <it>Pantoea </it>and <it>Erwinia </it>VgrG proteins, the Hcp proteins may transport effector proteins encoded on the <it>hcp </it>islands. By this means, various effector proteins could be tagged to the VgrG and Hcp proteins, thereby forming different VgrG-effector and Hcp-effector combinations, which may perform different biological functions. The VgrG or Hcp proteins could transport bacteriocins or pyocins, which would allow the bacterium to target other bacteria competing for an ecological niche. A chitinase effector carried by the VgrG protein could play an antifungal or insecticidal role in the <it>Pantoea </it>and <it>Erwinia </it>species. These and other putative pathogenicity factors encoded by hypothetical proteins in the <it>hcp </it>and <it>vgrG </it>islands translocated by the VgrG and Hcp proteins may therefore enable <it>Pantoea </it>and <it>Erwinia </it>species to target a range of bacterial, invertebrate, vertebrate and or plant hosts.</p>
</sec>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Comparative analysis of the genome sequences of several <it>Pantoea </it>and <it>Erwinia </it>species revealed that they encode between two and four T6SS loci. This suggests an important biological role for this secretion system in these two genera. Two of the T6SS loci, T6SS-1 and T6SS-2 are shared among all <it>Pantoea </it>and <it>Erwinia </it>strains, while orthologs of the third locus are only found in four of five <it>Pantoea </it>species and two of four <it>Erwinia </it>species, suggesting acquisition by horizontal gene transfer of this locus. <it>Pantoea </it>sp. aB-valens and <it>E. billingiae </it>Eb661 encode additional T6SS loci. Analysis of the T6SS-1, T6SS-3 and additional loci in <it>Pantoea </it>sp. aB-valens and <it>E. billingiae </it>Eb661 showed that while synteny is conserved among the <it>Pantoea </it>and <it>Erwinia </it>species for each locus, non-conserved regions could be observed associated with the <it>hcp </it>and <it>vgrG </it>genes. The G+C contents of these non-conserved regions differ substantially from the conserved portions of the loci, indicating horizontal acquisition of these regions separate from the rest of the T6SS loci. Several of the VgrG proteins encoded in the loci have a C-terminal extension and represent evolved VgrG proteins. These C-terminal extensions likewise have lower G+C contents than the remainder of the T6SS suggesting they form part of the <it>vgrG </it>islands.</p>
<p>Many of the proteins encoded in the <it>vgrG </it>and <it>hcp </it>islands carry conserved domains and show sequence and structural homology to proteins with various biological functions including antibiosis, fungal cell wall degradation and putative roles in animal and plant pathogenesis. We postulate that the <it>vgrG </it>and <it>hcp </it>islands may represent evolutionary hot spots for genes that encode effector proteins. Similar rearrangement hot spots have been observed in the regions adjacent to the <it>hcp </it>and <it>vgrG </it>genes in the T6SS loci of other bacteria <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B36">36</abbr></abbrgrp>, suggesting that this is a more widespread phenomenon. The sequence similarity and structural homology of some of these putative effector proteins to C-terminal extensions in characterized evolved VgrG proteins suggest that they may become tagged to the conserved core VgrG proteins which serve as transporters for these effectors, thereby forming new evolved VgrG proteins. Similarly, putative effectors encoded in the <it>hcp </it>islands may become associated with the Hcp proteins to form evolved Hcp proteins. We could therefore speak of "evolving" Hcp and VgrG proteins. By tagging various effector proteins different VgrG-effector and Hcp-effector combinations could be formed which may perform different biological functions. Thereby, the genomic islands associated with the Hcp and VgrG proteins could drive functional diversification of the T6SS, which may explain the plethora of biological roles described for this secretion system.</p>
</sec>
<sec><st><p>Methods</p></st>
<sec><st><p><it>In silico </it>identification of the T6SS loci in <it>Pantoea </it>and <it>Erwinia</it></p></st>
<p>The T6SS loci in five <it>Pantoea </it>and four <it>Erwinia </it>species, for which complete or draft genome sequences are available, were identified: <it>Pantoea ananatis </it>LMG 20103 <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, <it>Pantoea vagans </it>C9-1 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, <it>Pantoea agglomerans </it>E325 (Smits and Duffy, unpublished results), <it>Pantoea </it>sp. aB-valens <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, <it>Pantoea </it>sp. At-9b <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, <it>Erwinia amylovora </it>CFBP 1430 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, <it>Erwinia pyrifoliae </it>DSM 12163<sup>T </sup><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, <it>Erwinia tasmaniensis </it>Et1/99 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and <it>Erwinia billingiae </it>Eb661 <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Identification of the T6SS loci was done by BlastP analysis <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> with the conserved core proteins identified by Boyer et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> against local protein databases created for the <it>Pantoea </it>and <it>Erwinia </it>strains. Proteins neighboring <it>Pantoea </it>and <it>Erwinia </it>T6SS clusters were compared using BlastP against the NCBI protein database <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> to identify to full extent of the T6SS loci. Sequence manipulations were conducted with multiple subroutines of the LASERGENE package (DNASTAR, Madison, WI, USA).</p>
</sec>
<sec><st><p>Phylogenetic analyses</p></st>
<p>Phylogenetic analyses were done using the procedures outlined by Bingle et al. <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. A ClustalW alignment with default parameters was performed with the IcmF (COG3523) amino acid sequences, as this represents the only protein conserved among all <it>Pantoea </it>and <it>Erwinia </it>T6SS loci. A phylogenetic tree was constructed with the Molecular Evolutionary Genetics Analysis (MEGA) v. 5.0.3 software package <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, using the neighbor-joining method, with Poisson correction, complete gap deletion and bootstrapping (n = 1,000) parameters. The IcmF amino acid sequences from the T6SS loci of several closely related organisms as identified by BlastP analysis against the NCBI protein database <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> were included in the tree. This same procedure was employed to construct phylogenetic trees for Gyrase B (GyrB) and the VgrG proteins.</p>
</sec>
<sec><st><p>Analyses of the <it>hcp</it>/<it>vgrG </it>islands</p></st>
<p>The average G+C contents for the conserved core regions and <it>hcp </it>and <it>vgrG </it>islands of the T6SS loci were determined using the Bioedit v.7.0.5.3 package <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The G+C content was determined using 50 bp windows in 10 bp steps. Similarly, the G+C contents for the <it>vgrG </it>genes were determined for the conserved N-terminal region, which included the conserved Vgr and Gp5 domains, and for the C-terminal extension which was considered as all nucleotides located at the 3' end of the Gp5 domain. Proteins encoded in the <it>vgrG </it>and <it>hcp </it>islands were identified using the FgenesB <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> and Orf finder <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> web servers. The amino acid sequences for the proteins encoded in the <it>hcp </it>and <it>vgrG </it>islands were analyzed for sequence identity by BlastP analysis against the NCBI protein database <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> the presence of conserved domains by Blast analysis against the Conserved Domain Database (CDsearch) <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. Structural homology of the <it>vgrG </it>and <it>hcp </it>island proteins to those for which the chemical structure has been determined was performed using the HHpred server (<url>http://toolkit.tuebingen.mpg.de/hhpred</url>) <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>.</p>
</sec>
</sec>
<sec><st><p>Conflicting interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>PDM, SNV, BD, TAC and THMS conceived the study. PDM, TK and THMS performed experiments and analysis. PDM, SNV, BD, TAC, and THMS wrote the original manuscript. All authors read and approved the final version.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>This study was partially supported by the National Research Foundation (NRF), the Tree Protection Co-operative Programme (TPCP), the NRF/Dept. of Science and Technology Centre of Excellence in Tree Health Biotechnology (CTHB), and the THRIP support program of the Department of Trade and Industry, South Africa, the Swiss Federal Office for Agriculture (BLW Fire Blight Research - Pathogen), and the Swiss Secretariat for Education and Research (SBF C07.0038). It was conducted within the European Science Foundation funded research network COST Action 864 and the Swiss ProfiCrops program. We acknowledge Cameron Currie, Aaron Adams and the JGI for the use of the <it>Pantoea </it>sp. aB-valens genome.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>A virulence locus of <it>Pseudomonas aeruginosa </it>encodes a protein secretion apparatus</p></title><aug><au><snm>Mougous</snm><fnm>JD</fnm></au><au><snm>Cuff</snm><fnm>ME</fnm></au><au><snm>Raunser</snm><fnm>S</fnm></au><au><snm>Shen</snm><fnm>A</fnm></au><au><snm>Zhou</snm><fnm>M</fnm></au><au><snm>Giford</snm><fnm>CA</fnm></au><au><snm>Goodman</snm><fnm>AL</fnm></au><au><snm>Joachimiak</snm><fnm>G</fnm></au><au><snm>Ordo&#241;ez</snm><fnm>CL</fnm></au><au><snm>Lory</snm><fnm>S</fnm></au><au><snm>Walz</snm><fnm>T</fnm></au><au><snm>Joachimiak</snm><fnm>A</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>312</volume><fpage>1526</fpage><lpage>1530</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1128393</pubid><pubid idtype="pmcid">2800167</pubid><pubid idtype="pmpid" link="fulltext">16763151</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Identification of a conserved bacterial protein secretion system in <it>Vibrio cholerae </it>using the <it>Dictyostelium </it>host model system</p></title><aug><au><snm>Pukatzki</snm><fnm>S</fnm></au><au><snm>Sturtevant</snm><fnm>D</fnm></au><au><snm>Krastins</snm><fnm>B</fnm></au><au><snm>Sarracino</snm><fnm>D</fnm></au><au><snm>Nelson</snm><fnm>WC</fnm></au><au><snm>Heidelberg</snm><fnm>JF</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>1528</fpage><lpage>1533</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0510322103</pubid><pubid idtype="pmcid">1345711</pubid><pubid idtype="pmpid" link="fulltext">16432199</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Type VI Secretion: Not just for pathogenesis anymore</p></title><aug><au><snm>Jani</snm><fnm>AJ</fnm></au><au><snm>Cotter</snm><fnm>PA</fnm></au></aug><source>Cell Host Microbe</source><pubdate>2010</pubdate><volume>8</volume><fpage>2</fpage><lpage>6</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.chom.2010.06.012</pubid><pubid idtype="pmcid">2913581</pubid><pubid idtype="pmpid" link="fulltext">20638635</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>What is type VI secretion doing in all those bugs?</p></title><aug><au><snm>Schwarz</snm><fnm>S</fnm></au><au><snm>Hood</snm><fnm>RD</fnm></au><au><snm>Mougous</snm><fnm>JD</fnm></au></aug><source>Trends Microbiol</source><pubdate>2010</pubdate><volume>18</volume><fpage>531</fpage><lpage>537</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tim.2010.09.001</pubid><pubid idtype="pmpid" link="fulltext">20961764</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Anchoring the type VI secretion system to the peptidoglycan</p></title><aug><au><snm>Aschtgen</snm><fnm>M-S</fnm></au><au><snm>Thomas</snm><fnm>MS</fnm></au><au><snm>Cascales</snm><fnm>E</fnm></au></aug><source>Virulence</source><pubdate>2010</pubdate><volume>1</volume><fpage>535</fpage><lpage>540</lpage><xrefbib><pubidlist><pubid idtype="doi">10.4161/viru.1.6.13732</pubid><pubid idtype="pmpid" link="fulltext">21178498</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p><it>sciS</it>, an <it>icmF </it>homolog in <it>Salmonella enterica </it>serovar Typhimurium limits intracellular replication and decreases virulence</p></title><aug><au><snm>Parsons</snm><fnm>DA</fnm></au><au><snm>Heffron</snm><fnm>F</fnm></au></aug><source>Infect Immun</source><pubdate>2005</pubdate><volume>73</volume><fpage>4338</fpage><lpage>4345</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/IAI.73.7.4338-4345.2005</pubid><pubid idtype="pmcid">1168621</pubid><pubid idtype="pmpid" link="fulltext">15972528</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>A pathobiont of the microbiota balances host colonization and intestinal inflammation</p></title><aug><au><snm>Chow</snm><fnm>J</fnm></au><au><snm>Mazmanian</snm><fnm>SK</fnm></au></aug><source>Cell Host Microbe</source><pubdate>2010</pubdate><volume>7</volume><fpage>265</fpage><lpage>276</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.chom.2010.03.004</pubid><pubid idtype="pmcid">2859213</pubid><pubid idtype="pmpid" link="fulltext">20413095</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Quorum sensing coordinates brute force and stealth modes of infection in the plant pathogen <it>Pectobacterium atrosepticum</it></p></title><aug><au><snm>Liu</snm><fnm>H</fnm></au><au><snm>Coulthurst</snm><fnm>SJ</fnm></au><au><snm>Pritchard</snm><fnm>L</fnm></au><au><snm>Hedley</snm><fnm>PE</fnm></au><au><snm>Ravensdale</snm><fnm>M</fnm></au><au><snm>Burr</snm><fnm>T</fnm></au><au><snm>Takle</snm><fnm>G</fnm></au><au><snm>Brurberg</snm><fnm>M-B</fnm></au><au><snm>Birch</snm><fnm>PRJ</fnm></au><au><snm>Salmond</snm><fnm>GPC</fnm></au><au><snm>Toth</snm><fnm>IK</fnm></au></aug><source>PLoS Path</source><pubdate>2008</pubdate><volume>4</volume><fpage>e1000093</fpage><xrefbib><pubid idtype="doi">10.1371/journal.ppat.1000093</pubid></xrefbib></bibl><bibl id="B9"><title><p>Evaluation of a <it>Yersinia pestis </it>mutants impaired in a thermoregulated type VI-like secretion system in flea, macrophage and murine models</p></title><aug><au><snm>Robinson</snm><fnm>JB</fnm></au><au><snm>Telepnev</snm><fnm>MV</fnm></au><au><snm>Zudina</snm><fnm>IV</fnm></au><au><snm>Bouyer</snm><fnm>D</fnm></au><au><snm>Montenieri</snm><fnm>JA</fnm></au><au><snm>Bearden</snm><fnm>SW</fnm></au><au><snm>Gage</snm><fnm>KL</fnm></au><au><snm>Agaer</snm><fnm>SL</fnm></au><au><snm>Foltz</snm><fnm>SM</fnm></au><au><snm>Chauhan</snm><fnm>S</fnm></au><au><snm>Chopra</snm><fnm>AK</fnm></au><au><snm>Motin</snm><fnm>VL</fnm></au></aug><source>Microb Pathog</source><pubdate>2009</pubdate><volume>47</volume><fpage>243</fpage><lpage>251</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.micpath.2009.08.005</pubid><pubid idtype="pmcid">2767420</pubid><pubid idtype="pmpid" link="fulltext">19716410</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Infection-blocking genes of a symbiotic <it>Rhizobium leguminosarum </it>strain that are involved in temperature-dependent protein secretion</p></title><aug><au><snm>Bladergroen</snm><fnm>MR</fnm></au><au><snm>Badelt</snm><fnm>K</fnm></au><au><snm>Spaink</snm><fnm>HP</fnm></au></aug><source>Mol Plant-Microbe Interact</source><pubdate>2003</pubdate><volume>16</volume><fpage>53</fpage><lpage>64</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1094/MPMI.2003.16.1.53</pubid><pubid idtype="pmpid" link="fulltext">12580282</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>A type VI secretion system of <it>Pseudomonas aeruginosa </it>targets a toxin to bacteria</p></title><aug><au><snm>Hood</snm><fnm>RD</fnm></au><au><snm>Singh</snm><fnm>P</fnm></au><au><snm>Hsu</snm><fnm>F</fnm></au><au><snm>Carl</snm><fnm>MA</fnm></au><au><snm>Trinidad</snm><fnm>RRS</fnm></au><au><snm>Silverman</snm><fnm>JM</fnm></au><au><snm>Ohlson</snm><fnm>BB</fnm></au><au><snm>Hicks</snm><fnm>KG</fnm></au><au><snm>Plemel</snm><fnm>RL</fnm></au><au><snm>Li</snm><fnm>M</fnm></au><au><snm>Schwarz</snm><fnm>S</fnm></au><au><snm>Wang</snm><fnm>WY</fnm></au><au><snm>Merz</snm><fnm>AJ</fnm></au><au><snm>Goodlett</snm><fnm>DR</fnm></au><au><snm>Mougous</snm><fnm>JD</fnm></au></aug><source>Microbe</source><pubdate>2010</pubdate><volume>7</volume><fpage>25</fpage><lpage>37</lpage></bibl><bibl id="B12"><title><p>Type VI secretion delivers bacteriolytic effectors to target cells</p></title><aug><au><snm>Russell</snm><fnm>AB</fnm></au><au><snm>Hood</snm><fnm>RD</fnm></au><au><snm>Bui</snm><fnm>NK</fnm></au><au><snm>LeRoux</snm><fnm>M</fnm></au><au><snm>Vollmer</snm><fnm>W</fnm></au><au><snm>Mougous</snm><fnm>JD</fnm></au></aug><source>Nature</source><pubdate>2011</pubdate><volume>475</volume><fpage>343</fpage><lpage>347</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature10244</pubid><pubid idtype="pmpid" link="fulltext">21776080</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>The <it>Vibrio cholerae </it>type VI secretion system displays antimicrobial properties</p></title><aug><au><snm>MacIntyre</snm><fnm>DL</fnm></au><au><snm>Miyata</snm><fnm>ST</fnm></au><au><snm>Kitaoka</snm><fnm>M</fnm></au><au><snm>Pukatzki</snm><fnm>SL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2011</pubdate><volume>9</volume><fpage>6</fpage><lpage>10</lpage></bibl><bibl id="B14"><title><p>Comparative genomic analysis uncovers 3 novel loci encoding type six secretion systems differentially distributed in <it>Salmonella </it>serotypes</p></title><aug><au><snm>Blondel</snm><fnm>CJ</fnm></au><au><snm>Jim&#233;nez</snm><fnm>JC</fnm></au><au><snm>Contreras</snm><fnm>I</fnm></au><au><snm>Santiviago</snm><fnm>CA</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>1</fpage><lpage>17</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-1</pubid><pubid idtype="pmcid">2671524</pubid><pubid idtype="pmpid" link="fulltext">19121221</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Dissecting the bacterial type VI secretion system by a genome wide <it>in silico </it>analysis: what can be learned from available microbial genomic resources?</p></title><aug><au><snm>Boyer</snm><fnm>F</fnm></au><au><snm>Fichant</snm><fnm>G</fnm></au><au><snm>Berthod</snm><fnm>J</fnm></au><au><snm>Vandenbrouck</snm><fnm>Y</fnm></au><au><snm>Attree</snm><fnm>I</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><fpage>104</fpage><lpage>11</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-104</pubid><pubid idtype="pmcid">2660368</pubid><pubid idtype="pmpid" link="fulltext">19284603</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Type VI secretion system translocates a phage tail spike-like protein into target cells where it cross-links actin</p></title><aug><au><snm>Pukatzki</snm><fnm>S</fnm></au><au><snm>Revel</snm><fnm>AT</fnm></au><au><snm>Sturtevant</snm><fnm>D</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>15508</fpage><lpage>15513</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0706532104</pubid><pubid idtype="pmcid">2000545</pubid><pubid idtype="pmpid" link="fulltext">17873062</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Type VI secretion apparatus and phage tail-associated protein complexes share a common evolutionary origin</p></title><aug><au><snm>Leiman</snm><fnm>PG</fnm></au><au><snm>Basler</snm><fnm>M</fnm></au><au><snm>Ramagopal</snm><fnm>UA</fnm></au><au><snm>Bonanno</snm><fnm>JB</fnm></au><au><snm>Sauder</snm><fnm>JM</fnm></au><au><snm>Pukatzki</snm><fnm>S</fnm></au><au><snm>Burley</snm><fnm>SK</fnm></au><au><snm>Almo</snm><fnm>SC</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2009</pubdate><volume>106</volume><fpage>4154</fpage><lpage>4159</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0813360106</pubid><pubid idtype="pmcid">2657435</pubid><pubid idtype="pmpid" link="fulltext">19251641</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>The type VI secretion system: translocation of effectors and effector-domains</p></title><aug><au><snm>Pukatzki</snm><fnm>S</fnm></au><au><snm>McAuley</snm><fnm>SB</fnm></au><au><snm>Miyata</snm><fnm>ST</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2009</pubdate><volume>12</volume><fpage>11</fpage><lpage>17</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.mib.2008.11.010</pubid><pubid idtype="pmpid" link="fulltext">19162533</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Type VI secretion system in <it>Pseudomonas aeruginosa</it>: secretion and multimerization of VgrG proteins</p></title><aug><au><snm>Hachani</snm><fnm>A</fnm></au><au><snm>Lossi</snm><fnm>NS</fnm></au><au><snm>Hamilton</snm><fnm>A</fnm></au><au><snm>Jones</snm><fnm>C</fnm></au><au><snm>Bleves</snm><fnm>S</fnm></au><au><snm>Albesa-Jov&#233;</snm><fnm>D</fnm></au><au><snm>Filloux</snm><fnm>A</fnm></au></aug><source>J Biol Chem</source><pubdate>2011</pubdate><volume>286</volume><fpage>12317</fpage><lpage>12327</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M110.193045</pubid><pubid idtype="pmcid">3069435</pubid><pubid idtype="pmpid" link="fulltext">21325275</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Translocation of a <it>Vibrio cholerae </it>type VI secretion effector requires bacterial endocytosis by host cells</p></title><aug><au><snm>Ma</snm><fnm>AT</fnm></au><au><snm>McAuley</snm><fnm>S</fnm></au><au><snm>Pukatzki</snm><fnm>S</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Cell Host Microbe</source><pubdate>2009</pubdate><volume>5</volume><fpage>234</fpage><lpage>243</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.chom.2009.02.005</pubid><pubid idtype="pmcid">3142922</pubid><pubid idtype="pmpid" link="fulltext">19286133</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p><it>In silico </it>analysis reveals multiple putative type VI secretion systems and effector proteins in <it>Pseudomonas syringae </it>pathovars</p></title><aug><au><snm>Sarris</snm><fnm>PF</fnm></au><au><snm>Skandalis</snm><fnm>N</fnm></au><au><snm>Kokkinidis</snm><fnm>M</fnm></au><au><snm>Panopoulos</snm><fnm>NJ</fnm></au></aug><source>Mol Plant Pathol</source><pubdate>2010</pubdate><volume>11</volume><fpage>795</fpage><lpage>804</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">21091602</pubid></xrefbib></bibl><bibl id="B22"><title><p>Complete genome sequence of the fire blight pathogen <it>Erwinia amylovora </it>CFBP 1430 and comparison to other <it>Erwinia </it>spp</p></title><aug><au><snm>Smits</snm><fnm>THM</fnm></au><au><snm>Rezzonico</snm><fnm>F</fnm></au><au><snm>Kamber</snm><fnm>T</fnm></au><au><snm>Blom</snm><fnm>J</fnm></au><au><snm>Goesmann</snm><fnm>A</fnm></au><au><snm>Frey</snm><fnm>JE</fnm></au><au><snm>Duffy</snm><fnm>B</fnm></au></aug><source>BMC Genomics</source><pubdate>2010</pubdate><volume>23</volume><fpage>384</fpage><lpage>393</lpage></bibl><bibl id="B23"><title><p>Ultrastructural investigation of resistant and susceptible maize inbreds infected with <it>Erwinia stewartii</it></p></title><aug><au><snm>Braun</snm><fnm>EJ</fnm></au></aug><source>Phytopathology</source><pubdate>1982</pubdate><volume>72</volume><fpage>159</fpage><lpage>166</lpage><xrefbib><pubid idtype="doi">10.1094/Phyto-72-159</pubid></xrefbib></bibl><bibl id="B24"><title><p><it>Pantoea ananatis</it>: an unconventional plant pathogen</p></title><aug><au><snm>Coutinho</snm><fnm>TA</fnm></au><au><snm>Venter</snm><fnm>SN</fnm></au></aug><source>Mol Plant Pathol</source><pubdate>2009</pubdate><volume>10</volume><fpage>325</fpage><lpage>335</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1364-3703.2009.00542.x</pubid><pubid idtype="pmpid" link="fulltext">19400836</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Genome comparison of the epiphytic bacteria <it>Erwinia billingiae </it>and <it>E. tasmaniensis </it>with the pear pathogen <it>E. pyrifoliae</it></p></title><aug><au><snm>Kube</snm><fnm>M</fnm></au><au><snm>Migdoll</snm><fnm>AM</fnm></au><au><snm>Gehring</snm><fnm>I</fnm></au><au><snm>Heitmann</snm><fnm>K</fnm></au><au><snm>Mayer</snm><fnm>Y</fnm></au><au><snm>Kuhl</snm><fnm>H</fnm></au><au><snm>Knaust</snm><fnm>F</fnm></au><au><snm>Geider</snm><fnm>K</fnm></au><au><snm>Reinhardt</snm><fnm>R</fnm></au></aug><source>BMC Genomics</source><pubdate>2010</pubdate><volume>11</volume><fpage>393</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-11-393</pubid><pubid idtype="pmcid">2897811</pubid><pubid idtype="pmpid" link="fulltext">20565991</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Antibiosis and acidification by <it>Pantoea agglomerans </it>strain E325 may contribute to suppression of <it>Erwinia amylovora</it></p></title><aug><au><snm>Pusey</snm><fnm>PL</fnm></au><au><snm>Stockwell</snm><fnm>VO</fnm></au><au><snm>Rudell</snm><fnm>DR</fnm></au></aug><source>Phytopathology</source><pubdate>2008</pubdate><volume>98</volume><fpage>1136</fpage><lpage>1143</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1094/PHYTO-98-10-1136</pubid><pubid idtype="pmpid" link="fulltext">18943460</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Control of fire blight by <it>Pseudomonas fluorescens </it>A506 and <it>Pantoea vagans </it>C9-1 applied as single strains and mixed inocula</p></title><aug><au><snm>Stockwell</snm><fnm>VO</fnm></au><au><snm>Johnson</snm><fnm>KB</fnm></au><au><snm>Sugar</snm><fnm>D</fnm></au><au><snm>Loper</snm><fnm>JE</fnm></au></aug><source>Phytopathology</source><pubdate>2010</pubdate><volume>100</volume><fpage>1330</fpage><lpage>1339</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1094/PHYTO-03-10-0097</pubid><pubid idtype="pmpid" link="fulltext">20839963</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Metabolic versatility and antibacterial metabolite biosynthesis are distinguishing genomic features of the fire blight antagonist <it>Pantoea vagans </it>C9-1</p></title><aug><au><snm>Smits</snm><fnm>THM</fnm></au><au><snm>Rezzonico</snm><fnm>F</fnm></au><au><snm>Kamber</snm><fnm>T</fnm></au><au><snm>Goesmann</snm><fnm>A</fnm></au><au><snm>Ishimaru</snm><fnm>CA</fnm></au><au><snm>Frey</snm><fnm>JE</fnm></au><au><snm>Stockwell</snm><fnm>VO</fnm></au><au><snm>Duffy</snm><fnm>B</fnm></au></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><fpage>e22247</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0022247</pubid><pubid idtype="pmcid">3137637</pubid><pubid idtype="pmpid" link="fulltext">21789243</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Effects of symbiotic bacteria and tree chemistry on the growth and reproduction of bark beetle fungal symbionts</p></title><aug><au><snm>Adams</snm><fnm>AS</fnm></au><au><snm>Currie</snm><fnm>CR</fnm></au><au><snm>Cardoza</snm><fnm>Y</fnm></au><au><snm>Klepzig</snm><fnm>KD</fnm></au><au><snm>Raffa</snm><fnm>KF</fnm></au></aug><source>Can J For Res</source><pubdate>2009</pubdate><volume>39</volume><fpage>1133</fpage><lpage>1147</lpage><xrefbib><pubid idtype="doi">10.1139/X09-034</pubid></xrefbib></bibl><bibl id="B30"><title><p>Symbiotic nitrogen fixation in the fungus gardens of leaf-cutter ants</p></title><aug><au><snm>Pinto-Tom&#225;s</snm><fnm>AA</fnm></au><au><snm>Anderson</snm><fnm>MA</fnm></au><au><snm>Suen</snm><fnm>G</fnm></au><au><snm>Stevenson</snm><fnm>DM</fnm></au><au><snm>Chu</snm><fnm>FST</fnm></au><au><snm>Cleland</snm><fnm>WW</fnm></au><au><snm>Weimer</snm><fnm>PJ</fnm></au><au><snm>Currie</snm><fnm>CR</fnm></au></aug><source>Science</source><pubdate>2009</pubdate><volume>326</volume><fpage>1120</fpage><lpage>1123</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1173036</pubid><pubid idtype="pmpid" link="fulltext">19965433</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Genome sequence of <it>Pantoea ananatis </it>LMG 20103, the causative agent of Eucalyptus blight and dieback</p></title><aug><au><snm>De Maayer</snm><fnm>P</fnm></au><au><snm>Chan</snm><fnm>WY</fnm></au><au><snm>Venter</snm><fnm>SN</fnm></au><au><snm>Toth</snm><fnm>IK</fnm></au><au><snm>Birch</snm><fnm>PRJ</fnm></au><au><snm>Joubert</snm><fnm>F</fnm></au><au><snm>Coutinho</snm><fnm>TA</fnm></au></aug><source>J Bacteriol</source><pubdate>2010</pubdate><volume>192</volume><fpage>2936</fpage><lpage>2937</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.00060-10</pubid><pubid idtype="pmcid">2876505</pubid><pubid idtype="pmpid" link="fulltext">20348253</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Genome sequence of the biocontrol agent <it>Pantoea vagans </it>C9-1</p></title><aug><au><snm>Smits</snm><fnm>THM</fnm></au><au><snm>Rezzonico</snm><fnm>F</fnm></au><au><snm>Kamber</snm><fnm>T</fnm></au><au><snm>Goesmann</snm><fnm>A</fnm></au><au><snm>Ishimaru</snm><fnm>CA</fnm></au><au><snm>Stockwell</snm><fnm>VO</fnm></au><au><snm>Frey</snm><fnm>JE</fnm></au><au><snm>Duffy</snm><fnm>B</fnm></au></aug><source>J Bacteriol</source><pubdate>2010</pubdate><volume>192</volume><fpage>6486</fpage><lpage>6487</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.01122-10</pubid><pubid idtype="pmcid">3008540</pubid><pubid idtype="pmpid" link="fulltext">20952567</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Complete genome sequence of the fire blight pathogen <it>Erwinia pyrifoliae </it>DSM 12163<sup>T </sup>and comparative genomic insights into plant pathogenicity</p></title><aug><au><snm>Smits</snm><fnm>THM</fnm></au><au><snm>Jaenicke</snm><fnm>S</fnm></au><au><snm>Rezzonico</snm><fnm>F</fnm></au><au><snm>Kamber</snm><fnm>T</fnm></au><au><snm>Goesmann</snm><fnm>A</fnm></au><au><snm>Frey</snm><fnm>JE</fnm></au><au><snm>Duffy</snm><fnm>B</fnm></au></aug><source>BMC Genomics</source><pubdate>2010</pubdate><volume>4</volume><fpage>2</fpage></bibl><bibl id="B34"><title><p>The genome of <it>Erwinia tasmaniensis </it>strain Et1/99, a non-pathogenic bacterium in the genus <it>Erwinia</it></p></title><aug><au><snm>Kube</snm><fnm>M</fnm></au><au><snm>Migdoll</snm><fnm>AM</fnm></au><au><snm>M&#252;ller</snm><fnm>I</fnm></au><au><snm>Kuhl</snm><fnm>H</fnm></au><au><snm>Beck</snm><fnm>A</fnm></au><au><snm>Reinhardt</snm><fnm>R</fnm></au><au><snm>Geider</snm><fnm>K</fnm></au></aug><source>Environ Microbiol</source><pubdate>2008</pubdate><volume>10</volume><fpage>2211</fpage><lpage>2222</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1462-2920.2008.01639.x</pubid><pubid idtype="pmpid" link="fulltext">18462403</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Threonine phosphorylation post-translationally regulates protein secretion in <it>Pseudomonas aeruginosa</it></p></title><aug><au><snm>Mougous</snm><fnm>JD</fnm></au><au><snm>Gifford</snm><fnm>CA</fnm></au><au><snm>Ramsdell</snm><fnm>TL</fnm></au><au><snm>Mekalanos</snm><fnm>JJ</fnm></au></aug><source>Nature Cell Biol</source><pubdate>2007</pubdate><volume>9</volume><fpage>797</fpage><lpage>803</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ncb1605</pubid><pubid idtype="pmpid" link="fulltext">17558395</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Evolutionary diversification of an ancient gene family (<it>rhs</it>) through C-terminal displacement</p></title><aug><au><snm>Jackson</snm><fnm>AP</fnm></au><au><snm>Thomas</snm><fnm>GH</fnm></au><au><snm>Parkhill</snm><fnm>J</fnm></au><au><snm>Thomson</snm><fnm>NR</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>16</volume><fpage>1</fpage><lpage>16</lpage></bibl><bibl id="B37"><title><p>TIGRFAM database</p></title><url>http://www.jcvi.org/cgi-bin/tigrfams/</url></bibl><bibl id="B38"><title><p>Protein families (PFAM) database</p></title><url>http://pfam.sanger.ac.uk/</url></bibl><bibl id="B39"><title><p>COG3926 and COG5526: A tale of two new lysozyme-like protein families</p></title><aug><au><snm>Pei</snm><fnm>J</fnm></au><au><snm>Grishin</snm><fnm>NV</fnm></au></aug><source>Prot Sci</source><pubdate>2005</pubdate><volume>14</volume><fpage>2574</fpage><lpage>2581</lpage><xrefbib><pubid idtype="doi">10.1110/ps.051656805</pubid></xrefbib></bibl><bibl id="B40"><title><p>Conserved Orthologous Groups database</p></title><url>http://www.ncbi.nlm.nih.gov/COG/grace/</url></bibl><bibl id="B41"><title><p>Protein Databank (PDB)</p></title><url>http://www.pdb.org/</url></bibl><bibl id="B42"><title><p>Genomic analysis of the type VI secretion systems in <it>Pseudomonas </it>spp: novel clusters and putative effectors uncovered</p></title><aug><au><snm>Barret</snm><fnm>M</fnm></au><au><snm>Egan</snm><fnm>F</fnm></au><au><snm>Fargier</snm><fnm>E</fnm></au><au><snm>Morrisey</snm><fnm>JP</fnm></au><au><snm>O&apos;Gara</snm><fnm>F</fnm></au></aug><source>Microbiology</source><pubdate>2011</pubdate><volume>157</volume><fpage>1726</fpage><lpage>1739</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1099/mic.0.048645-0</pubid><pubid idtype="pmpid" link="fulltext">21474537</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Crystal structure of <it>Pseudomonas aeruginosa </it>lipase in the open conformation</p></title><aug><au><snm>Nardini</snm><fnm>M</fnm></au><au><snm>Lang</snm><fnm>DA</fnm></au><au><snm>Liebeton</snm><fnm>K</fnm></au><au><snm>Jaeger</snm><fnm>K-E</fnm></au><au><snm>Dijstra</snm><fnm>BW</fnm></au></aug><source>J Biol Chem</source><pubdate>2000</pubdate><volume>275</volume><fpage>31219</fpage><lpage>31225</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10893416</pubid></xrefbib></bibl><bibl id="B44"><title><p><it>Burkholderia glumae</it>: next major pathogen of rice?</p></title><aug><au><snm>Ham</snm><fnm>JH</fnm></au><au><snm>Melanson</snm><fnm>RA</fnm></au><au><snm>Rush</snm><fnm>MC</fnm></au></aug><source>Mol Plant Pathol</source><pubdate>2011</pubdate><volume>12</volume><fpage>329</fpage><lpage>339</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1364-3703.2010.00676.x</pubid><pubid idtype="pmpid" link="fulltext">21453428</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>The staphostatin-staphopain complex: a forward binding inhibitor in complex with its target cysteine protease</p></title><aug><au><snm>Filipek</snm><fnm>R</fnm></au><au><snm>Rzychon</snm><fnm>M</fnm></au><au><snm>Olesky</snm><fnm>A</fnm></au><au><snm>Gruca</snm><fnm>M</fnm></au><au><snm>Dubin</snm><fnm>A</fnm></au><au><snm>Potempa</snm><fnm>J</fnm></au><au><snm>Botchler</snm><fnm>M</fnm></au></aug><source>J Biol Chem</source><pubdate>2003</pubdate><volume>278</volume><fpage>40959</fpage><lpage>40966</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M302926200</pubid><pubid idtype="pmpid" link="fulltext">12874290</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>The structure of <it>Clostridium perfringens </it>NanI sialidase and its catalytic intermediates</p></title><aug><au><snm>Newstead</snm><fnm>SL</fnm></au><au><snm>Potter</snm><fnm>JA</fnm></au><au><snm>Wilson</snm><fnm>JC</fnm></au><au><snm>Xu</snm><fnm>G</fnm></au><au><snm>Chien</snm><fnm>C-H</fnm></au><au><snm>Watts</snm><fnm>AG</fnm></au><au><snm>Withers</snm><fnm>SG</fnm></au><au><snm>Taylor</snm><fnm>GL</fnm></au></aug><source>J Biol Chem</source><pubdate>2008</pubdate><volume>283</volume><fpage>9080</fpage><lpage>9088</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M710247200</pubid><pubid idtype="pmcid">2431023</pubid><pubid idtype="pmpid" link="fulltext">18218621</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Crystal structure and enzymatic properties of a bacterial family 19 chitinase reveal differences from plant enzymes</p></title><aug><au><snm>Hoell</snm><fnm>IA</fnm></au><au><snm>Dalhus</snm><fnm>B</fnm></au><au><snm>Heggset</snm><fnm>EB</fnm></au><au><snm>Aspmo</snm><fnm>SI</fnm></au><au><snm>Eijsink</snm><fnm>VGH</fnm></au></aug><source>FEBS J</source><pubdate>2006</pubdate><volume>273</volume><fpage>4889</fpage><lpage>4900</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1742-4658.2006.05487.x</pubid><pubid idtype="pmpid" link="fulltext">17010167</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Structure of the bacteriophage &#966;KZ lytic transglycosylase gp144</p></title><aug><au><snm>Fokine</snm><fnm>A</fnm></au><au><snm>Miroshnikov</snm><fnm>KA</fnm></au><au><snm>Shneider</snm><fnm>MM</fnm></au><au><snm>Mesyanzhinov</snm><fnm>VV</fnm></au><au><snm>Rossmann</snm><fnm>MG</fnm></au></aug><source>J Biol Chem</source><pubdate>2008</pubdate><volume>11</volume><fpage>7247</fpage><lpage>7250</lpage></bibl><bibl id="B49"><title><p>The pyocins of <it>Pseudomonas aeruginosa</it></p></title><aug><au><snm>Michel-Briand</snm><fnm>Y</fnm></au><au><snm>Baysse</snm><fnm>C</fnm></au></aug><source>Biochimie</source><pubdate>2002</pubdate><volume>84</volume><fpage>499</fpage><lpage>510</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0300-9084(02)01422-0</pubid><pubid idtype="pmpid" link="fulltext">12423794</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Dissection of a type VI secretion system in <it>Edwardsiella tarda</it></p></title><aug><au><snm>Zheng</snm><fnm>J</fnm></au><au><snm>Leung</snm><fnm>KY</fnm></au></aug><source>Mol Microbiol</source><pubdate>2007</pubdate><volume>66</volume><fpage>1192</fpage><lpage>1206</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2958.2007.05993.x</pubid><pubid idtype="pmpid" link="fulltext">17986187</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Basic local alignment search tool</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Gish</snm><fnm>W</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Myers</snm><fnm>EW</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>J Mol Biol</source><pubdate>1990</pubdate><volume>215</volume><fpage>403</fpage><lpage>410</lpage><xrefbib><pubid idtype="pmpid">2231712</pubid></xrefbib></bibl><bibl id="B52"><title><p>National Centre for Biotechnology Information protein database</p></title><url>http://www.ncbi.nlm.nih.gov/blast</url></bibl><bibl id="B53"><title><p>Type VI secretion: a beginner's guide</p></title><aug><au><snm>Bingle</snm><fnm>LEH</fnm></au><au><snm>Bailey</snm><fnm>CM</fnm></au><au><snm>Pallen</snm><fnm>MJ</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2009</pubdate><volume>11</volume><fpage>3</fpage><lpage>8</lpage></bibl><bibl id="B54"><title><p>MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences</p></title><aug><au><snm>Kumar</snm><fnm>S</fnm></au><au><snm>Dudley</snm><fnm>J</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Tamura</snm><fnm>K</fnm></au></aug><source>Brief Bioinform</source><pubdate>2008</pubdate><volume>9</volume><fpage>299</fpage><lpage>306</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bib/bbn017</pubid><pubid idtype="pmcid">2562624</pubid><pubid idtype="pmpid" link="fulltext">18417537</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT</p></title><aug><au><snm>Hall</snm><fnm>TA</fnm></au></aug><source>Nucl Acids Symp Ser</source><pubdate>1999</pubdate><volume>41</volume><fpage>95</fpage><lpage>98</lpage></bibl><bibl id="B56"><title><p>Softberry FgenesB bacterial operon and gene prediction server</p></title><url>http://linux1.softberry.com</url></bibl><bibl id="B57"><title><p>National Centre for Biotechnology Information open reading frame prediction server</p></title><url>http://www.ncbi.nlm.nih.gov/gorf/</url></bibl><bibl id="B58"><title><p>CD-Search: protein domain annotations on the fly</p></title><aug><au><snm>Marchler-Bauer</snm><fnm>A</fnm></au><au><snm>Bryant</snm><fnm>SH</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><fpage>327</fpage><lpage>331</lpage><xrefbib><pubid idtype="doi">10.1093/nar/gkh454</pubid></xrefbib></bibl><bibl id="B59"><title><p>National Centre for Biotechnology Information conserved protein domain search server</p></title><url>http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi</url></bibl><bibl id="B60"><title><p>The HHpred interactive server for protein homology detection and structure prediction</p></title><aug><au><snm>S&#246;ding</snm><fnm>J</fnm></au><au><snm>Biegert</snm><fnm>A</fnm></au><au><snm>Lupas</snm><fnm>AN</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><issue>33</issue><fpage>W244</fpage><lpage>248</lpage></bibl><bibl id="B61"><title><p>Homology detection and structure prediction by HMM-HMM comparison server</p></title><url>http://toolkit.tuebingen.mpg.de/hhpred</url></bibl></refgrp>
</bm>
</art>