<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-9-research0047</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>NEAT: a domain duplicated in genes near the components of a putative Fe<sup>3+</sup> siderophore transporter from Gram-positive pathogenic bacteria</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Andrade</snm>
               <mi>A</mi>
               <fnm>Miguel</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>andrade@embl-heidelberg.de</email>
            </au>
            <au id="A2">
               <snm>Ciccarelli</snm>
               <mi>D</mi>
               <fnm>Francesca</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Perez-Iratxeta</snm>
               <fnm>Carolina</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Bork</snm>
               <fnm>Peer</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany</p>
            </ins>
            <ins id="I2">
               <p>Department of Bioinformatics, Max Delbruck Center for Molecular Medicine, 13092, Berlin-Buch, Germany</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>9</issue>
         <fpage>research0047.1</fpage>
         <lpage>research0047.5</lpage>
         <url>http://genomebiology.com/2002/3/9/research/0047</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-9-research0047</pubid>
               <pubid idtype="pmpid">12225586</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>4</month>
               <year>2002</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>17</day>
               <month>5</month>
               <year>2002</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>5</day>
               <month>6</month>
               <year>2002</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>8</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>Andrade et al., licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>NEAT: a duplicated domain in genes near a putative Fe<sup>3+</sup> transporter in pathogenic bacteria</p>
      </shorttitle>
      <shortabs>
         <p>Iron uptake from the host is essential for bacteria that infect animals. A protein domain has been identified that appears in variable copy number in bacterial genes that are usually in the vicinity of a putative Fe<sup>3+</sup> siderophore transporter. This domain has been denoted NEAT for near transporter.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Iron uptake from the host is essential for bacteria that infect animals. To find potential targets for drugs active against pathogenic bacteria, we have searched all completely sequenced genomes of pathogenic bacteria for genes relevant for iron transport.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We identified a protein domain that appears in variable copy number in bacterial genes that are usually in the vicinity of a putative Fe<sup>3+</sup> siderophore transporter. Accordingly, we have denoted this domain NEAT for 'near transporter'. Most of the bacterial species containing this domain are pathogenic. Sequence features indicate that the domain is anchored to the extracellular side of the membrane. The domain seems to be under high selective pressure for rapid independent duplications that are typical of sequences involved in signaling and binding.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The NEAT domain might be functionally related to iron transport. The taxonomic specificity of this domain and its predicted extracellular position could make it an interesting target for designing new drugs against some highly pathogenic bacteria.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010012">Medicine</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010018">Physiology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Iron transport into the cell is very important for the growth of an organism. Pathogenic bacteria, which have to survive within an animal, are able to sequester iron from the iron-containing proteins of the host by secreting siderophores that have a higher affinity for the iron (reviewed in [<abbr bid="B1">1</abbr>]). Then, a specific transport system imports the iron-siderophore complex back into the bacterial cytoplasm. The disruption of this uptake function in bacteria is likely to be a good strategy in fighting infectivity. We searched the genomic neighborhood of putative Fe<sup>3+</sup> siderophore transporters in pathogenic bacteria in order to identify genes that could be associated with this functionality and thus constitute targets for therapy against disease. As a result of our analysis, we characterized a highly duplicated domain that we propose as a receptor for an iron complex.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Survey for putative Fe<sup>3+</sup> siderophore transporters in complete bacterial genomes</p>
            </st>
            <p>In order to find proteins related to iron transport in pathogenic bacteria, we first scanned complete genomes of pathogenic bacteria for sequences homologous to those encoding the three currently known <it>Escherichia coli</it> Fe<sup>3+</sup> siderophore transporters: the Fe<sup>3+</sup> dicitrate transport complex [<abbr bid="B2">2</abbr>], the Fe<sup>3+</sup> enterobactin transport complex [<abbr bid="B3">3</abbr>], and the Fe<sup>3+</sup> hydroxamate transport complex [<abbr bid="B4">4</abbr>]. These transporters import iron from the periplasm into the cytoplasm of <it>E. coli,</it> expending ATP. Several components of the putative transporter were found in contiguous genomic positions of four pathogenic Gram-positive bacteria, three of which are associated with food-borne diseases (<it>Listeria monocytogenes, Clostridium perfringens,</it> and <it>Staphylococcus aureus</it>). In humans, the fourth bacterium, <it>Staphylococcus pyogenes,</it> produces pharyngitis, impetigo, toxic shock syndrome, necrotizing fasciitis, rheumatic fever, and acute glomerulonephritis.</p>
         </sec>
         <sec>
            <st>
               <p>Gene neighborhood</p>
            </st>
            <p>In order to find genes associated with the putative Fe<sup>3+</sup> siderophore transporter that could be characteristic of the pathogenic species, we analyzed the genomic neighborhood of the transporter in complete genomes. The repeated presence of neighboring gene pairs across different species permits us to reach conclusions about the possible functional association of the paired genes [<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>,<abbr bid="B7">7</abbr>].</p>
            <p>The examination of the genomic neighborhood of the transporter indicated the correlated presence of genes containing a conserved domain, with a variable copy number (from one up to five) within the same sequence. We therefore denoted this newly described domain as NEAT, for 'near transporter'. A similar correlation between the transporter and genes containing the NEAT domain was also found in the nonpathogenic species <it>Listeria innocua</it> and <it>Bacillus halodurans</it> (see Figure <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Conserved genome organization around the components of a putative Fe<sup>3+</sup> siderophore transporter (from STRING [26,27] or from the literaturewhen the genome is not present in STRING)</p>
               </caption>
               <text>
                  <p>Conserved genome organization around the components of a putative Fe<sup>3+</sup> siderophore transporter (from STRING [26,27] or from the literature when the genome is not present in STRING). Each gene is represented with an arrow-shaped box (as inSTRING) pointing in the direction of transcription. The genes in yellow contain the NEAT domain; the pink ovals indicate the number of occurrences of the domain within the gene. Genes in green are related to <it>Escherichia coli fecB, fepB, or fhuD</it> (Fe<sup>3+</sup> siderophore transporter, periplasmic component). Genes in red are related to <it>fecC, fepG, fhuB</it> (Fe<sup>3+</sup> siderophore transporter, transmembrane component). Genes in violet are related to <it>fecE, fepC, fhuC</it> (Fe<sup>3+</sup> siderophore transporter, ATPase component). Remaining genes (in white) did not show significant sequence similarity to any of the other genes displayed in the figure. Codes for genes containing the NEAT domain are shown in Table <tblr tid="T1">1</tblr>. For the neighboring genes the names used correspond to those from the corresponding genomic project. Spy1797 should be shown between S_pyog (Spy1798) and Spy1796, but is not displayed here because there was no such corresponding entry in GenBank. Note that there was no neighboring ATPase (violet gene) in <it>S. aureus</it>; the most similar sequence in this species is SA0602 (not shown).</p>
               </text>
               <graphic file="gb-2002-3-9-research0047-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Codes for genes containing the NEAT domain</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Gene code</p>
                     </c>
                     <c ca="left">
                        <p>Protein accession number<sup>*</sup></p>
                     </c>
                     <c ca="left">
                        <p>Bacterium</p>
                     </c>
                     <c ca="left">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B_hal1</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q9K7R1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>B. halodurans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[20]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B_hal2</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q9K7R0</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>B. halodurans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[20]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C_per1</p>
                     </c>
                     <c ca="left">
                        <p>GB:18143877</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>C. perfringens</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[21]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C_per2</p>
                     </c>
                     <c ca="left">
                        <p>GB:18143878</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>C. perfringens</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[21]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S_pyog</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q99YA0</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. pyogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[22]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L_inn1</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q929I6</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>L. innocua</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L_inn2</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q929I5</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>L. innocua</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L_mon1</p>
                     </c>
                     <c ca="left">
                        <p>GB:16804224</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>L. monocytogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L_mon2</p>
                     </c>
                     <c ca="left">
                        <p>GB:16411656</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>L. monocytogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>[23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S_aur1</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q99TD3</p>
                     </c>
                     <c ca="left">
                        <p><it>S. aureus</it> strain N315</p>
                     </c>
                     <c ca="left">
                        <p>[24]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S_aur2</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q99UX5</p>
                     </c>
                     <c ca="left">
                        <p><it>S. aureus</it> strain N315</p>
                     </c>
                     <c ca="left">
                        <p>[24]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S_aur3</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q99UX3</p>
                     </c>
                     <c ca="left">
                        <p><it>S. aureus</it> strain N315</p>
                     </c>
                     <c ca="left">
                        <p>[24]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S_aur4</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q9KW67</p>
                     </c>
                     <c ca="left">
                        <p><it>S. aureus</it> strain N315</p>
                     </c>
                     <c ca="left">
                        <p>[24]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B_anth</p>
                     </c>
                     <c ca="left">
                        <p>SP:Q9X358</p>
                     </c>
                     <c ca="left">
                        <p><it>B. anthracis,</it> virulence plasmid PX01</p>
                     </c>
                     <c ca="left">
                        <p>[25]</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>*</sup>Protein accession numbers are shown as SP:xxxxxx for the SPTREMBL database, and as GB:xxxxxx for GenBank database. The corresponding sequences from <it>Staphylococcus aureus</it> strain Mu50 were identical to those from <it>S. aureus</it> strain N315 and were not included in the analysis.</p>
               </tblfn>
            </tbl>
            <p>The search for homologous sequences in the whole protein database added one more sequence from <it>S. aureus,</it> and a short protein corresponding to the middle part of the domain in the virulence plasmid pXO1 of <it>Bacillus anthracis,</it> which is essential for the manifestation of the disease anthrax. Both genes are apparently not physically close to transport-related genes.</p>
            <p>The alignment of all identified instances of the NEAT domain (Figure <figr fid="F2">2</figr>) indicates a conserved region of about 125 amino acids. The predicted secondary structure (using PHD [<abbr bid="B8">8</abbr>]) suggests that this domain is mostly composed of beta strands. The NEAT domain appears in combination with other domains and sequence features in various proteins (see Figure <figr fid="F3">3</figr>). A distinctive feature of most of these proteins is the prediction of an amino-terminal signal sequence and a carboxy-terminal transmembrane region (from SMART [<abbr bid="B9">9</abbr>,<abbr bid="B10">10</abbr>], using the Bioperl sigcleave module [<abbr bid="B11">11</abbr>], based on [<abbr bid="B12">12</abbr>]; and from THMM2 [<abbr bid="B13">13</abbr>], respectively). In two <it>S. aureus</it> proteins (see Figure <figr fid="F3">3</figr> for details) the transmembrane region prediction is over-ruled by the prediction of a carboxy-terminal motif that is typical of surface proteins of Gram-positive cocci. This motif consists of a fairly conserved hexapeptide followed by a hydrophobic anchor and two or three basic residues ([<abbr bid="B14">14</abbr>]; detected using Pfam [<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>]). These features indicate a highly probable association between the proteins containing the NEAT domain and the membrane, and the signal peptide suggests that they are exported to the extracellular side of the membrane.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Multiple alignment of the occurrences of the NEAT domain, generated with the ClustalW program [8]</p>
               </caption>
               <text>
                  <p>Multiple alignment of the occurrences of the NEAT domain, generated with the ClustalW program [28]. SMART [9,10], which identifies repeats using prospero [29,30], was used to search for domains in some sequences. The internal repeats detected in this manual analysis were used to generate subsequences that were used for building the first alignment. Then, we followed an iterative procedure by building a Hidden Markov Model (HMM) of the alignment and adding to the alignment significant hits from an HMM search [31,32] comparison of the HMM to the NCBI's nonredundant protein database. For the final HMM (derived from the alignment presented in this figure) no more similar sequences were detected below a standard <it>E</it>-value threshold (<it>E</it> = 0.001). The consensus in 70% of the sequences is reported below the alignment. Residue ranges are listed next to the protein code name. The letters h, l, and p indicate hydrophobic, aliphatic, and polar residues, respectively. Hydrophobic residues are highlighted in dark blue, polar residues in green, and a fairly conserved arginine (R in the consensus sequence) in red. Codes are the same as in Figure <figr fid="F1">1</figr> and Table <tblr tid="T1">1</tblr>. The predicted secondary structure [8], mostly beta sheet, is displayed at the bottom of the figure. Although the B_anth sequence corresponds to a fragment of the domain, examination of the corresponding DNA sequence indicates that the actual translation product might extend further in both the amino- and carboxy-terminal directions.</p>
               </text>
               <graphic file="gb-2002-3-9-research0047-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Modular arrangements of sequences containing the NEAT domain</p>
               </caption>
               <text>
                  <p>Modular arrangements of sequences containing the NEAT domain. Protein codes are the same as those shown in Figure <figr fid="F1">1</figr> and Table <tblr tid="T1">1</tblr>. The red line indicates the signal peptide; the blue box represents a transmembrane helix; the gray box indicates the Gram-positive anchor as detected by Pfam [15,16]; the black box represents a hydrophobic carboxy-terminal anchoring domain proposed for two <it>Listeria</it> sequences [23]. PKD is the polycystic kidney disease domain (present in PKD1, chitinases, and collagenases, among others), and LRR stands for leucine-rich repeat (ten copies detected in S_pyog, using the program REP [33,34]). The scale bar indicates the length in amino acids.</p>
               </text>
               <graphic file="gb-2002-3-9-research0047-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis of the new domain</p>
            </st>
            <p>The phylogenetic tree constructed from the alignment of different domain occurrences (Figure <figr fid="F4">4</figr>) indicates a variety of independent duplication events. Some species contain up to four sequences with the domain, some only one. All repeats of B_hal1 (except one) cluster together; this indicates that one of the repeats of an ancestral <it>B. halodurans</it> sequence duplicated quickly (after divergence of <it>B. halodurans</it> from the other species displayed in the tree) into another three copies. The <it>C. perfringens</it> domains and the <it>S. aureus</it> domains are also the result of separated gene duplication and domain duplication events, as indicated by the clustering of the domains from these species. The clustering of the two <it>Listeria</it> species indicates no further duplication event in these species after their (recent) divergence.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of the domain instances generated from the multiple alignment shown in Figure <figr fid="F2">2</figr></p>
               </caption>
               <text>
                  <p>Phylogenetic tree of the domain instances generated from the multiple alignment shown in Figure <figr fid="F2">2</figr>. Bootstrapping values range from 0 to 100. The labels indicate the sequence and position of the repeat in the sequence. Domains from the same sequence have identical color (for example, all B_hal1 repeats are red). Domains from sequences of the same species have similar colors (for example, the <it>S. aureus</it> domains are colored in different hues of green).</p>
               </text>
               <graphic file="gb-2002-3-9-research0047-4"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Some protein domains have a highly variable copy number per protein, but they can exist as a single copy. This is in contrast to structural repeats (such as armadillo or leucine-rich repeats) that fold together and, by definition, never appear as a single copy [<abbr bid="B17">17</abbr>]. Whereas structural repeats are related to DNA or protein binding, occasionally repeated domains can bind either large or small substrates; for example, Ca<sup>2+</sup> (bound by C2, cadherin repeats, epidermal growth factor repeats), nucleotides (bound by zinc finger domains, LIM domains, and homeobox domains), or proteins (kazal inhibits serine proteases, ubiquitin domains in polyubiquitin bind target proteins to be degraded, PDZ domains bind polypeptides, nebulin repeats bind actin, immunoglobulins bind antigens, fibronectin 1 repeats bind fibrin and so on). (See the SMART server for further examples and references [<abbr bid="B9">9</abbr>,<abbr bid="B10">10</abbr>].) Accordingly, occasionally repeated domains are often involved in signaling or transcription regulation. A large copy number is used as a way of increasing the effectiveness of the binding activity. This could be the case with the NEAT domain, which can be found as one single copy per sequence. In this respect, the NEAT domain appears to perform a binding function rather than a structural or an enzymatic one. Accordingly, the multiple alignment of the instances of the domain (Figure <figr fid="F2">2</figr>) indicates the lack of obvious conserved catalytic residues.</p>
         <p>The NEAT domain appears to be associated with iron transport in several Gram-positive species (some of them pathogenic). Given its predicted extracellular location and its close association with the components of an iron transport system, one possible function of the NEAT domain is to be a receptor of the siderophore-iron complex. It would initiate a cascade upon detection of the substrate, ending in the expression of the components of the transporter in a system similar to that used in the induction of FecA [<abbr bid="B18">18</abbr>]. Further evidence in this direction is given by recent experimental results for two of the NEAT-domain proteins from <it>S. aureus,</it> FrpA and FrpB (denoted here as S_aur4 and S_aur2, respectively), which were identified as cell wall proteins expressed under iron-restricted conditions [<abbr bid="B19">19</abbr>].</p>
         <p>The multiple duplication of this domain could reflect competition with an inhibitor. It could also be used for increasing bacterial sensitivity to the presence of the iron complex at very low substrate concentrations, in order to trigger the production of the corresponding transporter. The extracellular location of the domain, its association with a key process for bacterial survival, and its specificity to the group of pathogenic bacteria described, all make it a good candidate for developing a strategy against these pathogens.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Iron metabolism in pathogenic bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Ratledge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dover</snm>
                  <fnm>LG</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>54</volume>
            <fpage>881</fpage>
            <lpage>941</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.54.1.881</pubid>
                  <pubid idtype="pmpid" link="fulltext">11018148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Nucleotide sequences of the <it>fecBCDE</it> genes and locations of the proteins suggest a periplasmic-binding-protein-dependent transport mechanism for iron(III) dicitrate in <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Staudenmaier</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>van Hove</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yaraghi</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Braun</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1989</pubdate>
            <volume>171</volume>
            <fpage>2626</fpage>
            <lpage>2633</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2651410</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genetic organization of multiple <it>fep</it> genes encoding ferric enterobactin transport functions in <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ozenberger</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Nahlik</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>McIntosh</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1987</pubdate>
            <volume>169</volume>
            <fpage>3638</fpage>
            <lpage>3646</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2956250</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Nucleotide sequence of the <it>fhuC</it> and <it>fhuD</it> genes involved in iron (III) hydroxamate transport: domains in FhuC homologous to ATP-binding proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Burkhardt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Braun</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1987</pubdate>
            <volume>209</volume>
            <fpage>49</fpage>
            <lpage>55</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2823072</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Conservation of gene order: a fingerprint of proteins that physically interact.</p>
            </title>
            <aug>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>324</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01274-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9787636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The use of gene clusters to infer functional coupling.</p>
            </title>
            <aug>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fonstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>D'Souza</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pusch</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Maltsev</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>2896</fpage>
            <lpage>2901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15866</pubid>
                  <pubid idtype="pmpid" link="fulltext">10077608</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.6.2896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Exploitation of gene context.</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>366</fpage>
            <lpage>370</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(00)00098-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10851194</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Combining evolutionary information and neural networks to predict protein secondary structure.</p>
            </title>
            <aug>
               <au>
                  <snm>Rost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1994</pubdate>
            <volume>19</volume>
            <fpage>55</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8066087</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Recent improvements to the SMART domain-based sequence annotation resource.</p>
            </title>
            <aug>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Goodstadt</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dickens</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mott</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ciccarelli</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>242</fpage>
            <lpage>244</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99073</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752305</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.242</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>SMART - Simple Modular Architecture Research Tool</p>
            </title>
            <url>http://smart.embl-heidelberg.de/</url>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Perldoc documentation for Bioperl Modules</p>
            </title>
            <url>http://doc.bioperl.org</url>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A new method for predicting signal sequences cleavage sites.</p>
            </title>
            <aug>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1986</pubdate>
            <volume>14</volume>
            <fpage>4683</fpage>
            <lpage>4690</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3714490</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>ELL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>305</volume>
            <fpage>567</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4315</pubid>
                  <pubid idtype="pmpid" link="fulltext">11152613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Conservation of a hexapeptide sequence in the anchor region of surface proteins from gram-positive cocci.</p>
            </title>
            <aug>
               <au>
                  <snm>Fischetti</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Pancholi</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Schneewind</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1990</pubdate>
            <volume>4</volume>
            <fpage>1603</fpage>
            <lpage>1605</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2287281</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The Pfam protein families database.</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cerruti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Etwiller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>276</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99071</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752314</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.276</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Protein families database of alignments and HMMs</p>
            </title>
            <url>http://www.sanger.ac.uk/Pfam/</url>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Protein repeats: structures, functions and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>J Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>134</volume>
            <fpage>117</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11551174</pubid>
                  <pubid idtype="doi">10.1006/jsbi.2001.4392</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Surface signaling in ferric citrate transport gene induction: interaction of the FecA, FecR, and FecI regulatory proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Enz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mahren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stroeher</snm>
                  <fnm>UH</fnm>
               </au>
               <au>
                  <snm>Braun</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2000</pubdate>
            <volume>182</volume>
            <fpage>637</fpage>
            <lpage>646</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">94325</pubid>
                  <pubid idtype="pmpid" link="fulltext">10633096</pubid>
                  <pubid idtype="doi">10.1128/JB.182.3.637-646.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Conservation, surface exposure, and <it>in vivo</it> expression of the Frp family of iron-regulated cell wall proteins in <it>Staphylococcus aureus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Morrissey</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Cockayne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hammacott</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bishop</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Denman-Johnson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2002</pubdate>
            <volume>70</volume>
            <fpage>2399</fpage>
            <lpage>2407</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/IAI.70.5.2399-2407.2002</pubid>
                  <pubid idtype="pmpid" link="fulltext">11953376</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Complete genome sequence of the alkaliphilic bacterium <it>Bacillus halodurans</it> and genomic sequence comparison with <it>Bacillus subtilis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Takami</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakasone</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Maeno</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Masui</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fuji</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hirama</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Horikoshi</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>4317</fpage>
            <lpage>4331</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">113120</pubid>
                  <pubid idtype="pmpid" link="fulltext">11058132</pubid>
                  <pubid idtype="doi">10.1093/nar/28.21.4317</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Complete genome sequence of <it>Clostridium perfringens</it>, an anaerobic flesh-eater.</p>
            </title>
            <aug>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ohtani</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ohshima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>996</fpage>
            <lpage>1001</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117419</pubid>
                  <pubid idtype="pmpid" link="fulltext">11792842</pubid>
                  <pubid idtype="doi">10.1073/pnas.022493799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Complete genome sequence of an M1 strain of <it>Streptococcus pyogenes</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ferretti</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>McShan</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Ajdic</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Savic</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Savic</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lyon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Primeaux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sezate</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Suvorov</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Kenton</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>4658</fpage>
            <lpage>4663</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31890</pubid>
                  <pubid idtype="pmpid" link="fulltext">11296296</pubid>
                  <pubid idtype="doi">10.1073/pnas.071559398</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Comparative genomics of <it>Listeria</it> species.</p>
            </title>
            <aug>
               <au>
                  <snm>Glaser</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Frangeul</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Buchrieser</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rusniok</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Amend</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baquero</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Berche</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bloecker</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Brandt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chakraborty</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>849</fpage>
            <lpage>852</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1063447</pubid>
                  <pubid idtype="pmpid" link="fulltext">11679669</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Whole genome sequencing of meticillin-resistant <it>Staphylococcus aureus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kuroda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ohta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Uchiyama</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yuzawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Oguchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aoki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>Lancet</source>
            <pubdate>2001</pubdate>
            <volume>357</volume>
            <fpage>1225</fpage>
            <lpage>1240</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0140-6736(00)04403-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">11418146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Sequence and organization of pXO1, the large <it>Bacillus anthracis</it> plasmid harboring the anthrax toxin genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Okinaka</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Cloud</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hampton</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hoffmaster</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Keim</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Koehler</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Lamke</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kumano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1999</pubdate>
            <volume>181</volume>
            <fpage>6509</fpage>
            <lpage>6515</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">103788</pubid>
                  <pubid idtype="pmpid" link="fulltext">10515943</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>3442</fpage>
            <lpage>3444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">110752</pubid>
                  <pubid idtype="pmpid" link="fulltext">10982861</pubid>
                  <pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>STRING - Search Tool for Recurring Instances of Neighbouring Genes</p>
            </title>
            <url>http://www.bork.embl-heidelberg.de/STRING/</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7984417</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Accurate formula for P-values of gapped local sequence and profile alignments.</p>
            </title>
            <aug>
               <au>
                  <snm>Mott</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>300</volume>
            <fpage>649</fpage>
            <lpage>659</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.3875</pubid>
                  <pubid idtype="pmpid" link="fulltext">10884359</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>PROSPERO</p>
            </title>
            <url>http://www.well.ox.ac.uk/rmott/ARIADNE/prospero.shtml</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Profile hidden Markov models.</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>HMMER 2.2</p>
            </title>
            <url>http://hmmer.wustl.edu/</url>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Homology-based method for identification of protein repeats using statistical significance estimates.</p>
            </title>
            <aug>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>298</volume>
            <fpage>521</fpage>
            <lpage>537</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.3684</pubid>
                  <pubid idtype="pmpid" link="fulltext">10772867</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>REP</p>
            </title>
            <url>http://www.embl-heidelberg.de/~andrade/papers/rep/search.html</url>
         </bibl>
      </refgrp>
   </bm>
</art>
