<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-330</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>An <it>in silico </it>analysis of T-box regulated genes and T-box evolution in prokaryotes, with emphasis on prediction of substrate specificity of transporters</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Wels</snm>
               <fnm>Michiel</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>michiel.wels@nizo.nl</email>
            </au>
            <au id="A2">
               <snm>Kormelink</snm>
               <mnm>Groot</mnm>
               <fnm>Tom</fnm>
               <insr iid="I2"/>
               <email>tom@cmbi.ru.nl</email>
            </au>
            <au id="A3">
               <snm>Kleerebezem</snm>
               <fnm>Michiel</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>michiel.kleerebezem@nizo.nl</email>
            </au>
            <au id="A4">
               <snm>Siezen</snm>
               <mi>J</mi>
               <fnm>Roland</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>roland.siezen@nizo.nl</email>
            </au>
            <au id="A5">
               <snm>Francke</snm>
               <fnm>Christof</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>c.francke@cmbi.ru.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>TI Food and Nutrition, Wageningen, The Netherlands</p>
            </ins>
            <ins id="I2">
               <p>CMBI, Radboud University Nijmegen-Medical Centre/NCMLS, Nijmegen, The Netherlands</p>
            </ins>
            <ins id="I3">
               <p>NIZO food research, Ede, The Netherlands</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>330</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/330</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18625071</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-330</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>29</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>14</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Wels et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>T-box anti-termination is an elegant and sensitive mechanism by which many bacteria maintain constant levels of amino acid-charged tRNAs. The amino acid specificity of the regulatory element is related to a so-called specifier codon and can in principle be used to guide the functional annotation of the genes controlled via the T-box anti-termination mechanism.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Hidden Markov Models were defined to search the T-box regulatory element and were applied to all completed prokaryotic genomes. The vast majority of the genes found downstream of the retrieved elements encoded functionalities related to transport and synthesis of amino acids and the charging of tRNA. This is completely in line with findings reported in literature and with the proposed biological role of the regulatory element. For several species, the functional annotation of a large number of genes encoding proteins involved in amino acid transport could be improved significantly on basis of the amino acid specificity of the identified T-boxes. In addition, these annotations could be extrapolated to a larger number of orthologous systems in other species. Analysis of T-box distribution confirmed that the element is restricted predominantly to species of the phylum Firmicutes. Furthermore, it appeared that the distribution was highly species specific and that in the case of amino acid transport some boxes seemed to "pop-up" only recently.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We have demonstrated that the identification of the molecular specificity of a regulatory element can be of great help in solving notoriously difficult annotation issues, e.g. by defining the substrate specificity of genes encoding amino acid transporters on basis of the amino acid specificity of the regulatory T-box. Furthermore, our analysis of the species-dependency of the occurrence of specific T-boxes indicated that these regulatory elements propagate in a semi-independent way from the genes that they control.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transcription anti-termination is a regulatory mechanism commonly encountered in all lineages within the bacterial kingdom (see e.g. <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>). In transcription anti-termination, the regulation of transcription occurs after the initiation of RNA synthesis, but before transcription of the coding region. The mechanism of anti-termination involves a structural change in the RNA transcript that is dependent on the interaction of the transcript with, for instance, a regulatory protein <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, a tRNA <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> or a metabolite <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The structural elements that compose these anti-terminators are encoded by conserved sequences on the DNA and can be found by searches for the related sequence motifs in upstream regions of regulated genes <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
         <p>A well-studied anti-termination element is the so-called T-box. T-box anti-termination is an elegant and sensitive mechanism by which many bacteria maintain constant levels of tRNA charged with amino acids <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr></abbrgrp>. When there is a sufficient supply of charged tRNA in a cell, the T-box folds into a terminator structure, thereby blocking further transcription. Transcription can only proceed upon conversion into an anti-terminator structure, which is induced by binding of a highly conserved 5'-NCCA-3' of the uncharged tRNA with a conserved '5-UGGN-3' sequence in the T-box <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Although anti-terminator formation involves contacts between many nucleotides, the specificity of the interaction seems largely dependent on the interaction of a tri-nucleotide (anti-anti)-codon in the so-called specifier loop of the T-box with the anti-codon of an amino acid-specific tRNA <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. The structural and kinetic details of this interaction have been well-studied <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. The appropriate assignment of the specifier codon has been used previously to improve the functional annotation of various genes located downstream of the T-box <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. The T-box controlled genes identified thus far encode functionalities that reflect perfectly the pivotal role of uncharged tRNAs in the regulatory mechanism. These functionalities include not only tRNA ligation, but also amino acid biosynthesis and transport <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B17">17</abbr><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B28">28</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. The encoded proteins are involved in modulation of the level of uncharged tRNA in the cell, either directly by charging the corresponding tRNA with its cognate amino acid or indirectly by controlling the intracellular concentration of the specific amino acid.</p>
         <p>To date, T-boxes have been identified predominantly in the genomes of bacterial species of the phyla <it>Firmicutes </it>(including <it>Mollicutes</it>) and <it>Actinobacteria </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, although anti-termination systems have been argued to be among the oldest regulatory systems in bacteria because of their independence of regulatory proteins <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. To investigate this further, we have explored the occurrence of T-boxes in all sequenced prokaryotic genomes. To circumvent potential differences between T-box systems in different bacterial lineages, an iterative HMM-based identification search was performed using the best conserved region of the T-box sequence. Species- and amino acid-specific T-box regulation networks were reconstructed. Most importantly, the acquired knowledge on amino acid specificity could be used to propose an improved functional annotation for many T-box controlled genes and to shed light on the evolution of the regulatory element itself.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>I) A comprehensive collection of T-boxes</p>
            </st>
            <p>The analysis of the taxonomic and functional distribution of T-boxes was started by <it>de novo </it>identification of T-box motif characteristics. Conserved nucleotide sequence motifs upstream of tRNA-ligase encoding genes in species of the phylum <it>Firmicutes </it>were recovered and used to identify T-boxes located at other positions in the same genome as well as in the genomes of other species (see methods). These searches showed that a T-box could be specified best by a 30 nt motif that is extremely well-conserved and positioned in the 3'-region of the terminator/anti-terminator loop (motif 1 in Figure <figr fid="F1">1</figr>). In fact, this motif is known as 'the T-box sequence' since its discovery <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Later it was recognized that this conserved region belongs to a larger conserved RNA structure known as the T-box element <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This element contains four other highly conserved regions (see Figure <figr fid="F1">1</figr> motifs 2&#8211;5).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Sequence logo <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> visualization of the 5 different T-box motifs</p>
               </caption>
               <text>
                  <p><b>Sequence logo</b><abbrgrp><abbr bid="B64">64</abbr></abbrgrp><b>visualization of the 5 different T-box motifs.</b> Both the consensus sequence and relative conservation of individual residues is displayed. Motif 1 (information content: 19.2 bits) displays the motif used to perform the T-box identification. Validation was performed by checking the presence of motif 2 (29.7 bits) or 3 (20.9 bits) together with motifs 4 (31.4 bits) and 5 (25.8 bits). Motif 1 includes the (a-specific) tRNA interaction site (T-box sequence, consensus GGTGG) located in the antiterminator loop. The other motifs include different parts of the specifier loop <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B26">26</abbr></abbrgrp>: GNTG- and AG-box in motif 2 and 4, TGA-, AGGA- and AGTA-box in motif 3 and a conserved part of the specifier loop in motif 5 (GAG). The specifier codon is to be found within 1 &#8211; 5 nucleotides upstream of this conserved GAG.</p>
               </text>
               <graphic file="1471-2164-9-330-1"/>
            </fig>
            <p>The initial search showed prominent variations in the number of T-boxes per genome between different classes of the phylum <it>Firmicutes </it>and between different phyla. Therefore, additional searches with phylum-specific and class-specific T-box HMMs were performed, but generally did not yield novel hits. Only in the case of the <it>Clostridia </it>a limited number of 10 additional T-boxes were identified. Further iterations did not expand the dataset. Visual inspection of the upstream regions of all genes encoding a t-RNA ligase in the Firmicutes indicated that indeed all those regions that contain the distinctive T-box motifs were identified by our algorithm. A comparison of the number of T-boxes identified by us for a representative set of organisms with the number obtained using the Rfam T-box model <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> proved that our recovery procedure was very efficient (see methods for details).</p>
            <sec>
               <st>
                  <p>Identification of the specifier codon and amino acid specificity</p>
               </st>
               <p>Although T-boxes were readily identified, it was more difficult to define their amino acid specificity. To that end, the phylogeny of homologous genes preceded by a T-box from different species was determined and the upstream regions corresponding to each orthologous group were aligned. In all the cases of T-boxes for which the specifier codon had been identified experimentally <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B21">21</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B33">33</abbr></abbrgrp>, we observed that the specifier codon aligned perfectly within the related orthologous sequences. In fact, this was true for almost all orthologous groups of sequences. Moreover, most of the alignments could easily be clustered by eye into larger groups for which the specifier codon remained directly apparent from the alignment. The resulting alignments and the annotation of the specifier codon can be found at <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Nevertheless, there remained a few less clear cases. For some (~5%) a secondary structure prediction could be used to provide the additional information required to define the specifier codon in the specifier loop <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Taken together, a specifier codon could be identified directly for over 90% of the identified T-boxes.</p>
            </sec>
            <sec>
               <st>
                  <p>Codon usage in the specifier codon</p>
               </st>
               <p>Most amino acids are encoded by multiple codons. Leu for instance, is encoded by six different codons (CUA, CUU, CUG, CUC, UUA and UUG). Remarkably, the T-boxes had a conserved preference for certain codons within as well as between species (Additional file <supplr sid="S1">1</supplr>). Evaluation of these preferences showed that they complied almost perfectly with the rules observed by Elf et al. for the codon usage by <it>E. coli </it><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. In an elegant study these authors analyzed the dependence of the charging of various codon-specific tRNAs on the use of various codons in particular proteins. They concluded that: "when codon reading is part of a control loop that regulates synthesis of missing amino acid, the translation rate of the selected codon should be as sensitive as possible to starvation" <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. And, in their paper they showed which codons are the most sensitive in <it>E. coli</it>. We found that for all but one of the most predominantly used specifier codons in T-boxes, the corresponding tRNA is among the highest in sensing shortage of that specific amino acid in <it>E. coli </it>as reported by <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The only exception was the T-box codon for Ala (GCU). Therefore, assuming the conclusions by Elf et al. are also valid for Gram-positive bacteria, our findings suggest that the codons that are sensitive to depletion are preferentially used in T-box regulation.</p>
               <suppl id="S1">
                  <title>
                     <p>Additional file 1</p>
                  </title>
                  <text>
                     <p>T-box location organized per genome. The occurrence of T-boxes in all analyzed genomes is shown in detail. Position, direction, e-value and specifier codon are shown for the T-box, together with position, name and function of the first gene located downstream and the size of the operon located downstream. All T-boxes are color coded according to the function of the genes located downstream: red; tRNA synthesis; green; amino acid biosynthesis; blue; amino acid transport and purple: other/unknown.</p>
                  </text>
                  <file name="1471-2164-9-330-S1.xls">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Functionalities controlled by a T-box</p>
               </st>
               <p>As expected, the proposed regulatory role of the T-box elements appeared to be perfectly reflected by the genes under their control. The majority of the T-boxes (62%) were found to precede genes encoding tRNA ligases, while most others were found upstream of genes encoding proteins involved in amino acid transport (12%) or amino acid biosynthesis (18%). The remaining T-boxes (8%; 71 genes in total) were found upstream of genes encoding proteins with unknown function (54 genes), or a function that lacks an apparent relation to amino acid metabolism (17 genes). A complete and species-specific subdivision of T-boxes based on function prediction of the proteins encoded downstream and a list of genes with no apparent relation to amino acid metabolism is provided in the supplementary material, in Additional files <supplr sid="S1">1</supplr> and <supplr sid="S2">2</supplr>.</p>
               <suppl id="S2">
                  <title>
                     <p>Additional file 2</p>
                  </title>
                  <text>
                     <p>T-box distribution among different bacterial species. For all species the number of T-boxes is displayed, divided over four different categories (tRNA synthesis, amino acid transport, amino acid biosynthesis and other). Between brackets the number of regulated genes is shown. This number is based on the operon structure of the genes downstream of the T-box. In case multiple strains of a specific species were sequenced, these are only shown when differences between the strains were observed.</p>
                  </text>
                  <file name="1471-2164-9-330-S2.xls">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
         <sec>
            <st>
               <p>II) The use of regulator specificity to improve annotation of molecular function and biological role</p>
            </st>
            <p>We made two important observations: i) In all cases, the T-boxes identified upstream of the genes encoding a tRNA ligase contained a specifier codon that corresponded with the amino acid specificity of the ligase; and ii) in all other cases where the function of the protein encoded by the gene downstream of the T-box had experimentally been verified, the specifier-codon corresponded to the established functionality of the gene. These observations implied that the employed method for the identification of T-box specificity was reliable and, consequently, that predicted T-box specificities could be extrapolated to the molecular function of the protein encoded by the gene located downstream, as had occasionally been done before. Many of the genes preceded by a T-box had not been specifically annotated to date in the sense that, although the functional category was often evident (e.g. proton symport, ABC transport family, etc.), a specific molecular function had not been attributed. In fact, more than two-third of the non-tRNA ligase genes preceded by a T-box lacked such a specific annotation of molecular function. As importantly, the functional annotation of the genes could be extended to a different level entirely by using the knowledge on T-box (regulator) specificity, as this knowledge discloses (in part) under which conditions the regulated genes will play their biological role (for the distinction between molecular function and biological role see Francke et al. <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>).</p>
            <sec>
               <st>
                  <p>a) T-box regulation of amino acid transport</p>
               </st>
               <p>Many of the genes encoding amino acid transporters were found to be preceded by a T-box, especially in the genomes of the <it>Lactobacilli </it>and <it>Bacilli </it>of the <it>Bacillus cereus</it>-group. The transporters controlled by T-boxes belonged to no less than seven distinct transporter families (MFS, 2.A.1; APC, 2.A.3; NSS, 2.A.22; DAACS, 2.A.23; LIVCS, 2.A.26; NhaC, 2.A.35; and ABC-cassette, 3.A.1; Transporter Classification described by Saier <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>). Table <tblr tid="T1">1</tblr> gives an overview of the distribution of the transport systems regulated by a T-box over the various Firmicutes species.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Overview of T-box regulated transporter genes in different <it>Firmicutes</it>. The type and number (between brackets) of transporters are displayed per species and according to their predicted specificity.</p>
                  </caption>
                  <tblbdy cols="15">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p><it>B. anthracis </it>Ames 0581</p>
                        </c>
                        <c ca="left">
                           <p><it>B. licheniformis </it>ATCC 14580</p>
                        </c>
                        <c ca="left">
                           <p><it>B. subtilis </it>168</p>
                        </c>
                        <c ca="left">
                           <p><it>O. iheyensis </it>HTE813</p>
                        </c>
                        <c ca="left">
                           <p><it>L. monocytogenes </it>EGD-e</p>
                        </c>
                        <c ca="left">
                           <p><it>L. plantarum </it>WCFS1</p>
                        </c>
                        <c ca="left">
                           <p><it>L. acidophilus </it>NCFM</p>
                        </c>
                        <c ca="left">
                           <p><it>L. johnsonii </it>NCC 533</p>
                        </c>
                        <c ca="left">
                           <p><it>E. faecalis </it>V583</p>
                        </c>
                        <c ca="left">
                           <p><it>L. Lactis </it>IL1403</p>
                        </c>
                        <c ca="left">
                           <p><it>S. pneumoniae </it>R6</p>
                        </c>
                        <c ca="left">
                           <p><it>C. acetobutylicum </it>ATCC824</p>
                        </c>
                        <c ca="left">
                           <p><it>C. perfringens </it>ATCC13124</p>
                        </c>
                        <c ca="left">
                           <p><it>C. tetani </it>E88</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Asn</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Asp</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>His</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Ile</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Leu</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>APC</p>
                        </c>
                        <c ca="left">
                           <p>APC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>NSS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Lys</p>
                        </c>
                        <c ca="left">
                           <p>MFS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Met</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC (5)</p>
                        </c>
                        <c ca="left">
                           <p>ABC (3)</p>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c ca="left">
                           <p>ABC (3)</p>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Phe</p>
                        </c>
                        <c ca="left">
                           <p>NSS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Thr</p>
                        </c>
                        <c ca="left">
                           <p>APC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Trp</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>NSS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Tyr</p>
                        </c>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>NHAC (2)</p>
                        </c>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>NSS</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Val</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>?</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>DAACS</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ABC</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>1|73</p>
                        </c>
                        <c ca="left">
                           <p>8|77</p>
                        </c>
                        <c ca="left">
                           <p>6|48</p>
                        </c>
                        <c ca="left">
                           <p>3|59</p>
                        </c>
                        <c ca="left">
                           <p>4|79</p>
                        </c>
                        <c ca="left">
                           <p>1|57</p>
                        </c>
                        <c ca="left">
                           <p>1|78</p>
                        </c>
                        <c ca="left">
                           <p>1|93</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>1|60</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>APC</p>
                        </c>
                        <c ca="left">
                           <p>1|19</p>
                        </c>
                        <c ca="left">
                           <p>1|20</p>
                        </c>
                        <c ca="left">
                           <p>1|18</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>LIVCS</p>
                        </c>
                        <c ca="left">
                           <p>2|6</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>2|3</p>
                        </c>
                        <c ca="left">
                           <p>2|3</p>
                        </c>
                        <c ca="left">
                           <p>2|2</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>1|1</p>
                        </c>
                        <c ca="left">
                           <p>1|3</p>
                        </c>
                        <c ca="left">
                           <p>2|4</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MFS</p>
                        </c>
                        <c ca="left">
                           <p>1|69</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NHAC</p>
                        </c>
                        <c ca="left">
                           <p>1|4</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>1|3</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>2|2</p>
                        </c>
                        <c ca="left">
                           <p>1|1</p>
                        </c>
                        <c ca="left">
                           <p>1|1</p>
                        </c>
                        <c ca="left">
                           <p>1|2</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NSS</p>
                        </c>
                        <c ca="left">
                           <p>3|4</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>1|5</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>At the bottom the total fraction of regulated transporters is shown per family. ABC: ATP-binding cassette superfamily, APC: Amino acid-polyamine-organocation family, DAACS: dicaboxylate/amino acid:cation symporter family, LIVCS: leucine/isoleucine/valine cation symporter family, MFS: major facilitator superfamily, NHAC: Na<sup>+</sup>:H<sup>+ </sup>antiporter family, NSS: neurotransmitter:sodium family. Classification adopted from Saier <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
                  </tblfn>
               </tbl>
               <p>Overall, for more than 85% of the T-box regulated transporters the functional annotation (molecular function and/or biological role) could be improved as compared to the entries in the reference database of NCBI. A full list can be found in Additional file <supplr sid="S1">1</supplr>. We have limited the substrate specificity definition in our annotation to putatively dominant substrates based on the amino acid specificity of the T-box. However, broader substrate specificity is probably more common for transporters. Especially transport systems consisting of only a permease are expected (and have been shown) to display broader substrate specificity (see <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> for examples), whereas systems that require prior substrate-binding (like in ABC transport) will be more specific. We discuss the T-box based functional annotation of some transport systems in more detail in the following paragraphs and in Additional file <supplr sid="S3">3</supplr>.</p>
               <suppl id="S3">
                  <title>
                     <p>Additional file 3</p>
                  </title>
                  <text>
                     <p>T-box enhanced functional annotation of amino acid transport. Extensive description of the improved annotation of the T-box regulated transporter systems not given in the main text.</p>
                  </text>
                  <file name="1471-2164-9-330-S3.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>The ABC family</p>
               </st>
               <p>T-box regulation of ABC transport systems was found in most lineages of the <it>Firmicutes </it>but not in the <it>Bacilli</it>. The T-box regulated ABC transporters could be sub-divided into four sub-families, based on the specificity of the substrate-binding protein and the permease. A striking use of extensive T-box regulation in ABC transport was observed in <it>L. plantarum</it>. It appears that in the absence of methionine, <it>L. plantarum </it>uses a single mechanism to switch on not only transport of the amino acid itself, but also of the precursors and co-factors needed for its biosynthesis (see Figure <figr fid="F2">2</figr> and Additional file <supplr sid="S3">3</supplr>).</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Overview of T-box-regulated methionine biosynthesis in <it>L. plantarum</it></p>
                  </caption>
                  <text>
                     <p><b>Overview of T-box-regulated methionine biosynthesis in <it>L. plantarum</it>.</b> Reactions coloured in blue are catalyzed by proteins encoded by genes regulated by a T-box. The figure was generared using the the Simpheny software tool (Genomatica, San Diego, USA). Reactions were based on the metabolic model of <it>L plantarum </it>published by <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>.</p>
                  </text>
                  <graphic file="1471-2164-9-330-2"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>The APC family</p>
               </st>
               <p>A T-box was identified in front of an APC-family protein encoding gene in all the studied <it>Bacillus </it>genomes. In <it>B. subtilis </it>and <it>B. licheniformis </it>the gene <it>ybvW </it>is preceded by a Leu T-box, whereas such a box is lacking upstream of the orthologous genes, which are found in <it>E. faecalis</it>, <it>G. kaustophilus </it>and in <it>L. lactis </it>(co-orthologs: <it>yibG </it>and <it>ysjA</it>) (Figure <figr fid="F3">3</figr>). The Leu T-box suggests that the YbvW protein is a Leucine transporter, in line with the general functionality of transporters of the APC family (family characteristics described in <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>). Surprisingly, in the members of the <it>Bacillus cereus</it>-group another APC family gene is preceded by a T-box, specific for threonine. Although an orthologous gene is present in most of the Firmicutes genomes, e.g. <it>ykbA </it>in <it>B. subtilis</it>, it is regulated by a T-box only in the species of the <it>Bacillus cereus</it>-group (Figure <figr fid="F3">3</figr>). The protein encoded by <it>ykbA </it>in <it>B. subtilis </it>has recently been shown to be a Ser/Thr exchanger and was consequently renamed SteT <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. A similar functionality of the protein ortholog in the members of the <it>Bacillus cereus </it>group is supported by the codon identification of the T-box.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Occurrence of T-boxes in relation with different transporter families</p>
                  </caption>
                  <text>
                     <p><b>Occurrence of T-boxes in relation with different transporter families.</b> The figure displays a NJ-tree of the LIVCS-family transporters of the Firmicutes (left) and partial trees related to the APC-, MFS- and NSS-family transporters. It appears T-boxes are only associated with very few proteins of these families and the association appears to be very species-specific. Bootstrap values are given for those clusters that contain T-box regulated systems (indicated in black). Those systems that are controlled by a T-box are colored. For the LIVCS-family the sequences of the experimentally studied transporters from Pseudomonas aeruginosa (BraZ <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>), Corynebacterium glutamicum (BrnQ <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>) and Lactobacillus delbrueckii (BrnQ <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>) were included in the analysis. For the APC-family all B. subtilis sequences were included together with the orthologous clusters containing T-box regulated systems. For the MFS- and NSS-family only the orthologous clusters containing T-box regulated systems are shown. In the case of the MFS-family the asterisk indicates that the upstream sequence of these systems contains a box that seems to be degenerated.</p>
                  </text>
                  <graphic file="1471-2164-9-330-3"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>The LIVCS family</p>
               </st>
               <p>The <it>Bacilli </it>of the <it>Bacillus cereus </it>group, the <it>Lactobacilli </it>and the <it>Clostridia </it>contain several branched-chain amino acid cation symporters of the LIVCS-family <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, some of which are T-box regulated (Figure <figr fid="F3">3</figr>). Since the three branched-chain amino acids share very similar molecular properties (e.g. size and hydrophobicity) we expect that these transporters are not highly specific despite their proposed amino acid specific control, but merely that expression of the "multi-specific" system has been brought under the control of the individual amino acids. Indeed, the orthologous transporters that have been characterized in <it>L. delbrueckii </it>(BrnQ; <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>), <it>C. glutamicum </it>(BrnQ; <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>) and <it>P. aeruginosa </it>(BraZ; <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>) displayed transport of all three branched-chain amino acids.</p>
            </sec>
            <sec>
               <st>
                  <p>The NSS family</p>
               </st>
               <p>Finally, the LIVCS family branched-chain amino acid transporter BraZ of <it>P. aeruginosa</it>, was shown to have a clear preference for isoleucine and valine over leucine <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. In this respect it is noteworthy that in the <it>Bacilli </it>of the <it>Bacillus cereus </it>group the expression of one of the homologs of the NSS family (neurotransmitter:sodium symport) is controlled by a Leu T-box, whereas such a box is lacking for the LIVCS homologs in those species (Figure <figr fid="F4">4</figr>). Besides the Leu T-box regulated NSS transporter, the <it>Bacilli </it>of the <it>Bacillus cereus </it>group contain three other homologs of the same family, two of which are controlled by a Trp and a Phe T-box, respectively (Figure <figr fid="F3">3</figr>). These two amino acids agree well with the experimentally determined tryptophan transport functionality of the NSS homolog TnaT in <it>S. thermophilum </it><abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. The presence of a regulatory T-box ranging from Leu to Trp, and Phe suggest that the members of the NSS transporter family may display a rather broad amino acid specificity.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>T-box regulation of tRNA ligase encoding genes in the <it>Firmicutes</it></p>
                  </caption>
                  <text>
                     <p><b>T-box regulation of tRNA ligase encoding genes in the <it>Firmicutes</it>.</b> The color coding relates to the presence or absence of a T-box upstream of the genes encoding the amino acid-specific tRNA ligases in the various species and strains. Green indicates the tRNA ligase(s) is (are) regulated by a T-box and red that the tRNA ligase(s) is (are) not regulated by a T-box. Although most tRNA ligases are present in one copy on the genome, several organisms contain two, or in some cases three copies of specific ligases (indicated by a number in the box). Orange indicates that 1 of the 2 tRNA ligases is regulated by a T-box or 1 out of 3 in the case of the <it>argS </it>genes in <it>B. cereus </it>ATCC 10987 and the <it>aspS </it>genes in <it>C. acetobutylicum</it>. Light green indicates that the tRNA ligase is not the first in the operon, but is regulated by a T-box with the same specificity. Yellow color coding indicates that the regulated tRNA ligase is the second gene in an operon in combination with another tRNA ligase gene regulated by a T-box with different specificity. White indicates that no tRNA ligase of this type is present in the organism. In principle, a species needs at least one specific tRNA ligase for each amino acid. Nevertheless, there are exceptions. For instance, all but one (<it>Clostridium perfringens</it>) of the analyzed genomes lack the gene that encodes a Gln-tRNA ligase and the genomes of the <it>Chloroflexi, Actinobacteria a</it>nd <it>Thermoanaerobacter tencongens </it>also lack an Asn-tRNA ligase. In these cases, the biological role of the Gln-tRNA ligase is taken over by the Glu tRNA ligase, which couples a Glu residue to the tRNA<sup>Gln</sup>. The residue is subsequently transformed into a Gln by a tRNA specific amidotransferase <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Similarly, an Asn-tRNA<sup>Asn </sup>is formed via transamidation of an Asp residue (Asp-tRNA<sup>Asn </sup>to Asn-tRNA<sup>Asn</sup>) in bacteria that lack an Asn tRNA ligase <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Consequently, we found that all species lacking either the Gln-tRNA ligase or the Asn-tRNA ligase have an orthologous gene coding for the corresponding amidotransferase. No T-boxes were identified upstream of those genes.</p>
                  </text>
                  <graphic file="1471-2164-9-330-4"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>b) Hypothetical proteins controlled by a T-box</p>
               </st>
               <p>Another class of proteins to which significant functional information could be added (when compared to the NCBI-annotation) using the specificity of the detected T-box is that of the so-called hypothetical proteins or unknown function proteins (data accumulated in Table <tblr tid="T2">2</tblr>). Obviously, when orthologous proteins in related species were also of unknown function, specifier codon information clearly improved the annotation. Examples of new annotations related to amino acid biosynthesis or transport are enzymes (methionine synthase, cystathionine gamma synthase, chorismate mutase, anthranilate synthase), transporters (Leu-, Lys- and His-specific permeases), tRNA-ligase related functions, and regulation (anti-TRAP protein).</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Proposed annotation of the genes regulated by a T-box that were assigned as "hypothetical protein" in the original NCBI annotation file.</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="left">
                           <p>
                              <b>Species</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Gene ID</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>T-box</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Proposed function</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Bacillus halodurans </it>C-125</p>
                        </c>
                        <c ca="left">
                           <p>BH0807</p>
                        </c>
                        <c ca="left">
                           <p>Lys</p>
                        </c>
                        <c ca="left">
                           <p>Lysine-specific permease</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Bacillus subtilis </it>168</p>
                        </c>
                        <c ca="left">
                           <p>BSU02530</p>
                        </c>
                        <c ca="left">
                           <p>Trp</p>
                        </c>
                        <c ca="left">
                           <p>Anti TRAP protein</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>BSU34010 (yvbW)</p>
                        </c>
                        <c ca="left">
                           <p>Leu</p>
                        </c>
                        <c ca="left">
                           <p>Leucine-specific permease</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Enterococcus faecalis </it>V583</p>
                        </c>
                        <c ca="left">
                           <p>EF2480</p>
                        </c>
                        <c ca="left">
                           <p>Gly</p>
                        </c>
                        <c ca="left">
                           <p>Gly related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Lactobacillus acidophilus </it>NCFM</p>
                        </c>
                        <c ca="left">
                           <p>LBA1071</p>
                        </c>
                        <c ca="left">
                           <p>Ile</p>
                        </c>
                        <c ca="left">
                           <p>Ile related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Lactobacillus johnsonii </it>NCC 533</p>
                        </c>
                        <c ca="left">
                           <p>LJ0632</p>
                        </c>
                        <c ca="left">
                           <p>Met</p>
                        </c>
                        <c ca="left">
                           <p>5-methyltetrahydropteroyltriglutamate &#8211; homocysteine methyltransferase (Methionine synthase)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Lactobacillus plantarum </it>WCFS1</p>
                        </c>
                        <c ca="left">
                           <p>lp_3283</p>
                        </c>
                        <c ca="left">
                           <p>Met</p>
                        </c>
                        <c ca="left">
                           <p>5-methyltetrahydropteroyltriglutamate &#8211; homocysteine methyltransferase (Methionine synthase)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>Listeria sp</it>.<sup>1</sup></p>
                        </c>
                        <c ca="left">
                           <p>lmo1740</p>
                        </c>
                        <c ca="left">
                           <p>His</p>
                        </c>
                        <c ca="left">
                           <p>Histidine transport system permease protein hisM</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>lmo2587</p>
                        </c>
                        <c ca="left">
                           <p>Met</p>
                        </c>
                        <c ca="left">
                           <p>Met related cytosolic hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Staphylococcus aureus</it>
                              <sup>2</sup>
                           </p>
                        </c>
                        <c ca="left">
                           <p>SA0347</p>
                        </c>
                        <c ca="left">
                           <p>Met</p>
                        </c>
                        <c ca="left">
                           <p>Cystathionine gamma-synthase</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>SA1199</p>
                        </c>
                        <c ca="left">
                           <p>Trp</p>
                        </c>
                        <c ca="left">
                           <p>Anthranilate synthase component I</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Streptococcus agalactiae</it>
                              <sup>3</sup>
                           </p>
                        </c>
                        <c ca="left">
                           <p>SAG0809</p>
                        </c>
                        <c ca="left">
                           <p>Ala</p>
                        </c>
                        <c ca="left">
                           <p>Ala-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Streptococcus pneumoniae</it>
                              <sup>4</sup>
                           </p>
                        </c>
                        <c ca="left">
                           <p>spr0489</p>
                        </c>
                        <c ca="left">
                           <p>Val</p>
                        </c>
                        <c ca="left">
                           <p>Val-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>spr1241</p>
                        </c>
                        <c ca="left">
                           <p>Ala</p>
                        </c>
                        <c ca="left">
                           <p>Ala-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>spr1331</p>
                        </c>
                        <c ca="left">
                           <p>Gly</p>
                        </c>
                        <c ca="left">
                           <p>Gly-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>spr1471</p>
                        </c>
                        <c ca="left">
                           <p>Thr</p>
                        </c>
                        <c ca="left">
                           <p>Thr-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>spr1638</p>
                        </c>
                        <c ca="left">
                           <p>Trp</p>
                        </c>
                        <c ca="left">
                           <p>Trp biosynthesis related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Streptococcus thermophilus</it>
                              <sup>5</sup>
                           </p>
                        </c>
                        <c ca="left">
                           <p>str0474</p>
                        </c>
                        <c ca="left">
                           <p>Val</p>
                        </c>
                        <c ca="left">
                           <p>Val-tRNA ligase related hypothetical</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>str1594</p>
                        </c>
                        <c ca="left">
                           <p>Trp</p>
                        </c>
                        <c ca="left">
                           <p>Chorismate mutase</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p><sup>1 </sup><it>Listeria innocua </it>Clip11262, <it>Listeria monocytogenes </it>EGD-e, <it>L. monocytogenes </it>4bF2365. <sup>2 </sup><it>Staphylococcus aureus </it>MW2, <it>S. aureus </it>N315. <sup>3 </sup><it>Streptococcus agalactiae </it>2603, <it>S. agalactiae </it>A909, <it>S. agalactiae </it>NEM316.<sup>4 </sup><it>Streptococcus pneumoniae </it>TIGR4, <it>S. pneumoniae </it>R6. <sup>5 </sup><it>Streptococcus thermophilus </it>CNRZ106, <it>S. thermophilus </it>LMG18311.</p>
                  </tblfn>
               </tbl>
            </sec>
         </sec>
         <sec>
            <st>
               <p>III) Taxonomic variation and T-box evolution</p>
            </st>
            <p>The comprehensive list of T-boxes that was generated for all sequenced genomes (see Table <tblr tid="T3">3</tblr> for the phylogenetic distribution) confirmed the previous attribution that T-boxes are predominantly encountered in species of the phylum <it>Firmicutes </it>(>95% of the hits)<abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Our analyses uncovered many previously unidentified T-boxes. In species of the class <it>Mollicutes </it>two T-box elements were found, but only in the subclass <it>Endoplasmatales </it><abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, in <it>Proteobacteria </it>(i.e. in <it>Geobacter sulfurreducens </it>and <it>Pelobacter carbinolicus</it>) a typical T-box element was identified upstream of the <it>leuA </it>gene (2-isopropylmalate synthase) in both species. <it>Deinococcus radiodurans </it>contained two T-boxes (related to ile and gly t-RNA ligase) whereas species of the phylum of <it>Chloroflexi </it>(<it>Dehalococcoides CBDB1 </it>and <it>Dehalococcoides ethenogenes </it>195) contained three T-box elements (one related to an ile tRNA ligase and two related to tryptophan biosynthesis (<it>trpE </it>and <it>trpB</it>-like). Earlier analysis of riboswitches in Actinobacteria showed that some species belonging to this phylum contain a T-box upstream of <it>ileS </it><abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. However, <it>Symbiobacterium thermophilum </it>contained not less than eighteen T-boxes, comparable to species of the <it>Firmicutes</it>. This finding is in line with the conclusion of <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> that <it>S. thermophilum </it>is probably more closely related to <it>Firmicutes </it>than to <it>Actinobacteria</it>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>The occurrence of T-boxes in different bacterial phyla. The phyla are taken from the NCBI taxonomy <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Phylum</p>
                     </c>
                     <c ca="left">
                        <p>Genomes sequenced</p>
                     </c>
                     <c ca="left">
                        <p>Genomes with at least one T-box</p>
                     </c>
                     <c ca="left">
                        <p>Number of T-boxes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Firmicutes</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>855</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Actinobacteria</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chloroflexi</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deinococcus/Thermus</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Proteobacteria</p>
                     </c>
                     <c ca="left">
                        <p>125</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cyanobacteria</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chlamydiae</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bacteroidetes/Chlorobi</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spirochaetes</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Planctomycetes</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Aquificiates</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fusobacteria/Thermotogae</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>242</p>
                     </c>
                     <c ca="left">
                        <p>70</p>
                     </c>
                     <c ca="left">
                        <p>897</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>To evaluate the phylogenetic distribution of T-boxes in more detail, the correlation between the presence of T-box regulatory elements and the regulated genes was analyzed for the Firmicutes and will be described shortly in the next sections. Furthermore, the scattered appearance of these regulatory boxes as observed for the various transporter families will be discussed.</p>
            <sec>
               <st>
                  <p>T-box regulation of genes encoding tRNA ligases in the Firmicutes</p>
               </st>
               <p>It appeared (Figure <figr fid="F4">4</figr>) that regulation by T-boxes is conserved in almost all (at least 29 out of 34) <it>Firmicutes </it>for several tRNA ligases (<it>ileS</it>, a <it>laS</it>, <it>serS </it>and <it>thrS</it>), whereas some tRNA ligases (<it>lysS, asnS </it>and <it>gltX</it>; the latter gene encodes a Glu-tRNA ligase (see also the legend of Figure <figr fid="F4">4</figr>)) appeared to be controlled by a T-box in only a few species. The genes encoding the tRNA ligases for cysteine and asparagine were often found as the second gene in a putative operon that was T-box regulated. In several organisms, multiple copies of amino-acid specific tRNA-ligase encoding genes are found and in more than half of the cases (58%) only one of them is subject to T-box regulation.</p>
               <p>A clear phylogenetic effect is observed when the four major orders within the <it>Firmicutes </it>(<it>Bacillales</it>, <it>Clostridia</it>, <it>Lactobacillales </it>and <it>Mollicutes</it>) are compared. This is true for both the number and the type of tRNA-ligase encoding genes regulated by a T-box. Most T-box regulated tRNA-ligase encoding genes are found in the <it>Bacillales </it>and especially in species of the <it>Bacillus cereus </it>group. Within the <it>Lactobacillales</it>, it appears that <it>Streptococci </it>have far less tRNA- ligase encoding genes regulated by T-boxes than <it>Lactobacillus </it>species. The lowest number of tRNA ligases regulated by T-boxes was found for <it>S. thermophilus, S. pneumoniae </it>and some strains of <it>S. pyogenes</it>. The relatively low amount of T-box regulation in these species could be the result of regressive evolution, a process that was suggested to be the underlying mechanism for the large loss of functionally active genes in <it>S. thermophilus </it><abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>T-box regulation of genes involved in amino acid biosynthesis in the Firmicutes</p>
               </st>
               <p>T-box regulation of genes related to amino acid biosynthesis has been described previously for various amino acids <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. Like in the case of the t-RNA ligases, T-box control of amino acid biosynthesis displayed clear phylogenetic patterns (Figure <figr fid="F5">5</figr>). For instance, the biosynthesis of Branched Chain Amino Acids (BCA: isoleucine, leucine, valine) was found to be T-box regulated in <it>Bacillales </it>and <it>Clostridia</it>, whereas several families within the <it>Bacillales </it>(e.g. <it>Staphylococci </it>and <it>Listeria</it>) as well as several <it>Streptococci </it>consistently lack T-box control of BCA biosynthesis. Similarly, we found that the species of the <it>B. cereus </it>group contain a T-box in the upstream region of the tyrosine biosynthesis operon consistent with the experimental data that showed that tyrosine biosynthesis from shikimate is T-box regulated in <it>B. anthracis </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The <it>B. cereus </it>group representatives are the only organisms in our study that encode a phenylalanine-4-hydroxylase ortholog (converting phenylalanine into tyrosine). We found this gene also to be T-box regulated in all members of the <it>B. cereus </it>group.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>T-boxes preceding the genes related to amino acid biosynthesis in <it>Firmicutes</it></p>
                  </caption>
                  <text>
                     <p><b>T-boxes preceding the genes related to amino acid biosynthesis in <it>Firmicutes</it>.</b> Color coding identifies the presence of the biosynthesis pathway and whether it is regulated by a T-box: Green; T-box regulated; red; not T-box regulated; no color; pathway absent. <sup>+</sup>TRAP protein is present. <sup>M </sup>Pathway genes organized in multiple operons. BCA indicates the branched chain amino acids valine, leucine and isoleucine.</p>
                  </text>
                  <graphic file="1471-2164-9-330-5"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>The evolution and propagation of T-boxes</p>
               </st>
               <p>An important observation related to T-box evolution was made by Grundy et al. <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr></abbrgrp>. They showed that a single nucleotide change in the specifier codon of the Tyr T-box of <it>tyrS </it>in <it>B. subtilis </it>was enough to change the amino acid specificity. In addition, while analyzing the distribution of T-boxes over the various transporter families we were struck by the fact that although some of these families are very large, there was only one (or a few) family-member(s) found to be regulated by a T-box and only in a restricted number of species (see Table <tblr tid="T1">1</tblr> and e.g. Figure <figr fid="F3">3</figr>). In our opinion, the only likely scenario to explain the phylogenetically limited occurrence of the transporter T-box associations that would not imply massive loss of the T-box regulation was that of acquisition of the regulatory element by the transporter encoding gene in a specific lineage. Moreover, the results of Grundy et al. <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr></abbrgrp> imply that in principle the T-Box can change specificity easily.</p>
               <p>To analyze this further, the T-boxes preceding the genes that encode the t-RNA-ligases for the branched-chain amino acids (<it>ileS</it>, <it>leuS </it>and <it>valS</it>) were examined. These were chosen because T-box regulation of these genes is most wide-spread (Figure <figr fid="F4">4</figr>) and because the proximal genes themselves, <it>ileS</it>, <it>leuS </it>and <it>valS </it>form a separate tRNA-ligase sub-family with a very clear evolutionary lineage (Figure <figr fid="F6">6A</figr>). In sharp contrast, the NJ-tree of the aligned complete T-boxes (approximately 200 &#8211; 300 nt) appeared extremely unreliable (Figure <figr fid="F6">6B</figr>). This finding implies that amino acid specificity of a T-box is not prominent on the overall sequence level and that the apparent overall sequence variability in time of the T-box is relatively high. Simultaneously, in case T-box sequences do cluster in reliable clusters (high bootstrap support) in a NJ-tree they must therefore be closely related in time.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>The evolutionary relationship between some T-boxes</p>
                  </caption>
                  <text>
                     <p><b>The evolutionary relationship between some T-boxes.</b> (A) shows a putative phylogeny of the branched-chain amino acid tRNA ligases of <it>B. anthracis </it>Ames, <it>B. subtilis </it>168, <it>C. acetobutylicum </it>ATCC824D, <it>L. acidophilus </it>NCFM, <it>L. plantarum </it>WCFS1, <it>L. mesenteroides </it>ATCC8293 and <it>S. aureus </it>Mu50. (B) shows the Neighbor-Joining tree for the related T-boxes. The underlying alignments were made with the complete 200 &#8211; 300 nucleotides of the identified T-boxes. These alignments were homogeneous in the sense that the fully conserved motifs aligned perfectly and that between those conserved elements there were little gaps. Nevertheless, the low bootstrap support for the various branches indicates that this tree is unlikely to reflect the true phylogeny of the regulatory elements. (C) shows the Neighbor-Joining tree for various T-boxes found in <it>B. anthracis </it>Ames. Next to the tree, the part of the corresponding multiple sequence alignment containing the specifier codon (indicated in white letters) is depicted. The amino acid specificity of the specifier codon is color-coded: Red and orange relate to Ile, green to Leu, light blue to Phe, beige to Pro or Gln, pink to Ser, brown to Thr, turquoise to Trp, purple to Tyr and dark blue to Val. The family of the protein encoded by the regulated gene is indicated by the letters that follow the amino acid code. These protein families included the APC, LIVCS, MFS, NhaC and NSS transporter protein families and various tRNA-ligase families (S or Smr for mupirocin resistant tRNA ligase). The NSS-family transport proteins regulated by a Leu, Phe and Trp T-box are in-paralogs characteristic for the species of the <it>Bacillus cereus </it>group. The purple numbers between brackets indicate the bootstrap support for the displayed clusters (out of 1000).</p>
                  </text>
                  <graphic file="1471-2164-9-330-6"/>
               </fig>
               <p>To limit possible obscuring effects of comparing sequences between species, we collected and compared the T-box sequences within species and for clarity restricted the comparison to the T-boxes that accompany transport systems and the related tRNA ligases. For all three analyzed species: <it>B. anthracis, L. acidophilus </it>and <it>L. plantarum</it>. similar phenomena were observed. The multiple sequence alignment of the analyzed T-box sequences and the associated NJ-tree (depicted in Figure <figr fid="F6">6C</figr> for <it>B. anthracis</it>) are strongly suggestive of a close evolutionary relationship between several of the T-boxes. For example in <it>B. anthracis</it>, the Thr T-boxes found in front of <it>BA4970 </it>(transport systems of LIVCS-type) and the Thr-tRNA ligase (<it>BA4820</it>) were highly similar and the same was observed for the Ile T-boxes found in front of another LIVCS homolog and the Ile-tRNA ligase. As the LIVCS homologs appear closely related in time (apparent duplication in the <it>B. cereus </it>group ancestor, see Figure <figr fid="F3">3</figr>), the data imply that the regulatory T-box was not inherited in a similar way but 'acquired' independently. In fact, this explanation fits the observed scattered appearance of the T-boxes for the various transporter families perfectly.</p>
               <p>The results presented in Figure <figr fid="F6">6</figr> are also suggestive of another way in which the T-boxes have evolved. The NJ-tree relates the Phe T-box found in front of one of the NSS family transporters to the Tyr T-box associated with the Tyr-tRNA ligase (Figure <figr fid="F6">6C</figr>). It thus seems that the Tyr T-box of the tRNA ligase was duplicated -as this T-box is present in various Firmicutes species- and has diverged/adapted to control a Phe transporter in the Bacilli of the <it>B. cereus </it>group. In fact, the similarity between the T-box upstream of Tyr-tRNA ligase and the Phe T-box in front of the transporter is higher than between the Tyr T-box and the Tyr T-box preceding <it>BA4353 </it>(NhaC family transporter). This is consistent with the fact that: the Tyr T-box acquisition of the NhaC ortholog should have occurred earlier in history, as the Tyr T-box control of the NhaC ortholog is present in several <it>Firmicutes</it>, and the sequences thus had more time to diverge.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The sequence signature of a T-box is very specific and as a result T-boxes can be readily identified. Using specific T-box HMMs, we identified a large number of the T-boxes and their amino acid specificity in sequenced prokaryote genomes.</p>
         <p>An important aspect of this work is that we show that the prediction of the amino acid specificity of the various T-boxes can be used to improve the functional annotation of a large number of genes. In particular, the functional annotation of genes related to amino acid transport and genes with unknown substrate specificity, genes for which it is normally quite difficult to find functional attributes, could be improved significantly. In our opinion, the procedure of improving annotation through knowledge of the regulatory signals can be generalized and should be used on a much broader scale than currently is being done.</p>
         <p>Riboswitches have been argued to be among the oldest regulatory systems in bacteria because of their independence of regulatory proteins and widespread biological distribution <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. One might therefore have expected that T-boxes are abundantly present among all different lineages of bacteria. This clearly can not be concluded from our results and those presented in other studies <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B55">55</abbr></abbrgrp>. In fact, these regulatory elements can only be found in a few bacterial phyla and only abundantly in the phylum <it>Firmicutes</it>. This implies that either <it>Firmicutes </it>developed T-box regulation after their branching off from the other bacteria or that the other bacteria lost the system soon after the branching off of the <it>Firmicutes </it>to evolve more complex regulatory systems. Which of the two scenarios is most likely remains unclear.</p>
         <p>Nevertheless our data do allow some extrapolation of the propagation of T-boxes within the phylum Firmicutes. We conclude on basis of our observations that the T-boxes have evolved in four clearly distinct ways: i) by co-evolution with the regulated gene or operon; ii) by co-evolution and divergence with the regulated gene or operon to adopt a new specificity; by iii) duplication and insertion of the regulatory element in front of a gene or operon that encodes functions related to the T-box-specified amino acid; and finally iv) by duplication and divergence toward a new amino acid specificity after duplication. In short, this means that every T-box regulatory element acts as a connected yet independent "functional module". The fact that the isoleucine specific T-box connected to the ile-tRNA ligase encoding gene is the only box present in all T-box containing bacteria suggests this box could very well compose the archetype T-box.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sequence information and tools</p>
            </st>
            <p>Genome sequences and annotation files of completely sequenced bacterial and archaeal genomes were downloaded from the NCBI repository <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. In addition, for the analysis of the molecular functions encoded by T-box controlled genes, genomic information was obtained from the ERGO genome analysis and discovery system <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Multiple sequence alignments were created with MUSCLE <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and bootstrapped Neighbor-Joining trees with CLUSTALX <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> (corrected for multiple substitutions). HMMs were constructed from a multiple sequence alignment using HMMER 2.3.2 <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Conserved nucleotide sequence motifs were recovered via MEME <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> (settings: maximally 5 motifs, modus ZOOPs, minimal width 10 nt, maximal width 30 nt) and visualized using Weblogo <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Hidden Markov Models (HMMs) were constructed on basis of the recovered motifs to perform genome-wide searches. Although HMMsearch <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> yielded essentially identical results to the search-tool MAST <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>, in our hands the procedure was much faster and the output was easier to interpret (including motif e-values rather than p-values). RNA secondary structure predictions were performed by Mfold <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> using default settings.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of a general T-box sequence</p>
            </st>
            <p>The T-boxes reported so far in literature were mainly located upstream of tRNA-ligase encoding genes in species of the phylum <it>Firmicutes </it><abbrgrp><abbr bid="B7">7</abbr><abbr bid="B26">26</abbr></abbrgrp>. We therefore initiated the search for a general T-box sequence motif by collecting the nucleotide sequences (300 nt) preceding all tRNA ligases (n = 910) found in sequenced genomes of the <it>Firmicutes</it>. Five characteristic consecutive motifs were recovered in the nucleotide sequences (displayed in Figure <figr fid="F1">1</figr>). They were part of a generic T-box sequence that spans a length of about 250 nt, as described by <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. The best-conserved motif (Figure <figr fid="F1">1</figr>, motif 1, E-value = 1.2e<sup>-2841</sup>), which is located closest to the translation start, had a length of 30 nt and was found in about 61% (n = 553) of the nucleotide sequences. A genome-wide HMMsearch based upon the best-conserved motif yielded 374 new hits. To probe eventual in-homogeneity induced by the fact that only T-boxes associated with tRNA ligases were taken to create the initial T-box HMM, a new HMM based on the non-tRNA ligase associated T-boxes was made and the genomes were searched anew. We found that all t-RNA ligase associated T-boxes were recovered (with an e-value &lt; 1) with this new HMM. This finding implied that the set of recovered T-boxes was relatively homogeneous. The upstream 500 nt of every putative best-conserved T-box motif was then checked for the presence of at least one of the four other conserved T-box specific motifs recovered using MEME. In 40 cases (less than 4% of total) none of the other motifs was detected. Without exception, these were not located in the proximity of a coding sequence and thus unlikely to be related to transcription attenuation. The hits were therefore considered false-positives and were removed. To potentially increase the recovery rate, the remaining 887 T-box sequences were subdivided on basis of the taxonomy of the species and taxonomic class-specific HMMs were built as before. This procedure yielded only 10 additional T-boxes. Finally, the position of all boxes with respect to the start-codon of the proximal gene was determined. It appeared that In 839 cases, motif 1 was found to be located within 300 nucleotides upstream of a predicted gene start. The remaining 48 T-boxes were located 300 to 480 nt upstream of a start-codon. Manual inspection of those T-boxes revealed an overrepresentation of the proximal genes <it>thrZ </it>(11&#215;), <it>trpE </it>(11&#215;), branched amino acid transferase (5&#215;), chorismate mutase (5&#215;) and a sodium symporter (4&#215;). In fact, for both <it>thrZ </it>and <it>trpE </it>it was previously shown that they are preceeded by multiple (2 or 3) adjacently located T-boxes in different <it>bacilli </it><abbrgrp><abbr bid="B3">3</abbr><abbr bid="B29">29</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Functional classification of the genes regulated by T-boxes</p>
            </st>
            <p>The genes downstream of the recovered T-boxes were divided into four different classes on basis of the gene annotation information of the proximal gene: 'tRNA ligation', 'amino acid biosynthesis, 'amino acid transport' and 'other'. In those cases the T-box preceded an operon (45%) it appeared that most of these operons (>75%) contained genes of only one functional class.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison with the Rfam T-box model</p>
            </st>
            <p>The Rfam database <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> is an excellent reference database to start the identification of specific RNA-motifs. To evaluate the performance of our recovery procedure, we compared our results with predictions based on the Rfam T-box HMM. Unfortunately, a direct comparison with the contents of the database proved not informative as the Rfam database includes all nucleotide sequences present in the TREMBL database, which includes many genome fragments or partially sequenced genomes, whereas our analysis was restricted to completed genomes.</p>
            <p>Therefore, we used the Rfam model to search for T-boxes against the same set of complete genome sequences that was used for our analysis. A striking difference between the Rfam-search and our analysis was that the Rfam-search was computationally far more expensive (more than three weeks on an 8 node,16 core, linux cluster compared to 16 hours on a 2 core linux system). For the selected species we predicted 883 T-boxes characterized by the presence of all the characteristic motifs, of which 835 (95%) were within the first 300 nt upstream of a gene start. When using a cut-off of 53.000 bits (described by Rfam as reliable) only 501 (60%) of the 883 T-boxes were predicted using Rfam. In addition to these 501 shared T-boxes Rfam identified 8 additional genuine T-boxes (~0.9 gain) within the first 300 nt. upstream of a gene start. At a lower cutoff-value (25.000 bits), chosen such that ~81% (683) of the boxes identified by us were recovered, the prediction with our HMM and the Rfam results were more alike albeit that Rfam now also yielded a considerable number of false positive identifications: only ~90% of the total number of T-boxes was found within the first 300 nt. upstream of a gene start. Using this cutoff, 6 additional genuine T-boxes were found (~0.7% gain) to be present within the first 300 nt. upstream of a gene start. When applying no cutoff, the Rfam-search retrieved 89% (745) of the T-boxes that were found in our analysis. However, in this case the number of putative false-positives (not located within 300 nt upstream of a gene start) had increased to 65% of the total.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MW conceived, designed and carried out the tRNA-ligase and amino acid biosynthesis analysis, Rfam comparison and drafted and revised the manuscript. TGK carried out the T-box detection and specificity analysis, RJS and MK helped in the design and coordination of the research and drafting and revising the manuscript. CF helped in the design and coordination of the research, carried out the transporter analysis and helped drafting and revising the manuscript. All authors have read and approved the final manuscript</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Victor de Jager for his assistance in applying the Rfam calculations, Jos Boekhorst and Richard Notebaart for insightful discussions. Christof Francke gratefully acknowledges the support of NBIC/the Netherlands Genomics Initiative via the Kluyver Centre for Genomics of Industrial Fermentations and the BioRange program. For our work we used the life-science grid, which is part of the Dutch e-Science Grid (BIG GRID) and which is a collaborative effort of Netherlands National Computing Facilities foundation (NCF), the Netherlands Bioinformatics Centre (NBIC) and the National Institute for Nuclear Physics and High Energy Physics (NIKHEF).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Regulation by transcription attenuation in bacteria: how RNA provides instructions for transcription termination/antitermination decisions</p>
            </title>
            <aug>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2002</pubdate>
            <volume>24</volume>
            <issue>8</issue>
            <fpage>700</fpage>
            <lpage>707</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.10125</pubid>
                  <pubid idtype="pmpid" link="fulltext">12210530</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Transcription attenuation: a highly conserved regulatory strategy used by bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Merino</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>5</issue>
            <fpage>260</fpage>
            <lpage>264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.03.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15851059</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>New insights into regulation of the tryptophan biosynthetic operon in Gram-positive bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Gutierrez-Preciado</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Merino</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>8</issue>
            <fpage>432</fpage>
            <lpage>436</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.06.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">15953653</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A protein-dependent riboswitch controlling ptsGHI operon expression in Bacillus subtilis: RNA structure rather than sequence provides interaction specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Schilling</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Langbein</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schmalisch</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Stulke</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>9</issue>
            <fpage>2853</fpage>
            <lpage>2864</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419612</pubid>
                  <pubid idtype="pmpid" link="fulltext">15155854</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh611</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Monitoring uncharged tRNA during transcription of the <it>Bacillus subtilis glyQS</it> gene</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Yousef</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>346</volume>
            <issue>1</issue>
            <fpage>73</fpage>
            <lpage>81</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.11.051</pubid>
                  <pubid idtype="pmpid" link="fulltext">15663928</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Comparative genomics of thiamin biosynthesis in procaryotes. New genes and regulatory mechanisms</p>
            </title>
            <aug>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <issue>50</issue>
            <fpage>48949</fpage>
            <lpage>48959</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M208965200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12376536</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Conserved regulatory motifs in bacteria: riboswitches and beyond</p>
            </title>
            <aug>
               <au>
                  <snm>Abreu-Goodger</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ontiveros-Palacios</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ciria</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Merino</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>10</issue>
            <fpage>475</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.08.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">15363900</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The T box and S box transcription termination control systems</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Front Biosci</source>
            <pubdate>2003</pubdate>
            <volume>8</volume>
            <fpage>d20</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2741/908</pubid>
                  <pubid idtype="pmpid" link="fulltext">12456320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>tRNA as a positive regulator of transcription antitermination in B. subtilis</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1993</pubdate>
            <volume>74</volume>
            <issue>3</issue>
            <fpage>475</fpage>
            <lpage>482</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(93)80049-K</pubid>
                  <pubid idtype="pmpid" link="fulltext">8348614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Aminoacyl-tRNA synthetase gene regulation in <it>Bacillus subtilis</it>: induction, repression and growth-rate regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Putzer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Laalami</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brakhage</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Condon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grunberg-Manago</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1995</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>709</fpage>
            <lpage>718</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1995.tb02432.x</pubid>
                  <pubid idtype="pmpid">7476165</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>tRNATrp as a key element of antitermination in the <it>Lactococcus lactis trp</it> operon</p>
            </title>
            <aug>
               <au>
                  <snm>van de Guchte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ehrlich</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Chopin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>29</volume>
            <issue>1</issue>
            <fpage>61</fpage>
            <lpage>74</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1998.00903.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9701803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Specificity of tRNA-mRNA interactions in <it>Bacillus subtilis</it><it>tyrS</it> antitermination</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Hodil</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Rollins</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <issue>8</issue>
            <fpage>2587</fpage>
            <lpage>2594</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">179008</pubid>
                  <pubid idtype="pmpid" link="fulltext">9098057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Interaction between the acceptor end of tRNA and the T box stimulates antitermination in the <it>Bacillus subtilis</it><it>tyrS</it> gene: a new role for the discriminator base</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Rollins</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1994</pubdate>
            <volume>176</volume>
            <issue>15</issue>
            <fpage>4518</fpage>
            <lpage>4526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">196270</pubid>
                  <pubid idtype="pmpid" link="fulltext">8045882</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A CUC triplet confers leucine-dependent regulation of the <it>Bacillus subtilis</it> ilv-leu operon</p>
            </title>
            <aug>
               <au>
                  <snm>Marta</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Ladner</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Grandoni</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1996</pubdate>
            <volume>178</volume>
            <issue>7</issue>
            <fpage>2150</fpage>
            <lpage>2153</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">177919</pubid>
                  <pubid idtype="pmpid" link="fulltext">8606198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>tRNA-mediated transcription antitermination in vitro: codon-anticodon pairing independent of the ribosome</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Winkler</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>17</issue>
            <fpage>11121</fpage>
            <lpage>11126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123220</pubid>
                  <pubid idtype="pmpid" link="fulltext">12165569</pubid>
                  <pubid idtype="doi">10.1073/pnas.162366799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p><it>In vitro</it> and <it>in vivo</it> secondary structure probing of the thrS leader in <it>Bacillus subtilis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Luo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Condon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grunberg-Manago</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Putzer</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1998</pubdate>
            <volume>26</volume>
            <issue>23</issue>
            <fpage>5379</fpage>
            <lpage>5387</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148014</pubid>
                  <pubid idtype="pmpid" link="fulltext">9826762</pubid>
                  <pubid idtype="doi">10.1093/nar/26.23.5379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p><it>In vivo</it> and <it>in vitro</it> processing of the <it>Bacillus subtilis</it> transcript coding for glutamyl-tRNA synthetase, serine acetyltransferase, and cysteinyl-tRNA synthetase</p>
            </title>
            <aug>
               <au>
                  <snm>Pelchat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lapointe</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>1999</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>281</fpage>
            <lpage>289</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1369759</pubid>
                  <pubid idtype="pmpid" link="fulltext">10024179</pubid>
                  <pubid idtype="doi">10.1017/S1355838299980858</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Kinetic analysis of tRNA-directed transcription antitermination of the <it>Bacillus subtilis</it> glyQS gene<it> in vitro</it></p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>16</issue>
            <fpage>5392</fpage>
            <lpage>5399</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">490933</pubid>
                  <pubid idtype="pmpid" link="fulltext">15292140</pubid>
                  <pubid idtype="doi">10.1128/JB.186.16.5392-5399.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>tRNA determinants for transcription antitermination of the <it>Bacillus subtilis</it> tyrS gene</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Rollins</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>2000</pubdate>
            <volume>6</volume>
            <issue>8</issue>
            <fpage>1131</fpage>
            <lpage>1141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1369987</pubid>
                  <pubid idtype="pmpid" link="fulltext">10943892</pubid>
                  <pubid idtype="doi">10.1017/S1355838200992100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Identity elements in tRNA-mediated transcription antitermination: implication of tRNA D- and T-arms in mRNA recognition</p>
            </title>
            <aug>
               <au>
                  <snm>van de Guchte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ehrlich</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Chopin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2001</pubdate>
            <volume>147</volume>
            <issue>Pt 5</issue>
            <fpage>1223</fpage>
            <lpage>1233</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11320125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Structural transitions induced by the interaction between tRNA(Gly) and the <it>Bacillus subtilis glyQS</it> T box leader RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Yousef</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>349</volume>
            <issue>2</issue>
            <fpage>273</fpage>
            <lpage>287</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2005.03.061</pubid>
                  <pubid idtype="pmpid" link="fulltext">15890195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>In vitro selection to identify determinants in tRNA for <it>Bacillus subtilis tyrS</it> T box antiterminator mRNA binding</p>
            </title>
            <aug>
               <au>
                  <snm>Fauzi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jack</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Hines</snm>
                  <fnm>JV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>8</issue>
            <fpage>2595</fpage>
            <lpage>2602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1090546</pubid>
                  <pubid idtype="pmpid" link="fulltext">15879350</pubid>
                  <pubid idtype="doi">10.1093/nar/gki546</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Comparative genomics of the methionine metabolism in Gram-positive bacteria: a variety of regulatory systems</p>
            </title>
            <aug>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>11</issue>
            <fpage>3340</fpage>
            <lpage>3353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">443535</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215334</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh659</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Regulation of biosynthesis and transport of aromatic amino acids in low-GC Gram-positive bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Panina</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2003</pubdate>
            <volume>222</volume>
            <issue>2</issue>
            <fpage>211</fpage>
            <lpage>220</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12770710</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Analysis of the <it>Bacillus subtilis tyrS</it> gene: conservation of a regulatory sequence in multiple tRNA synthetase genes</p>
            </title>
            <aug>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1992</pubdate>
            <volume>174</volume>
            <issue>4</issue>
            <fpage>1299</fpage>
            <lpage>1306</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">206425</pubid>
                  <pubid idtype="pmpid" link="fulltext">1735721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Conservation of a transcription antitermination mechanism in aminoacyl-tRNA synthetase and amino acid biosynthesis genes in gram-positive bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>235</volume>
            <issue>2</issue>
            <fpage>798</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.1038</pubid>
                  <pubid idtype="pmpid" link="fulltext">8289305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Sequence requirements for terminators and antiterminators in the T box transcription antitermination system: disparity between conservation and functional requirements</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Moir</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Haldeman</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>7</issue>
            <fpage>1646</fpage>
            <lpage>1655</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">101844</pubid>
                  <pubid idtype="pmpid" link="fulltext">11917026</pubid>
                  <pubid idtype="doi">10.1093/nar/30.7.1646</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Regulation of expression of the <it>Lactococcus lactis</it> histidine operon</p>
            </title>
            <aug>
               <au>
                  <snm>Delorme</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ehrlich</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Renault</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1999</pubdate>
            <volume>181</volume>
            <issue>7</issue>
            <fpage>2026</fpage>
            <lpage>2037</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">93613</pubid>
                  <pubid idtype="pmpid" link="fulltext">10094678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Co-ordinate expression of the two threonyl-tRNA synthetase genes in <it>Bacillus subtilis</it>: control by transcriptional antitermination involving a conserved regulatory sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Putzer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gendron</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Grunberg-Manago</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1992</pubdate>
            <volume>11</volume>
            <issue>8</issue>
            <fpage>3117</fpage>
            <lpage>3127</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">556796</pubid>
                  <pubid idtype="pmpid">1379177</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Analysis of the <it>Bacillus subtilis</it> genome sequence reveals nine new T-box leaders</p>
            </title>
            <aug>
               <au>
                  <snm>Chopin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Biaudet</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ehrlich</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>29</volume>
            <issue>2</issue>
            <fpage>662</fpage>
            <lpage>664</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1998.00912.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9720882</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Genome wide identification of regulatory motifs in <it>Bacillus subtilis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Mwangi</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Siggia</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>18</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165661</pubid>
                  <pubid idtype="pmpid" link="fulltext">12749771</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-18</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>tRNA-directed transcription antitermination</p>
            </title>
            <aug>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1994</pubdate>
            <volume>13</volume>
            <issue>3</issue>
            <fpage>381</fpage>
            <lpage>387</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1994.tb00432.x</pubid>
                  <pubid idtype="pmpid">7527891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The <it>Staphylococcus aureus ileS</it> gene, encoding isoleucyl-tRNA synthetase, is a member of the T-box family</p>
            </title>
            <aug>
               <au>
                  <snm>Grundy</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Haldeman</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Hornblow</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Chalker</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <issue>11</issue>
            <fpage>3767</fpage>
            <lpage>3772</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">179176</pubid>
                  <pubid idtype="pmpid" link="fulltext">9171428</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A Bacillus subtilis operon containing genes of unknown function senses tRNATrp charging and regulates expression of the genes of tryptophan biosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Sarsero</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Merino</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <issue>6</issue>
            <fpage>2656</fpage>
            <lpage>2661</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15985</pubid>
                  <pubid idtype="pmpid" link="fulltext">10706627</pubid>
                  <pubid idtype="doi">10.1073/pnas.050578997</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Riboswitches: the oldest mechanism for the regulation of gene expression?</p>
            </title>
            <aug>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>1</issue>
            <fpage>44</fpage>
            <lpage>50</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2003.11.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">14698618</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Rfam: annotating non-coding RNAs in complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database issue</issue>
            <fpage>D121</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540035</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608160</pubid>
                  <pubid idtype="doi">10.1093/nar/gki081</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>T-box data</p>
            </title>
            <url>http://www.cmbi.ru.nl/T_box_analysis</url>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Using reliability information to annotate RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jacobson</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>1998</pubdate>
            <volume>4</volume>
            <issue>6</issue>
            <fpage>669</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1369649</pubid>
                  <pubid idtype="pmpid" link="fulltext">9622126</pubid>
                  <pubid idtype="doi">10.1017/S1355838298980116</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Selective charging of tRNA isoacceptors explains patterns of codon usage</p>
            </title>
            <aug>
               <au>
                  <snm>Elf</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tenson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ehrenberg</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <issue>5626</issue>
            <fpage>1718</fpage>
            <lpage>1722</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1083811</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805541</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Reconstructing the metabolic network of a bacterium from its genome</p>
            </title>
            <aug>
               <au>
                  <snm>Francke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Siezen</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Teusink</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>13</volume>
            <issue>11</issue>
            <fpage>550</fpage>
            <lpage>558</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tim.2005.09.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">16169729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A functional-phylogenetic classification system for transmembrane solute transporters</p>
            </title>
            <aug>
               <au>
                  <snm>Saier</snm>
                  <fnm>MH</fnm>
                  <suf>Jr.</suf>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2000</pubdate>
            <volume>64</volume>
            <issue>2</issue>
            <fpage>354</fpage>
            <lpage>411</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98997</pubid>
                  <pubid idtype="pmpid" link="fulltext">10839820</pubid>
                  <pubid idtype="doi">10.1128/MMBR.64.2.354-411.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Membrane potential-generating malate (MleP) and citrate (CitP) transporters of lactic acid bacteria are homologous proteins. Substrate specificity of the 2-hydroxycarboxylate transporter family</p>
            </title>
            <aug>
               <au>
                  <snm>Bandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ansanay</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rachidi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dequin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lolkema</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1997</pubdate>
            <volume>272</volume>
            <issue>29</issue>
            <fpage>18140</fpage>
            <lpage>18146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.272.29.18140</pubid>
                  <pubid idtype="pmpid" link="fulltext">9218448</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Cloning and characterization of <it>brnQ</it>, a gene encoding a low-affinity, branched-chain amino acid carrier in <it>Lactobacillus delbruckii</it> subsp. lactis DSM7290</p>
            </title>
            <aug>
               <au>
                  <snm>Stucky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hagting</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klein</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Matern</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Henrich</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Konings</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Plapp</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1995</pubdate>
            <volume>249</volume>
            <issue>6</issue>
            <fpage>682</fpage>
            <lpage>690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00418038</pubid>
                  <pubid idtype="pmpid">8544834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The amino acid/polyamine/organocation (APC) superfamily of transporters specific for amino acids, polyamines and organocations</p>
            </title>
            <aug>
               <au>
                  <snm>Jack</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Saier</snm>
                  <fnm>MH</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2000</pubdate>
            <volume>146 ( Pt 8)</volume>
            <fpage>1797</fpage>
            <lpage>1814</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10931886</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Functional and structural characterization of the first prokaryotic member of the L-amino acid transporter (LAT) family: a model for APC transporters</p>
            </title>
            <aug>
               <au>
                  <snm>Reig</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>del Rio</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Casagrande</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ratera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gelpi</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Torrents</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Zorzano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fotiadis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Palacin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2007</pubdate>
            <volume>282</volume>
            <issue>18</issue>
            <fpage>13270</fpage>
            <lpage>13281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M610695200</pubid>
                  <pubid idtype="pmpid" link="fulltext">17344220</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>TransportDB: a relational database of cellular membrane transport systems</p>
            </title>
            <aug>
               <au>
                  <snm>Ren</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D284</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308751</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681414</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh016</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Transport of branched-chain amino acids in <it>Corynebacterium glutamicum</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ebbighausen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Weil</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kramer</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Arch Microbiol</source>
            <pubdate>1989</pubdate>
            <volume>151</volume>
            <issue>3</issue>
            <fpage>238</fpage>
            <lpage>244</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00413136</pubid>
                  <pubid idtype="pmpid">2705860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Isoleucine uptake in Corynebacterium glutamicum ATCC 13032 is directed by the brnQ gene product</p>
            </title>
            <aug>
               <au>
                  <snm>Tauch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hermann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Burkovski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kramer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Puhler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kalinowski</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Arch Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>169</volume>
            <issue>4</issue>
            <fpage>303</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002030050576</pubid>
                  <pubid idtype="pmpid" link="fulltext">9531631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Isolation of the braZ gene encoding the carrier for a novel branched-chain amino acid transport system in Pseudomonas aeruginosa PAO</p>
            </title>
            <aug>
               <au>
                  <snm>Hoshino</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kose-Terai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Uratani</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1991</pubdate>
            <volume>173</volume>
            <issue>6</issue>
            <fpage>1855</fpage>
            <lpage>1861</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">207713</pubid>
                  <pubid idtype="pmpid" link="fulltext">1900503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Characterization of a functional bacterial homologue of sodium-dependent neurotransmitter transporters</p>
            </title>
            <aug>
               <au>
                  <snm>Androutsellis-Theotokis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Ueda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Beppu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Beckman</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Das</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Javitch</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Rudnick</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>15</issue>
            <fpage>12703</fpage>
            <lpage>12709</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M206563200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12569103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Swine and poultry pathogens: the complete genome sequences of two strains of <it>Mycoplasma hyopneumoniae</it> and a strain of <it>Mycoplasma synoviae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Vasconcelos</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Ferreira</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Bizarro</snm>
                  <fnm>CV</fnm>
               </au>
               <au>
                  <snm>Bonatto</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alves-Filho</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Assuncao</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Azevedo</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Bogo</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Brigido</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Brocchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burity</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Carepo</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Carraro</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>de Mattos Cascardo</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Castro</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Cavalcanti</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chemale</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Collevatti</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Cunha</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Dallagiovanna</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dambros</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Dellagostin</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Falcao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fantinatti-Garboggini</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Felipe</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Fiorentin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Franco</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Freitas</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Frias</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grangeiro</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Grisard</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Guimaraes</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Hungria</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jardim</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Krieger</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Laurino</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Lima</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Lopes</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Loreto</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Madeira</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Manfio</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Maranhao</snm>
                  <fnm>AQ</fnm>
               </au>
               <au>
                  <snm>Martinkovics</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Medeiros</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Neiva</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ramalho-Neto</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Nicolas</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Oliveira</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Paixao</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Pedrosa</snm>
                  <fnm>FO</fnm>
               </au>
               <au>
                  <snm>Pena</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Pereira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pereira-Ferrari</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Piffer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Potrich</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Salim</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Santos</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Schrank</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schrank</snm>
                  <fnm>IS</fnm>
               </au>
               <au>
                  <snm>Schuck</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Seuanez</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Staats</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Steffens</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Teixeira</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Urmenyi</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Vainstein</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Zuccherato</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Zaha</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <issue>16</issue>
            <fpage>5568</fpage>
            <lpage>5577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1196056</pubid>
                  <pubid idtype="pmpid" link="fulltext">16077101</pubid>
                  <pubid idtype="doi">10.1128/JB.187.16.5568-5577.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Comparative analysis of RNA regulatory elements of amino acid metabolism genes in Actinobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Seliverstov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Putzer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lyubetsky</snm>
                  <fnm>VA</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>54</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1262725</pubid>
                  <pubid idtype="pmpid" link="fulltext">16202131</pubid>
                  <pubid idtype="doi">10.1186/1471-2180-5-54</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Genome sequence of <it>Symbiobacterium thermophilum</it>, an uncultivable bacterium that depends on microbial commensalism</p>
            </title>
            <aug>
               <au>
                  <snm>Ueda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Watsuji</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Morimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ikeda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beppu</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>16</issue>
            <fpage>4937</fpage>
            <lpage>4944</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">519118</pubid>
                  <pubid idtype="pmpid" link="fulltext">15383646</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh830</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Complete sequence and comparative genome analysis of the dairy bacterium <it>Streptococcus thermophilus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Bolotin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Quinquis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Renault</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ehrlich</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Kulakauskas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lapidus</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goltsman</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mazur</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pusch</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Fonstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kyprides</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Purnelle</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Prozzi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ngui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Masuy</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hancy</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Burteau</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Boutry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Delcour</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hols</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <issue>12</issue>
            <fpage>1554</fpage>
            <lpage>1558</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1034</pubid>
                  <pubid idtype="pmpid" link="fulltext">15543133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Control of transcription termination in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Henkin</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1996</pubdate>
            <volume>30</volume>
            <fpage>35</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.30.1.35</pubid>
                  <pubid idtype="pmpid" link="fulltext">8982448</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>NCBI repository</p>
            </title>
            <url>ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/</url>
         </bibl>
         <bibl id="B57">
            <title>
               <p>The ERGO genome analysis and discovery system</p>
            </title>
            <aug>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Walunas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>D'Souza</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pusch</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Selkov</snm>
                  <fnm>E</fnm>
                  <suf>Jr.</suf>
               </au>
               <au>
                  <snm>Liolios</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Joukov</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kaznadzey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bhattacharyya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Burd</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hanke</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kapatral</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mikhailova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Vasieva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Osterman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vonstein</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fonstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ivanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>164</fpage>
            <lpage>171</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165577</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519973</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>5</issue>
            <fpage>1792</fpage>
            <lpage>1797</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390337</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034147</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>24</issue>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
                  <pubid idtype="doi">10.1093/nar/25.24.4876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids</p>
            </title>
            <aug>
               <au>
                  <snm>Durbin R</snm>
                  <fnm>ES</fnm>
                  <suf>Krogh A, Mitchison G</suf>
               </au>
            </aug>
            <publisher>Cambridge , Cambridge University Press</publisher>
            <pubdate>1998</pubdate>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Fitting a mixture model by expectation maximization to discover motifs in biopolymers</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Elkan</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>2</volume>
            <fpage>28</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7584402</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>WebLogo: a sequence logo generator</p>
            </title>
            <aug>
               <au>
                  <snm>Crooks</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Hon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chandonia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>1188</fpage>
            <lpage>1190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419797</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173120</pubid>
                  <pubid idtype="doi">10.1101/gr.849004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Combining evidence using p-values: application to sequence homology searches</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Gribskov</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>1</issue>
            <fpage>48</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.1.48</pubid>
                  <pubid idtype="pmpid" link="fulltext">9520501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Sequence logos: a new way to display consensus sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <issue>20</issue>
            <fpage>6097</fpage>
            <lpage>6100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">332411</pubid>
                  <pubid idtype="pmpid" link="fulltext">2172928</pubid>
                  <pubid idtype="doi">10.1093/nar/18.20.6097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Analysis of growth of <it>Lactobacillus plantarum</it> WCFS1 on a complex medium using a genome-scale metabolic model</p>
            </title>
            <aug>
               <au>
                  <snm>Teusink</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wiersma</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Molenaar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Francke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>de Vos</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Siezen</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Smid</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2006</pubdate>
            <volume>281</volume>
            <issue>52</issue>
            <fpage>40041</fpage>
            <lpage>40048</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M606263200</pubid>
                  <pubid idtype="pmpid" link="fulltext">17062565</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Characterization of the glutamyl-tRNA(Gln)-to-glutaminyl-tRNA(Gln) amidotransferase reaction of <it>Bacillus subtilis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Strauch</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Zalkin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Aronson</snm>
                  <fnm>AI</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1988</pubdate>
            <volume>170</volume>
            <issue>2</issue>
            <fpage>916</fpage>
            <lpage>920</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">210742</pubid>
                  <pubid idtype="pmpid" link="fulltext">2892827</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>tRNA-dependent asparagine formation</p>
            </title>
            <aug>
               <au>
                  <snm>Curnow</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Ibba</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1996</pubdate>
            <volume>382</volume>
            <issue>6592</issue>
            <fpage>589</fpage>
            <lpage>590</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/382589b0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8757127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>NCBI taxonomy webpage</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/Taxonomy/</url>
         </bibl>
      </refgrp>
   </bm>
</art>
