<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1755-8794-1-18</ui>
   <ji>1755-8794</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Whole genome identification of <it>Mycobacterium tuberculosis </it>vaccine candidates by comprehensive data mining and bioinformatic analyses</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Zvi</snm>
               <fnm>Anat</fnm>
               <insr iid="I1"/>
               <email>anatz@iibr.gov.il</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Ariel</snm>
               <fnm>Naomi</fnm>
               <insr iid="I1"/>
               <email>naomia@iibr.gov.il</email>
            </au>
            <au id="A3">
               <snm>Fulkerson</snm>
               <fnm>John</fnm>
               <insr iid="I2"/>
               <email>jfulkerson@aeras.org</email>
            </au>
            <au id="A4">
               <snm>Sadoff</snm>
               <mi>C</mi>
               <fnm>Jerald</fnm>
               <insr iid="I2"/>
               <email>jsadoff@aeras.org</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Shafferman</snm>
               <fnm>Avigdor</fnm>
               <insr iid="I1"/>
               <email>avigdors@iibr.gov.il</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Israel Institute for Biological Research, Ness Ziona 74100, Israel</p>
            </ins>
            <ins id="I2">
               <p>Aeras Global TB Vaccine Foundation, Rockville, MD, USA</p>
            </ins>
         </insg>
         <source>BMC Medical Genomics</source>
         <issn>1755-8794</issn>
         <pubdate>2008</pubdate>
         <volume>1</volume>
         <issue>1</issue>
         <fpage>18</fpage>
         <url>http://www.biomedcentral.com/1755-8794/1/18</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18505592</pubid>
               <pubid idtype="doi">10.1186/1755-8794-1-18</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>10</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Zvi et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p><it>Mycobacterium tuberculosis</it>, the causative agent of tuberculosis (TB), infects ~8 million annually culminating in ~2 million deaths. Moreover, about one third of the population is latently infected, 10% of which develop disease during lifetime. Current approved prophylactic TB vaccines (BCG and derivatives thereof) are of variable efficiency in adult protection against pulmonary TB (0%&#8211;80%), and directed essentially against early phase infection.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>A genome-scale dataset was constructed by analyzing published data of: (1) global gene expression studies under conditions which simulate intra-macrophage stress, dormancy, persistence and/or reactivation; (2) cellular and humoral immunity, and vaccine potential. This information was compiled along with revised annotation/bioinformatic characterization of selected gene products and <it>in silico </it>mapping of T-cell epitopes. Protocols for scoring, ranking and prioritization of the antigens were developed and applied.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Cross-matching of literature and <it>in silico</it>-derived data, in conjunction with the prioritization scheme and biological rationale, allowed for selection of 189 putative vaccine candidates from the entire genome. Within the 189 set, the relative distribution of antigens in 3 functional categories differs significantly from their distribution in the whole genome, with reduction in the Conserved hypothetical category (due to improved annotation) and enrichment in Lipid and in Virulence categories. Other prominent representatives in the 189 set are the PE/PPE proteins; iron sequestration, nitroreductases and proteases, all within the Intermediary metabolism and respiration category; ESX secretion systems, resuscitation promoting factors and lipoproteins, all within the Cell wall category. Application of a ranking scheme based on qualitative and quantitative scores, resulted in a list of 45 best-scoring antigens, of which: 74% belong to the dormancy/reactivation/resuscitation classes; 30% belong to the Cell wall category; 13% are classical vaccine candidates; 9% are categorized Conserved hypotheticals, all potentially very potent T-cell antigens.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The comprehensive literature and <it>in silico</it>-based analyses allowed for the selection of a repertoire of 189 vaccine candidates, out of the whole-genome 3989 ORF products. This repertoire, which was ranked to generate a list of 45 top-hits antigens, is a platform for selection of genes covering all stages of <it>M. tuberculosis </it>infection, to be incorporated in rBCG or subunit-based vaccines.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Mycobacterium tuberculosis (Mtb), the causative agent of tuberculosis (TB), remains a major health threat. Each year, 8 million new TB cases occur and 2 million individuals die of TB <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Moreover, it is estimated that one third of the population is latently infected with Mtb, of which ~10% will develop active disease during lifetime. The development of active TB occurs when the balance between natural immunity and the pathogen changes (e.g. upon waning of protective immune response during adolescence and in HIV patients, <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>). In addition, at present ~50 million individuals are probably infected with multi drug-resistance (MDR) strains of Mtb (WHO, 2006), rendering antibiotic treatment difficult.</p>
         <p>The current vaccine, introduced over 80 years ago, is the live attenuated bacterium <it>Mycobacterium bovis </it>Bacillus Calmette-Geurin (BCG), designed as a prophylactic vaccine for pre-infection administration. BCG is known to protect young children against severe forms of TB however it does not efficiently and consistently protect adults against the most prevalent form of the disease, namely, pulmonary TB (variable protective efficacy ranges from 0% to 80%), nor does BCG offer protection from establishment of latent TB and subsequent reactivation <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>In principle, current putative vaccination strategies against TB can be divided into two groups: (a) prophylactic vaccines (aimed at disease prevention), based on BCG with or without antigens secreted by replicating bacteria recognized during the early stage of infection (e.g. ESAT-6 or Ag85A/B), or protective mutants (live attenuated BCG substitutes) and (b) as yet undefined post-exposure vaccines (boosting BCG) aimed at elimination/containment of latent TB and prevention of reactivation. Ideally, a prime-boost approach comprising of a prophylactic vaccine with subunit, viral vectored or DNA-based vaccines, based on late-stage antigens induced in the dormant stage (transition from replicating to non-replicating stage, latency antigens) or resuscitation/reactivation stage, should have maximum impact on all stages of Mtb infection <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B6">6</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>In the past few years there have been important breakthroughs in the development of improved prophylactic TB vaccines. Novel vaccine candidates, mostly selected as single gene products, include: rBCG vaccines (e.g. rBCG30 &#8211; expressing Ag85B or &#916;ureCHly+rBCG &#8211; an urease deficient strain expressing listeriolysin), virus-based recombinant vaccines (e.g. MVA85A &#8211; Rv3804c expressed in replication-deficient vaccinia virus), live attenuated Mtb strains and subunit vaccines comprising of dominant secreted antigens which could boost the immune response after priming with BCG (e.g. Mtb72F, Ag85B-ESAT-6 fusion), (reviewed by <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B10">10</abbr><abbr bid="B16">16</abbr></abbrgrp>).</p>
         <p>Post-exposure vaccine development requires identification of gene products participating in adaptation of Mtb to the intracellular habitat as Mtb changes from replication to dormancy or from dormancy to resuscitation. Moreover, in the absence of effective prediction models and animal models assessing protective immunity, evaluation of a large number of individual antigens remains laborious <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. <it>In vitro </it>gene expression studies under conditions which mimic dormancy and/or reactivation constitute the major source of information. Availability of whole genome sequences of diverse mycobacterial strains in conjunction with data from global analyses such as whole-genome DNA microarray and proteomic technologies have been the subject of intensive research in the recent years, both at site of pulmonary TB and in <it>ex vivo </it>macrophages <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B17">17</abbr></abbrgrp>. To identify genes expressed specifically during latency, different <it>in vitro </it>conditions have been suggested to model the harsh environment within macrophages or granulomas. These include oxygen deprivation, nutrient starvation and iron limitation. Together with expression studies in lungs/granulomas, in phagocytized bacteria inside activated macrophages or in murine/guinea pig infection models, these datasets provide better understanding of the transition of bacteria through stages of active multiplication, dormancy and resuscitation. Based on bioinformatic studies and analysis of multiple dormancy-related datasets, novel drug targets against the dormant phase of Mtb infection have been recently identified <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>Several attempts have been made to modify the immunogenicity or antigenicity of BCG by generating recombinant strains expressing cytokines, pore-forming listeriolysin/perfringolysin, immunodominant antigens or additional antigens missing from the genome of the avirulent <it>M. bovis </it>BCG genome. Over 200 genes located in 14 gene segments assigned to defined regions of difference (RDs) are missing in the vaccine strain. In principle, potency of BCG vaccines could be improved by supplementing with missing immunodominant RD genes; however critical evaluation is required since RD regions are associated with virulence. Indeed, when BCG was supplemented by the classical RD-1 antigen ESAT-6 (with or without Ag85B <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>), enhanced protection was observed in mice however the recombinant strain was more virulent as compared to the wild-type. In a more recent study, Kalra &amp; Grover <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> demonstrated the enhanced prophylactic potential of selected combinations of RD antigens supplementing BCG, improving protection in aerosol exposed mice.</p>
         <p>Mtb is an intracellular pathogen and as such cell mediated immunity rather than antibody-mediated immunity is essential for the control of bacterial replication and subsequent protection against TB <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. It is thought that a coordinated response of the cellular immune response is fundamental to the protective immunity <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, including both CD4 and CD8 T-cells and several cytokines such as IFN&#947; and TNF&#945;. It has been suggested that CD4 T-cells are mainly crucial during the early phases of infection, while the CD8 T-cells play an important role in the chronic phase of the disease <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, however little is known about antigen-specific human T-cell responses against persisting mycobacteria, particularly in the context of latent infection and protection against disease, with the exception of some products of the DosR regulon <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Therefore, rational selection of sequences that may function as T-cell epitopes in vaccine formulations is crucial. Relatively few human CD8 T-cell epitopes have been found by conventional methods. The availability of the genomic sequence <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> provides new horizons in analyzing the potential immunome of the bacilli, using <it>in silico </it>identification of CTL binders. Several prediction screens of Mtb T-cell epitopes were reported to date <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, nevertheless these analyses were mostly restricted either to few MHC alleles or to a limited number of preselected subset of genes.</p>
         <p>In this study, we have combined multiple published datasets from: (a) gene expression and DNA microarray experiments mimicking conditions leading to dormancy or at the dormant state; (b) genome-wide insertional mutagenesis examining gene essentiality under different conditions; (c) genes expressed in macrophages under conditions mimicking persistence; (d) genes expressed in lungs/granulomas; (e) expression under iron limiting conditions; (f) identification of genes products reported to elicit humoral and cellular response or potential vaccine candidates; (g) <it>in silico </it>identification of T-cell epitopes by dedicated epitope-mapping algorithms and databases compiling relevant experimental data. We designed a whole-genome scoring, ranking and prioritization algorithm and used it to analyze the combined dataset. Accordingly, we selected a set of 189 putative vaccine candidate proteins covering all disease phases, from which we further derived a shorter list of 45 top-ranking antigens for vaccine studies.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>Genome sequences and annotations of <it>Mycobacterium tuberculosis </it>H37Rv [GenBank: <ext-link ext-link-type="gen" ext-link-id="AL123456">AL123456</ext-link>] were downloaded from the NCBI <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Sequence similarity searches were conducted using the Blast algorithm, vs. the non-redundant (nr, NCBI) database. Comparisons against secondary databases of domains and motifs were performed as follows: CDD <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, SMART <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, Pfam <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, Interpro <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, Prosite <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Secretion signals were analyzed by SignalP <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> for the secretion via the SecA pathway, and TATfind for searching of the Twin-Arginine Translocation (Tat) motif of the TAT pathway <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Transmembrane helical segments were predicted by Tmpred <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The database of bacterial lipoproteins, Dolop <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> was searched for putative lipoproteins. Predictions of CTL epitopes were carried out using the NetCTL integrative approach <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, which combines predictions of MHC class I binding, TAP transport efficiency, and proteasomal cleavage. The analysis was conducted for each of 12 HLA supertypes. A threshold of 1.25 was set on the combined prediction score. The results were parsed by in-house perl scripts. Immune epitope information was complemented by querying the IEDB database <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> for documented B- and T-cell epitopes. The following Mtb-related servers and databases were screened for additive relevant data: TB-sgc &#8211; The TB Structural Genomics Consortium <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, Tuberculist-the database on <it>Mycobacterium tuberculosis </it>genetics <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, TBDB &#8211; an integrated platform for TB drug discovery <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, MTBreg &#8211; The Database of Conditionally Regulated Proteins in <it>Mycobacterium tuberculosis </it><abbrgrp><abbr bid="B49">49</abbr></abbrgrp> and BioHealthBase &#8211; the Biodefense and Public Health Database <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Functional categories provided for the selected ORF products were as implemented in <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B51">51</abbr></abbrgrp>, and their assignment to each of the antigen was derived from the Tuberculist database (see above). Classes/phases of Mtb infection were assigned as well to the selected antigens: the well established classical antigens, DosR-regulated antigens (as reported in <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>), reactivation antigens (antigens listed by Talaat and his coworkers <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, as well as documented putative implications of individual antigens), and the 5 known resuscitation antigens <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>; antigens not belonging to any of the above classes, were denoted as "Others". p-values were calculated using the binomical distribution of frequencies.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>The strategy for whole genome-based selection of candidate genes to be included in a vaccine platform, consisted of: (1) compilation of documented data originating from global analyses pertaining to criteria relevant to vaccine and to <it>M. tuberculosis </it>(Mtb) pathogenesis (2) selection of a subset of genes for further evaluation, based on the accumulated data, by cross matching the data in the different criteria; (3) bioinformatic analyses aimed at both further characterization of the candidate genes, in terms of annotation, gene context and cellular localization and; immunoinformatic analysis conducted for the prediction of T-cell epitopes; (4) development and application of a ranking scheme, based on qualitative and quantitative measures, as a tool for prioritizing the selected candidates. A schematic presentation of the various steps, as well as the reductive scheme generating a list of best-hit antigens, is given in Figure <figr fid="F1">1</figr>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Flowchart of the selection procedure &#8211; data mining and bioinformatic analyses</p>
            </caption>
            <text>
               <p><b>Flowchart of the selection procedure &#8211; data mining and bioinformatic analyses</b>. A schematic presentation of the reductive approach applied for the whole-genome selection of Mtb vaccine candidates.</p>
            </text>
            <graphic file="1755-8794-1-18-1"/>
         </fig>
         <sec>
            <st>
               <p>Generation of a comprehensive, literature-based dataset</p>
            </st>
            <p>The first step towards the selection of vaccine candidates consisted of accumulating published experimental data, by an extensive literature scan of documented analyses, with emphasize on global analyses. Target genes to be involved in a future vaccine are either ORF products which have the potential to elicit an immune response (humoral and/or cellular), and/or involved in various manifestations and phases of the infection and/or virulence (as a basis for an attenuated strain). Accordingly, we assembled all evidences into the following general 11 categories (Table <tblr tid="T1">1</tblr>):</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Information sources for the knowledge dataset used in this study.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c cspan="4" ca="left">
                        <p>
                           <ul>(I) Literature-derived evidences:</ul>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Category</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Reference</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Experimental model/method</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Macrophages</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Growth/Survival</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Sassetti <it>et al</it>., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Genes essential for growth (in strains H37Rv &amp; BCG) (TraSH, microarray)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Rengarajan <it>et al</it>., 2005</p>
                     </c>
                     <c ca="left">
                        <p>Genes necessary for survival in macrophages (TraSH, microarray)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Stewart <it>et al.</it>, 2005</p>
                     </c>
                     <c ca="left">
                        <p>Screening of mutants unable to inhibit phagosome acidification (STM, microarray analysis)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Rosas-Magallanes <it>et al</it>., 2007</p>
                     </c>
                     <c ca="left">
                        <p>Screening of mutants attenuated in human macrophages (STM)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Expression profile</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Monahan <it>et al</it>., 2001</p>
                     </c>
                     <c ca="left">
                        <p><it>M. bovis </it>BCG protein expression in macrophages (human cell line), as compared to growth in culture media or conditions of heat shock (1,2-DE, proteomic analysis)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Fisher <it>et al</it>., 2002</p>
                     </c>
                     <c ca="left">
                        <p>Genes induced following <it>in vitro </it>acid shock (microarray, RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Schnappinger <it>et al</it>., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Differential transcriptome of genes in the phagosome, in comparison to their expression in culture (microarray, RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Talaat <it>et al.</it>, 2004</p>
                     </c>
                     <c ca="left">
                        <p>Comparison of H37Rv expression profile between growth in lungs of BALB/c vs. macrophages (microarray, RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cappelli <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>Comparison of H37Rv gene expression in human macrophages vs. synthetic medium (microarray, RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>MprAB</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Regulation</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>He <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>Genes upregulated by MprA (in SDS treated culture); (microarray RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Hypoxia</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Expression profile</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Voskuil <it>et al</it>., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Expression profile at low O2 and low concentrations of NO (inhibitor of aerobic respiration); dormancy regulated gene set; mostly overlaps with Sherman; microarray</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Sherman <it>et al</it>., 2001</p>
                     </c>
                     <c ca="left">
                        <p>H37Rv gene expression under hypoxia (from ambient to 0.2% O<sub>2 </sub>for 2 h)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Schnappinger et al., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Analysis of mutants deficient in NO synthase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Reactivation</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Expression profile</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Talaat <it>et al</it>., 2007</p>
                     </c>
                     <c ca="left">
                        <p>Difference in expression in BALB/c vs. broth after incubation with dexamethasone (microarray)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Tufariello <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>Effect of rpf deletions on persistence and reactivation in mouse</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Dormancy</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Expression profile</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Voskuil <it>et al. </it>2003</p>
                     </c>
                     <c ca="left">
                        <p>Expression profile at low O2 and low concentrations of NO; dormancy regulated gene set; mostly overlaps with Sherman; microarray</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Voskuil <it>et al</it>., 2004</p>
                     </c>
                     <c ca="left">
                        <p>Transcription profile in non-proliferating conditions; genes induced under oxygen-depleted conditions (nrp-non-replicating persistence)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Starck <it>et al</it>., 2004</p>
                     </c>
                     <c ca="left">
                        <p>Proteome comparison of aerobic and anerobic conditions (MTB Harlingen strain)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Lungs</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Expression profile</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Fenhalls <it>et al</it>., 2002</p>
                     </c>
                     <c ca="left">
                        <p>Expression of genes in human tuberculous granulomas (in situ hybridization)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Shi <it>et al</it>., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Transcription pattern of 6 H37Rv genes in mouse lungs (RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Shi <it>et al</it>., 2004</p>
                     </c>
                     <c ca="left">
                        <p>Transcription pattern of H37Rv major secreted antigens in mouse lungs (RT-PCR)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Dubnau <it>et al</it>., 2005</p>
                     </c>
                     <c ca="left">
                        <p>Expression of genes during infection in mouse lungs vs. medium (promoter trap)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Rachman <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>Identification of genes expressed during pulmonary TB; transcription profile from clinical lung samples (granuloma vs. <it>in vitro</it>) (microarray)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Tufariello <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>rpf gene expression in the lungs of infected mice</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Growth/Survival</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Lamichhane <it>et al</it>., 2005</p>
                     </c>
                     <c ca="left">
                        <p>Genes required for in vivo survival im mouse lungs (microarray screening of transposon mutants)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Jain <it>et al</it>., 2007</p>
                     </c>
                     <c ca="left">
                        <p>Mutants tested for lung implantation in survival in guinea pigs and mouse aerosol models</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Acr</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Co-regulation</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Florczyk <it>et al</it>. 2003</p>
                     </c>
                     <c ca="left">
                        <p>Identification of a 18-bp palindromic sequence motif</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Secreted</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Gomez <it>et al</it>., 2000</p>
                     </c>
                     <c ca="left">
                        <p><it>in silico </it>identification of proteins harboring signal peptides but lacking membrane anchoring moieties.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Immunogenicity</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>B-cell response</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Brusasca <it>et al</it>., 2001</p>
                     </c>
                     <c ca="left">
                        <p>Antibody response to 6 H37Rv RD1 proteins in guinea pigs and sera from pulmonary TB patients</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Yeremeev <it>et al</it>., 2003</p>
                     </c>
                     <c ca="left">
                        <p>Elicitation of B-cell response in mice immunized with rpf proteins (H37Rv)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Weldingh <it>et al</it>., 2005</p>
                     </c>
                     <c ca="left">
                        <p>Seropotential of 35 proteins, tested by response with sera of TB patients</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Amor <it>et al</it>., 2005</p>
                     </c>
                     <c ca="left">
                        <p>Seroreactivity of MTB specific proteins previously predicted as secreted</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>T-cell response</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cockle et al., 2002</p>
                     </c>
                     <c ca="left">
                        <p>Immune response in cattle against 13 ORFs (RD1, RD2 and RD14 antigens).</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Vekemans <it>et al.</it>, 2004</p>
                     </c>
                     <c ca="left">
                        <p>Profile of immune response in healthy and TB patients against a series of mycobacterial antigens</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mustafa <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>Characterization of Th1 cell reactivity with RD1 antigens and peptides</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Iron-regulated</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Regulation</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Rodriguez <it>et al</it>., 2002</p>
                     </c>
                     <c ca="left">
                        <p>Identification of genes induced by iron and by the iron-dependent regulator IdeR &#8211; comparison of H37Rv and ideR-mutant strains (microarray)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Vaccine</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Immunization/protection</it>
                           </b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mollenkopf <it>et al</it>., 2004</p>
                     </c>
                     <c ca="left">
                        <p>DNA vaccine candidates preselected by comparative proteomics (present in MTB, absent from BCG) evaluated for their protective potential (aerosol challenge of H37Rv, mice)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Vipond <it>et al</it>., 2006</p>
                     </c>
                     <c ca="left">
                        <p>DNA vaccine candidates chosen by supporting data, such as virulence-associated, level of expression, growth in various conditions etc. (aerosol challenge of H37Rv, guinea pigs)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Roupie <it>et al</it>., 2007</p>
                     </c>
                     <c ca="left">
                        <p>DNA vaccine candidates chosen from the DosR regulon (on the basis of strong T-cell responses in infected humans), evaluated for their immunogenicity potential (mice immunizations).</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4" ca="left">
                        <p>
                           <ul>(II) <it>in silico</it>-based evidences:</ul>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Category</b>
                        </p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>
                           <b>Source of information/analyses</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cell wall</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Membranal and anchored</it>
                           </b>
                        </p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Assignment of ORF products as membrane-attached, by:</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(1) Prediction of membrane-spanning regions by TMpred</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(2) Inference from annotation and/or domain analysis</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Repeats</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Inference from annotation and/or domain analysis</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>T-cell immunogenicity</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>MHC class I and class II binders</it>
                           </b>
                        </p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Compilation of experimental and predicted data from:</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(1) Screening of the public repository database of immune epitope data (IEDB)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(2) Particular experimental evidences from the literature</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(3) Literature-derived predicted T-cell epitopes</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>(4) Prediction of CTL epitopes by an integrative approach (NetCTL)</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <sec>
               <st>
                  <p>Macrophages</p>
               </st>
               <p>Numerous comprehensive studies related to the infection of the macrophages by the bacilli were conducted, aiming at identifying the genes involved in processes as adhesion to the host cell, phagocytosis-mediated entry and survival in the macrophage, adaptation to the environment and resistance to both phagocytosis and lysis, and to free radicals. Different approaches were applied to characterize both the profile of expression at various relevant conditions and the growth/survival of wild-type Mtb and selected mutants. These include subtractive hybridization, transcriptomics, proteomics and mutagenesis. In our screen, we included representative studies covering the abovementioned aspects, as detailed in Table <tblr tid="T1">1</tblr><abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>MprAB</p>
               </st>
               <p>This two component system is one of two major families of transcriptional regulators necessary for stress adaption processes. The system is induced in response to various stress conditions and was shown to be required for growth <it>in vivo </it>during the persistent stage of infection. Comparative DNA microarray analysis of Mtb H37Rv and an <it>mprA </it>mutant strain enabled to profile the gene expression in each of the strains and to identify 221 genes which were presumably regulated by MprA under stress conditions <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Hypoxia</p>
               </st>
               <p>One of the well established models of the tuberculi non-replicating dormant state, is the Wayne model <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, which is based on culturing <it>Mycobacteria </it>in decreasing concentrations of oxygen, mimicking the environment in which the bacteria transform to a non-replicating form, residing in granulomas, which are of hypoxic nature. In this respect, growth under low O<sub>2 </sub>and NO concentration or under depletion of NO synthase are the major conditions implemented to identify the expression profile related to hypoxic conditions. Sherman and his coworkers <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> used a whole genome microarray to identify 101 genes whose expression is significantly altered by hypoxic conditions, of which 47 are induced. A similar approach was followed by Voskuil and coworkers <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, resulting in a set of 48 genes affected by low O<sub>2 </sub>concentration or by the presence of nontoxic concentration of NO (known to inhibit bacterial respiration). Likewise, mutants deficient in NO synthase were used to profile the genes expressed during hypoxia-like conditions <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Reactivation</p>
               </st>
               <p>Very little is known about the genetic basis and the signals which are involved in the reactivation from the dormant state of the mycobacterium. In an attempt to decipher the factors which govern the phase of reactivation, some researchers adopted an immunossupressive model, in which reactivation of latent infection occurs following treatment with the immunossupressive reagent dexamethazone <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp>. Using this model, Talaat and coworkers <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> recently identified a total of 174 genes that were up-regulated during the reactivation phase of tuberculosis. A family of 5 proteins, denoted resuscitation promoting factors (Rpfs), has been shown to be involved in regulating mycobacterial growth <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B70">70</abbr></abbrgrp>. Single deletions of the Rpf members were used to determine the effect of these proteins on the kinetics of reactivation <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Dormancy</p>
               </st>
               <p>The shifting to the non-replicating persistence state ("dormancy") of the bacteria is accompanied by modulation of expression of genes related to the adaptation to the non-replicating conditions. The adaptive processes include: starvation of essential nutrients, cessation of growth in stationary phase, and depletion of oxygen. Of these, the later is the most investigated and constitutes the basis for the <it>in vitro </it>model used in dormancy studies. The set of genes found to be induced by hypoxia, nitric oxide and adaptation to the non-replicating conditions, under the regulation of DosR (Dormancy survival regulator, Rv3133c), were denoted as the dormancy/DosR regulon <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. Transcription profiles were determined by a whole-genome comparative DNA microarray analysis of the exponential growth at time points in the stationary phase, as well as under specific non-replicating persistent conditions of low oxygen and nitric oxide <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B62">62</abbr></abbrgrp>. In a separate analysis, the differential expression of genes under aerobic and anaerobic conditions was deciphered by a proteome (2D-PAGE) analysis, revealing proteins which were unique or more abundant in the anaerobic conditions <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Lungs</p>
               </st>
               <p>As the first point of entry of the <it>M. tuberculosis </it>upon infection, following implantation the bacteria reside and replicate in granulomas in the lungs. Various strategies were employed to depict the pattern of expression both in human granulomas and in guinea pig and mouse lungs, and these revealed an extensive list of genes necessary for growth and survival, as well as identification of genes expressed during infection in the lungs (detailed in Table <tblr tid="T1">1</tblr>, <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B71">71</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp>).</p>
            </sec>
            <sec>
               <st>
                  <p>Acr-coregulated</p>
               </st>
               <p>The 16 kDa &#945;-crystallin (Acr) protein was shown to be induced and required inside macrophages and highly expressed in the presence of NO, low O<sub>2 </sub>concentrations and in the stationary phase (<abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr></abbrgrp> and references therein). The Acg gene product was later identified as a novel macrophage-induced gene, whose expression is co-regulated with that of <it>acr </it><abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. Aiming at identifying other genes which are under the same regulation, an <it>acr-</it>coregulated gene (ACG) family was denoted, following the identification of a conserved sequence motif found in the promoter region of the 15 family members <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Secreted</p>
               </st>
               <p>Proteins which are exported from the cytoplasm and either secreted to the milieu or anchored to the cell wall, are of a major importance as targets of the immune system, in view of their exposure in the host. The secretion pathways in <it>M. tuberculosis </it>are less established than in other bacilli, in particular those implicated in virulence factor secretion. In addition to the classical pathways (the SecA and TAT systems), the pathogen harbors a unique virulence-related secretion system &#8211; ESX system and probably additional systems which remain to be identified. Identification of secreted proteins by a whole genome global analysis was conducted by Gomez and coworkers <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>. A predictive approach of all proteins harboring a secretion signal of the classical SecA pathway, but lacking membrane spanning segments, was carried out, resulting in the identification of 52 proteins, the location of which was further confirmed by fusion to a marker of subcellular localization.</p>
            </sec>
            <sec>
               <st>
                  <p>Immunogenicity</p>
               </st>
               <p>Data on immune response was compiled from global analyses covering both elicitation of humoral and cellular responses following vaccination of model animals, as well as seroreactivity profiles of selected antigens with sera from healthy and TB patients (see Table <tblr tid="T1">1</tblr> and data pertaining to individual ORF products, in the text, <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr><abbr bid="B86">86</abbr><abbr bid="B87">87</abbr><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr></abbrgrp>).</p>
            </sec>
            <sec>
               <st>
                  <p>Iron regulated</p>
               </st>
               <p>As for many other pathogens, <it>Mycobacterium tuberculosis </it>infection is dependent on the availability of iron in the depleted macrophage, and therefore iron sequestration from the host is necessary for virulence <abbrgrp><abbr bid="B90">90</abbr><abbr bid="B91">91</abbr></abbrgrp>. Iron is also known to influence both the innate and adaptive immune responses to Mtb <abbrgrp><abbr bid="B92">92</abbr></abbrgrp>. Failure to assemble the iron acquisition machinery or to repress iron uptake has deleterious effects for Mtb <abbrgrp><abbr bid="B90">90</abbr><abbr bid="B91">91</abbr></abbrgrp>. In addition to expressing iron-uptake systems during iron deficiency, Mtb displays several changes in gene expression in response to iron availability. All changes taking place during the response to iron deficiency are controlled by iron-dependent regulatory networks. The IdeR gene product is a dual functional regulator controlling transcription of genes involved in iron acquisition, iron storage and macrophage survival <abbrgrp><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr></abbrgrp>. <it>ideR </it>is an essential gene in <it>M. tuberculosis </it>and its deletion results in deregulated siderophore biosynthesis and sensitivity to oxidative stress <abbrgrp><abbr bid="B94">94</abbr></abbrgrp>. The transcription profile of genes whose expression is modulated by iron levels was mapped by a DNA microarray analysis, and revealed 155 genes which were iron-regulated; one third of these were regulated by IdeR <abbrgrp><abbr bid="B94">94</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Vaccine</p>
               </st>
               <p>Data pertaining to vaccine potential of the antigens was derived from publications describing global analyses of preselected antigens for their potential to induce an immunoprotective response <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B95">95</abbr><abbr bid="B96">96</abbr><abbr bid="B97">97</abbr></abbrgrp>. These include mostly evaluation of DNA vaccine candidates, as listed in Table <tblr tid="T1">1</tblr>. For this specific category, a large number of evidence was also derived from information pertaining to particular antigens studied both as DNA or protein vaccine candidates <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B20">20</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B71">71</abbr><abbr bid="B84">84</abbr><abbr bid="B98">98</abbr><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr><abbr bid="B101">101</abbr><abbr bid="B102">102</abbr><abbr bid="B103">103</abbr><abbr bid="B104">104</abbr><abbr bid="B105">105</abbr><abbr bid="B106">106</abbr><abbr bid="B107">107</abbr><abbr bid="B108">108</abbr><abbr bid="B109">109</abbr><abbr bid="B110">110</abbr><abbr bid="B111">111</abbr><abbr bid="B112">112</abbr><abbr bid="B113">113</abbr><abbr bid="B114">114</abbr><abbr bid="B115">115</abbr><abbr bid="B116">116</abbr><abbr bid="B117">117</abbr><abbr bid="B118">118</abbr><abbr bid="B119">119</abbr><abbr bid="B120">120</abbr><abbr bid="B121">121</abbr><abbr bid="B122">122</abbr><abbr bid="B123">123</abbr><abbr bid="B124">124</abbr><abbr bid="B125">125</abbr><abbr bid="B126">126</abbr><abbr bid="B127">127</abbr><abbr bid="B128">128</abbr><abbr bid="B129">129</abbr><abbr bid="B130">130</abbr><abbr bid="B131">131</abbr><abbr bid="B132">132</abbr><abbr bid="B133">133</abbr><abbr bid="B134">134</abbr><abbr bid="B135">135</abbr><abbr bid="B136">136</abbr><abbr bid="B137">137</abbr><abbr bid="B138">138</abbr><abbr bid="B139">139</abbr><abbr bid="B140">140</abbr><abbr bid="B141">141</abbr></abbrgrp>, among which are obviously the well established classical antigens.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Selection of a dataset of ~200 vaccine candidates</p>
            </st>
            <p>Of the total 3989 ORF products, 1209 had documented evidence in at least one of the sources listed in Table <tblr tid="T1">1</tblr>. Cross-matching of the compiled data for all 1209 ORF products, allowed for a preliminary selection of 344 antigens, for which either more than two values above the threshold exist, or, alternatively, one value above the threshold together with at least two other values with lower values (below the threshold) were present. The threshold for each literature source was set as the median of the experimental values extracted from the particular literature source.</p>
            <p>The literature sources which were used to generate the dataset (Table <tblr tid="T1">1</tblr>) were grouped within each category (e.g. all 9 studies in the Macrophage category were treated as a single piece of evidence). Following the threshold selection described above (step I in Figure <figr fid="F1">1</figr>), each ORF product was assigned a "+" sign in a given category if experimental data has been reported at least in one study. Summation of the "+" signs across all categories, for each ORF product, produced a total, un-weighted score, with a maximal possible total score of 11 (according to the number of categories). Among the 344 antigens, 143 had a relatively high total score of 3&#8211;8. Of the remaining 201 low scoring antigens, 46 had documented information related to vaccine potential and/or a critical role in the pathogenesis traits of the bacilli. As such, these 46 antigens were added to the 143 high scoring antigens (having a total score &#8805; 3), generating the final list of 189 candidates for further evaluation (Additional file <supplr sid="S1">1</supplr>: List of 189 selected antigens).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>List of 189 selected antigens</b>. (a) In bold and underlined: top-ranking antigens included in the list of 45 candidate genes. (b) The Gene name and annotation are based on the data deposited at the NCBI [GenBank: <ext-link ext-link-type="gen" ext-link-id="AL123456">AL123456</ext-link>]. In square brackets: updated gene name, annotation, and functional category, resulting from the analyses conducted in this study. (c) The functional categories are as provided by Cole and his coworkers <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and the assignment of a functional category to each ORF product is according to the Tuberculist database <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, unless an updated category is assigned (in square brackets), according to the re-annotation performed in this study (see (b)). (d) The classes/phases of Mtb infection assigned to each selected ORF (see "Methods").</p>
               </text>
               <file name="1755-8794-1-18-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Bioinformatics analysis of the 189 vaccine candidates</p>
            </st>
            <p>Inspection of the list of 189 candidates reveals that, according to the annotation deposited at the NCBI, 69 antigens have no assigned function. In an attempt to further characterize the candidate genes, we followed an in-depth bioinformatic analysis which was aimed at revising the existing annotation, as well as determining the cellular localization and the presence of secretion signals, and protein repeats patterns. The revised annotation was thus based on sequence similarity searches against updated databases, and domain/motif assignment following querying secondary databases (see: Methods). Also, Mtb-related servers were screened to extract function-related information (see: Methods). Additive information was derived from the accumulating data in the literature, and specifically on functional characterization of particular genes or their mycobacterial orthologs. These <it>in silico </it>analyses and data mining resulted in the assignment of a putative function to 60% of the 69 candidates with unknown function, and the addition of further functional details to 15 candidates (Additional file <supplr sid="S1">1</supplr>: List of 189 selected antigens).</p>
         </sec>
         <sec>
            <st>
               <p>Immunoinformatic analysis of the 189 vaccine candidates</p>
            </st>
            <p>As mentioned above, the significance of the cellular immune response in protection against intracellular pathogens in general and Mtb in particular, is well documented. We therefore conducted a comprehensive mapping of the T-cell immunity potential of the selected antigens, by compiling both the results of an <it>in-silico </it>search for putative T-cell epitopes and experimental published data.</p>
            <p>Using NetCTL as an integrative approach for prediction of 9-mer CTL epitopes <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, we analyzed the 189 proteins for the presence of MHC binding peptides. The analysis was conducted on 12 HLA supertypes, representing ~120 human MHC alleles and providing a population coverage of 99.8% worldwide. For our large scale analysis, we have chosen a relatively high combined score threshold of 1.25, which gives preferentiality to true binders. At this threshold, the specificity is 0.993 and the sensitivity is 0.54 (as reported by the NetCTL server). Most potent candidates were considered as those having binders for as many supertypes (this argument was further used to rank the antigens at a later stage, see below). Compared to <it>in silico </it>studies of the Mtb antigens immunomic potential <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B125">125</abbr></abbrgrp>, the analysis described herein is inclusive both with respect to its scope and to the number of HLA alleles covered.</p>
            <p>In addition to the theoretical analysis, experimental evidences for T-cell epitopes were retrieved both from the IEDB database <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, by querying each of the antigens as well as data for cellular immunoreactivity of individual gene products from the extensive study of Leyten <it>et al. </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and other literature sources <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr><abbr bid="B86">86</abbr><abbr bid="B88">88</abbr><abbr bid="B99">99</abbr><abbr bid="B102">102</abbr><abbr bid="B106">106</abbr><abbr bid="B115">115</abbr><abbr bid="B119">119</abbr><abbr bid="B124">124</abbr><abbr bid="B127">127</abbr><abbr bid="B142">142</abbr><abbr bid="B143">143</abbr><abbr bid="B144">144</abbr></abbrgrp>. The compiled T-cell immunity data was added as yet an additional criterion for further evaluation and ranking of the 189 antigens, leading to a total of 14 criteria.</p>
         </sec>
         <sec>
            <st>
               <p>Assignment of a qualitative score to the 189 vaccine candidates</p>
            </st>
            <p>The compilation of the experimental data with the bioinformatic and immunoinformatic analyses, together with biological reasoning resulted in a comprehensive knowledge-based dataset of the 189 vaccine candidates (Additional file <supplr sid="S2">2</supplr>: "Raw data of the 189 selected antigens"), according to a total of 14 criteria (Table <tblr tid="T1">1</tblr>). In order to rank the list, and further prioritize the antigens, we developed a scheme which is based on both a qualitative and a quantitative score calculated for each antigen. The scores are calculated according to 14 criteria, encompassing the 11 literature-based categories as well as T-cell immunome data, cell wall and repeats-derived information. The first scoring iteration assigns an equal weight to each criterion. A "+" sign is indicated for each indication in each of the criteria, and an arithmetic score is calculated, by summing the "+" signs (See Additional file <supplr sid="S3">3</supplr>: 'Qualitative scores for 189 selected antigens'). The scores obtained ranged between 9-1, exhibiting a normal distribution of number of antigens per scores, as follows: scores 9&#8211;8: 11.6%; scores 7&#8211;6: 32.8%; scores 5&#8211;4: 45.5% and; scores 3-1: 12%. This rather na&#239;ve scoring is of value in rendering a preliminary ranking, which allows to limit the bias that may arise in favor of antigens with an extensive coverage in the literature. Nevertheless, it suffers from some inherent limitations: (1) as described above, an equal weight was assigned to each criterion, yet their differential relevance to vaccine development is not reflected; (2) any data in a specific criterion contributed equally to the total qualitative score, regardless of the relative potency of the actual result reported in the publication from which the data was extracted, or for data obtained from our analyses. (3) in spite of a broad range of qualitative scores, their distributions reveal large clusters of genes. To note, groups of 17 and 19 genes harbor a qualitative score of 8 and 7, respectively, while a group of 44 genes have a qualitative score of 6. This implies that in order to prioritize the equally-scored antigens within the groups, we had to further refine our scoring system and apply additional measures.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Raw data of the 189 selected antigens</b>. The data (a '+' sign or an experimental value) extracted from global and particular analyses are provided, for each category and references therein (see Table <tblr tid="T1">1</tblr>). In addition, the qualitative and quantitative scores for each antigen in each of the categories are given.</p>
               </text>
               <file name="1755-8794-1-18-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Qualitative scores of the 189 selected antigens</b>. The qualitative score for each of the 189 selected ORF in each of the categories is provided. A "+" sign is indicated if data exists at least in one study in that particular category (see Table <tblr tid="T1">1</tblr> for data resources), and the arithmetic sum of all "+" for a particular ORF is given in the right column as the total qualitative score (Qual_Total). (a) The Gene name and annotation are based on the data deposited at the NCBI, [GenBank: <ext-link ext-link-type="gen" ext-link-id="AL123456">AL123456</ext-link>]. In square brackets: updated gene name and/or annotation, resulting from the analyses conducted in this study. For the complete data values of the 189 selected antigens in each of the categories, see Additional file <supplr sid="S2">2</supplr>: "Raw data of the 189 selected antigens".</p>
               </text>
               <file name="1755-8794-1-18-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Employing a numerical scheme for the assignment of quantitative scores</p>
            </st>
            <p>To address the above-described limitations of the qualitative measure, we introduced weighted internal scores to each of the 14 criteria, as presented in Table <tblr tid="T2">2</tblr>. The actual data for determination of the internal scores was extracted from the relevant experimental studies. The internal score scaling was based on the number of sources and/or the intensity of the results which contributed to the particular criterion. Accordingly, the maximal internal score was 2 for most criteria, except for Macrophage and Vaccine, which were up-weighted due to their significance/relevance to vaccine development, by giving a maximal internal score of 3. These scoring rules were also useful for the final ranking among high scoring genes. The final quantitative score calculated for each ORF product, is the total of the internal scores assigned in each of the 14 criteria for a particular ORF product. As expected, the quantitative scores allowed dissecting the large groups of equally qualitatively scored antigens, and ranking further the candidates. These scores can be used as a tool for further trimming down the number of vaccine candidates, and selecting for the top-hit targets. This process generated a list of 55 antigens with a qualitative score value of 6 and above. Of this list, 10 antigens were predicted to harbor more than 5 membrane-spanning domains (Rv0286, Rv0290, Rv0450c, Rv0754, Rv1196, Rv1348, Rv1736c, Rv1737c, Rv1997, Rv2123). Although potentially valuable vaccine targets, these antigens were removed from the list due to the potential technical problems which may occur during cloning and production of such recombinant proteins. Consequently, a list of 45 top ranking antigens is provided (Table <tblr tid="T3">3</tblr>). In this list, the antigens are sorted by their qualitative score and consecutively &#8211; by their quantitative score. According to this type of ranking, the antigens are clustered into groups, by limiting both the qualitative and the quantitative lower value scores in a certain group. For instance, Group I of antigens includes candidates having a qualitative score of 8 and above, provided that the quantitative score is not lower than 12. According to such an approach for ranking, the antigens were delineated into three groups: the first includes the 12 top best-hit antigens both in terms of qualitative and quantitative scores and the following second and third groups comprise of 20 and 13 antigens, respectively (Table <tblr tid="T3">3</tblr>). There are other modes of sorting which could be considered as well, giving more weight to the quantitative score vs. the qualitative score and vice versa. It should be mentioned though, that using these different ranking approaches had marginal consequences on the grouping, affecting mainly those antigens at the edges of each of the groups.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Numerical internal scores.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Criterion</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Internal score</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Maximal score</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Macrophage</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) significant evidence from one source</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) significant evidence from two sources</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(3) significant evidence from > two sources + high value</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MprAB</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) &lt;3.0 fold expression</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) >3.0 fold expression</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hypoxia</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) 1 evidence</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) >1 evidence, high values</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Reactivation</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) evidence from one source</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) evidence from two sources</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dormancy</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) low values</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) high values</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lung</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) evidence from one source</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) multiple evidences (or Rachman/Jain source)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>acr</it>-coregulated</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) up regulated</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Secreted</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) secreted</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) secreted+virulence-related function</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B-cell immunogen</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) evidence from one source</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) multiple evidences</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Iron regulated</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) low values</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) high values</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell wall</p>
                     </c>
                     <c ca="left">
                        <p>(0) not related</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) general association (without tm)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) virulence-related function</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Vaccine</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) DNA/protein immunization, immune respone but no protection</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) part of a multivalent construct, protection</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(3) DNA/protein vaccine protection</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Repeats</p>
                     </c>
                     <c ca="left">
                        <p>(0) no repeats</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) repeats only</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) repeats + virulence-related function</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T-cell</p>
                     </c>
                     <c cspan="2">
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Experimental</p>
                     </c>
                     <c ca="left">
                        <p>(0) no evidence</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) evidence from one source</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) multiple different evidences</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Predictions</p>
                     </c>
                     <c ca="left">
                        <p>(0) 0&lt; #supertypes &lt;6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(1) 6 &lt; #supertypes &lt;10</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>(2) 10&lt;#supertypes &lt;12</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Top-ranking 45 antigens (sorted by quantitative and qualitative scores).</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Group</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv #</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Gene <sup>(a)</sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Length (aa)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Annotation <sup>(a)</sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Qual Total</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Quant Total</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>
                              <it>I</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1738</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2450c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>rpfE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>172</p>
                     </c>
                     <c ca="left">
                        <p>probable resuscitation-promoting factor rpfE [transglycosylase]</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2623</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>TB31.7</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>297</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein TB31.7 [universal stress protein]</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1009</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>rpfB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>362</p>
                     </c>
                     <c ca="left">
                        <p>possible resuscitation-promoting factor rpfB [transglycosylase, C5 adhesion domain]</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv0867c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>rpfA</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>407</p>
                     </c>
                     <c ca="left">
                        <p>possible conserved trans-membrane protein [transglycosylase, rpfA]</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2031c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>acr (&#945;-crystallin)</p>
                     </c>
                     <c ca="center">
                        <p>144</p>
                     </c>
                     <c ca="left">
                        <p>heat-shock protein HspX (alpha-crystallin homolog) 14 kDa antigen Hsp16.3</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1886c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>fbpB </it>(Ag85B)</p>
                     </c>
                     <c ca="center">
                        <p>325</p>
                     </c>
                     <c ca="left">
                        <p>secreted antigen 85-B FBPB (85-B) (mycolyl-transferase 85B)</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv0288</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>esxH </it>(TB10.4)</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c ca="left">
                        <p>Low Mw protein antigen 7 esxH (10 kDa antigen) CFP-7, TB10.4)</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2032</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>acg</p>
                     </c>
                     <c ca="center">
                        <p>331</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein Acg [nitroreductase]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2626c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>143</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [CBS pair &#8211; binding/regulation, euk]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv3873</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>PPE68</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>368</p>
                     </c>
                     <c ca="left">
                        <p>PPE family protein [PPE68, RD1 T/B immunogen]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2005c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>295</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [USP-like]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv3127</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>344</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [possible nitroreductase]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>
                              <it>II</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1733c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>210</p>
                     </c>
                     <c ca="left">
                        <p>probable conserved trans-membrane protein</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1996</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>317</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [USP]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2389c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>rpfD</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>154</p>
                     </c>
                     <c ca="left">
                        <p>probable resuscitation-promoting factor rpfD [transglycosylase]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv0685</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Tuf</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>396</p>
                     </c>
                     <c ca="left">
                        <p>elongation factor Tu [iron-regulated]</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2628</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1980c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>mpb64</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>228</p>
                     </c>
                     <c ca="left">
                        <p>immunogenic protein MPT64</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv3804c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>fbpA </it>(Ag85A)</p>
                     </c>
                     <c ca="center">
                        <p>338</p>
                     </c>
                     <c ca="left">
                        <p>secreted antigen 85-A FBPA (85-A) (mycolyl-transferase 85A)</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv0079</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>273</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv3130c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>[<it>tgs1</it>]</p>
                     </c>
                     <c ca="center">
                        <p>463</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [diacylglycerol acyltransferase]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv3131</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>[<it>bfnB</it>]</p>
                     </c>
                     <c ca="center">
                        <p>332</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein [possible nitroreductase NfnB]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv0824c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>desA1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>389</p>
                     </c>
                     <c ca="left">
                        <p>probable acyl [-acyl-carrier-desaturase desA1]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1908c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>katG</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>740</p>
                     </c>
                     <c ca="left">
                        <p>catalase-peroxidase-peroxinitritase-T katG</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1174c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>[<it>sak5</it>]</p>
                     </c>
                     <c ca="center">
                        <p>110</p>
                     </c>
                     <c ca="left">
                        <p>Low Mw T-cell antigen TB8.4 [secretion antigen SA5K]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1349</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>[<it>irtB</it>]</p>
                     </c>
                     <c ca="center">
                        <p>579</p>
                     </c>
                     <c ca="left">
                        <p>probable drugs transport ATP-binding protein ABC transporter [ATM1 ABC siderophore-iron transporter]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1813c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>143</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2006</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>otsB1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1327</p>
                     </c>
                     <c ca="left">
                        <p>probable trehalose-6-phosphate phosphatase OTSB1</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2029c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>pfkB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>339</p>
                     </c>
                     <c ca="left">
                        <p>possible phosphofructokinase (pfkB)</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2627c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>413</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2780</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>ald</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>371</p>
                     </c>
                     <c ca="left">
                        <p>secreted L-alanine dehydrogenase ald (40 kDa antigen, TB43)</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>
                              <it>III</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv1884c</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>rpfC</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>176</p>
                     </c>
                     <c ca="left">
                        <p>probable resuscitation-promoting factor rpfC [transglycosylase]</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2620c</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>141</p>
                     </c>
                     <c ca="left">
                        <p>probable conserved transmembrane protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rv2744c</b>
                    