<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-244</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A survey of orphan enzyme activities</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Pouliot</snm>
               <fnm>Yannick</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>ypouliot@stanford.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Karp</snm>
               <mi>D</mi>
               <fnm>Peter</fnm>
               <insr iid="I1"/>
               <email>pkarp@ai.sri.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Research Group, Artificial Intelligence Center, SRI International, 333 Ravenswood Ave, Menlo Park, California, 94025-3493, USA</p>
            </ins>
            <ins id="I2">
               <p>Lane Medical Library and Knowledge Management Center, Information Resources and Technology, Stanford University Medical Center, 300 Pasteur Drive, Stanford, CA 94305-5123, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>244</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/244</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17623104</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-244</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>10</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>10</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Pouliot and Karp; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Using computational database searches, we have demonstrated previously that no gene sequences could be found for at least 36% of enzyme activities that have been assigned an Enzyme Commission number. Here we present a follow-up literature-based survey involving a statistically significant sample of such "orphan" activities. The survey was intended to determine whether sequences for these enzyme activities are truly unknown, or whether these sequences are absent from the public sequence databases but can be found in the literature.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We demonstrate that for ~80% of sampled orphans, the absence of sequence data is bona fide. Our analyses further substantiate the notion that many of these enzyme activities play biologically important roles.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This survey points toward the significant scientific cost of having such a large fraction of characterized enzyme activities disconnected from sequence data. It also suggests that a larger effort, beginning with a comprehensive survey of all putative orphan activities, would resolve nearly 300 artifactual orphans and reconnect a wealth of enzyme research with modern genomics. For these reasons, we propose that a systematic effort to identify the cognate genes of orphan enzymes be undertaken.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>After a decade of comprehensive genomic sequencing, more than 500 genomes have been sequenced to completion, mostly prokaryotes. The prodigious rate of new sequence annotation is highlighted by the fact that there were just over 300 genomes available when this study was carried out in late 2004. However, the fraction of genes for which no function can be predicted remains high (30%&#8211;50%). In response, proposals have been put forth for the bioinformatics analysis of bacterial genomes to identify genes with high likelihood of scoring true in confirmatory laboratory assays of their respective function <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. This would increase the field's pool of experimentally characterized proteins, with concomitant increases in the accuracy and coverage of genome annotation. We believe the return on investment of this approach would be particularly high when addressing the problem of orphan activities, that is, enzymatic activities for which no sequence information is available <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Decades of detailed enzymology have created a wealth of knowledge about enzymes and their activities. However, crucial aspects of these enzymes are absent from bioinformatics databases with surprising frequency. For example, recent computational analyses of sequence databases demonstrate that at least 36% of enzyme activities that have been assigned an Enzyme Commission (EC) number <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> appear to be devoid of a gene or protein sequence <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Similar analyses published since then have reached similar results <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The existence of such a large fraction of orphan activities is surprising, given that many of these enzymes were described decades ago and are often involved in basic cellular functions. Several examples exist of the recent identification of genes involved in important enzymatic functions (reviewed in <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>). Indeed, in our study 44 orphans were found to be present in one or more primary metabolic pathways in a variety of species (described below). Details of many of the orphan enzymes uncovered during this survey point to multiple and significant consequences of the lack of sequence information in areas such as genome annotation, computational pathway prediction, and metabolic engineering. For these reasons, the orphan problem and related issues were highlighted in a recent report of the American Society for Microbiology <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. In view of the biological richness associated with orphan enzymatic activities (Figure <figr fid="F1">1</figr>, Table <tblr tid="T1">1</tblr>), we have taken the first steps in creating the foundations of an Enzyme Genomics Initiative <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Example of a metabolic pathway involving a validated orphan</p>
            </caption>
            <text>
               <p>Example of a metabolic pathway involving a validated orphan.</p>
            </text>
            <graphic file="1471-2105-8-244-1"/>
         </fig>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Biological significance of selected validated orphans. The extent and significance of published research associated with a selection of validated orphans is detailed</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c ca="left">
                     <p><b>EC No.</b></p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Activity</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Year first published</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>No. PubMed Publications Involving Orphan</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Significance</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.43</p>
                  </c>
                  <c ca="left">
                     <p>Phosphogluconate 2-dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p>1961</p>
                  </c>
                  <c ca="left">
                     <p>2417</p>
                  </c>
                  <c ca="left">
                     <p>Positive reports of evaluation as a drug target against Trypanosome; trypanocidal activity has been reported; involved in 2-dehydro-D-gluconate degradation pathway</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>2.3.1.23</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>1-acylglycerophosphocholine O-acyltransferase</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>1967</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>256</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Activity is present in lower eukaryotes, plants, and multiple mammalian tissues</it>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>5.1.3.17</p>
                  </c>
                  <c ca="left">
                     <p>Heparosan-<it>N-</it>sulfate-glucuronate 5-epimerase</p>
                  </c>
                  <c ca="left">
                     <p>1979</p>
                  </c>
                  <c ca="left">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>Involved in the biosynthesis of heparan sulfate, which binds proteins to modulate signaling events in embryogenesis. Mouse gene knock-out results in late lethal phenotype.</p>
                     <p>Correction added in proof: Thanks to a comment by Dr. K. Robison and research by Dr. A. Shearer, we have found that 5.1.3.17 is an artifactual orphan rather than a validated orphan. Genes for this enzyme have been identified in cow and mouse (J Biol Chem 272:28158 1997; J Biol Chem 276:20069 2001).</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.3.1.105</p>
                  </c>
                  <c ca="left">
                     <p>Alkylglycerophosphate 2-O-acetyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>1986</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Involved in platelet-activating factor biosynthesis; possible involvement in ischemia</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>3.1.3.59</p>
                  </c>
                  <c ca="left">
                     <p>Alkylacetylglycerophosphatase</p>
                  </c>
                  <c ca="left">
                     <p>1986</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Involved in platelet-activating factor biosynthesis</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.7.1.106</p>
                  </c>
                  <c ca="left">
                     <p>Glucose-1,6-bisphosphate synthase</p>
                  </c>
                  <c ca="left">
                     <p>1975</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Present in several mammalian tissues. Involved in glucose metabolism</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.23</p>
                  </c>
                  <c ca="left">
                     <p>2-oxoaldehyde dehydrogenase (NAD+)</p>
                  </c>
                  <c ca="left">
                     <p>1967</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Involved in the development of diabetic complications</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.11.6</p>
                  </c>
                  <c ca="left">
                     <p>Thymine dioxygenase</p>
                  </c>
                  <c ca="left">
                     <p>1972</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Present in both lower and higher eukaryotes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.16</p>
                  </c>
                  <c ca="left">
                     <p>Galactitol 2-dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p>1956</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>Insulin dysregulation</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.3.1.14</p>
                  </c>
                  <c ca="left">
                     <p>Glutamine N-phenylacetyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>1957</p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
                  <c ca="left">
                     <p>Investigated as a predictor of carotid endarterectomy in middle-aged individuals</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.25</p>
                  </c>
                  <c ca="left">
                     <p>2-oxoisovalerate dehydrogenase (acylating)</p>
                  </c>
                  <c ca="left">
                     <p>1969</p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
                  <c ca="left">
                     <p>Present in prokaryotes and eukaryotes. In the latter, participates in primary metabolism pathway for valine degradation</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>EC 2.3.1.23 is listed in italics because it was cloned and sequenced in 2006, after the completion of this study.</p>
            </tblfn>
         </tbl>
         <p>Here we describe a literature-based survey of presumed orphans intended to further validate and characterize these activities (Figure <figr fid="F2">2</figr>). The confidence of the results of this survey was designed to be within a 5% error margin relative to the universe of orphan activities, based on a randomly selected subset of orphan activities from the Nomenclature Committee of the IUBMB (NC-IUBMB). We have also assessed the practicability of identifying the genes associated with these orphans. As a consequence, the survey captures data from the literature that should facilitate the identification of cognate genes for the orphan activities evaluated. Here, we define the cognate gene for an activity as a gene that has been shown to code for an enzyme that carries out that activity.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Literature survey process</p>
            </caption>
            <text>
               <p>Literature survey process.</p>
            </text>
            <graphic file="1471-2105-8-244-2"/>
         </fig>
         <p>The survey confirmed that ~80% of the sampled orphans do not have sequence information associated with them. Consequently, this lack represents a true information deficit. Weaknesses in database integration and a lack of information capture from the literature to databases appear to be largely responsible for most of the artifactual orphans making up the other 20%. Given the importance of these enzymatic activities, we propose that the public sequence databases assign high priority to correcting database entries for artifactual orphans. We further propose that a systematic effort be undertaken to sequence the genes of validated orphans, as this survey demonstrates that primary literature data and database analyses combined with current proteomics and genomic technologies should be adequate to enable the rapid identification of many of these genes.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Most orphan enzymatic activities are <it>bona fide</it> (Table <tblr tid="T2">2</tblr>). Our survey found that more than 80% of orphans are not due to artifacts such as missing database annotations (primarily failure to capture information from the literature) or lack of database cross-referencing (for example, a sequence available in one database not being reflected in a second database). Specifically, a total of 187 orphans out of 228 surveyed activities were validated in at least one of 287 species (species are listed in Table <tblr tid="T3">3</tblr> and Table <tblr tid="T4">4</tblr>; the list of validated orphans is in Table <tblr tid="T5">5</tblr>). A majority of orphans (54.36%) occurred in Eukaryotes, followed by Eubacteria (39.37%) (Table <tblr tid="T6">6</tblr>). Within the Eubacteria, the genus <it>Pseudomonas</it> was significantly overrepresented (35%) (Table <tblr tid="T7">7</tblr>). While a systematic determination of the species spectrum of orphan activities was not performed here, we did notice several cases of an orphan activity reported in more than one species, as well as one case of an orphan activity occurring in species from different domains.</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Summary of survey results</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Number</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Proportion</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Total number of putative orphans</p>
                  </c>
                  <c ca="left">
                     <p>1,356</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Number required to achieve 95% significance</p>
                  </c>
                  <c ca="left">
                     <p>180</p>
                  </c>
                  <c ca="left">
                     <p>13.3%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Number of orphans evaluated</p>
                  </c>
                  <c ca="left">
                     <p>228</p>
                  </c>
                  <c ca="left">
                     <p>16.8%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Out of 228 orphans:</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Number of artifactual orphans</p>
                  </c>
                  <c ca="left">
                     <p>41</p>
                  </c>
                  <c ca="left">
                     <p>18.0%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Number of valid orphans</p>
                  </c>
                  <c ca="left">
                     <p>187</p>
                  </c>
                  <c ca="left">
                     <p>82.0%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Max. number of salvageable orphans (all rankings)</p>
                  </c>
                  <c ca="left">
                     <p>57</p>
                  </c>
                  <c ca="left">
                     <p>25.0%</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Out of 57 salvageable orphans:</p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Number</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Proportion</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Excellent</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>15.8%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Good</p>
                  </c>
                  <c ca="left">
                     <p>23</p>
                  </c>
                  <c ca="left">
                     <p>40.4%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Marginal</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>15.8%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Poor</p>
                  </c>
                  <c ca="left">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>28.1%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Bacterial salvageable orphans</p>
                  </c>
                  <c ca="left">
                     <p>26</p>
                  </c>
                  <c ca="left">
                     <p>45.6%</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Eukaryotic salvageable orphans</p>
                  </c>
                  <c ca="left">
                     <p>31</p>
                  </c>
                  <c ca="left">
                     <p>54.4%</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>The survey was designed to achieve a maximum sampling error of 5%, 19 times out of 20. This corresponds to a minimum sample size of ~180 orphans. A total of 228 orphans were in fact surveyed. In a number of cases more than one instance of an orphan activity was evaluated because the activity was reported in more than one species. Consequently, 286 instances were evaluated.</p>
            </tblfn>
         </tbl>
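         <p>As an illustrative check of the sampling figures in Table 2, the margin of error of the observed validated fraction can be estimated from the table's own counts. The sketch below is not the procedure used in the survey: it assumes a normal approximation to the binomial, a finite population correction, and z = 1.96 for the "19 times out of 20" confidence level, and the function name <it>margin_of_error</it> is hypothetical.</p>
         <p>
```python
import math


def margin_of_error(p, n, population, z=1.96):
    """Half-width of a normal-approximation confidence interval for a
    proportion p estimated from n items drawn without replacement from
    a finite population (assumed method, not taken from the survey)."""
    standard_error = math.sqrt(p * (1.0 - p) / n)
    # Finite population correction: the sample is a sizeable share of the total.
    fpc = math.sqrt((population - n) / (population - 1))
    return z * standard_error * fpc


# Counts taken from Table 2.
POPULATION = 1356   # total number of putative orphans
SAMPLE = 228        # number of orphans evaluated
VALIDATED = 187     # number of valid orphans

p_hat = VALIDATED / SAMPLE
moe = margin_of_error(p_hat, SAMPLE, POPULATION)

print(f"validated fraction: {p_hat:.3f}")
print(f"margin of error:    {moe:.3f}")
```
         </p>
         <p>Under these assumptions the estimated margin of error falls below the 5% design target, which is consistent with the sample of 228 orphans being large enough for the stated confidence level.</p>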
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Species distribution of Eubacterial validated orphans</p>
            </caption>
            <tblbdy cols="4">
               <r>
                  <c ca="left">
                     <p>Species</p>
                  </c>
                  <c ca="left">
                     <p>No. of Orphans</p>
                  </c>
                  <c ca="left">
                     <p>Species</p>
                  </c>
                  <c ca="left">
                     <p>No. of Orphans</p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Acinetobacter NCIB 9871</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pasteurella tuberculosis</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Actinoplanes missouriensis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pedobacter heparinus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Aeromonas sp.</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Propionibacterium pentosaceum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Alcaligenes eutrophus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Proteus mirabilis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Alcaligenes faecalis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas (species undefined)</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Arthrobacter GJM-1</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas fluorescens</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Arthrobacter oxydans</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas graveolens</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Arthrobacter sp.</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas MS</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Azotobacter vinelandii</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas MSU-1</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Bacillus subtilis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas P-2</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cellulomonas sp.</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas putida</p>
                  </c>
                  <c ca="left">
                     <p>7</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Clostridium cylindrosporum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas putida P2</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Clostridium kluyveri</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas saccharophila</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Clostridium pasteurianum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas sp.</p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Clostridium SB4</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas sp. P-501</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Clostridium sporogenes</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas syringae GG</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Corynebacterium cyclohexanicum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pseudomonas testosteroni</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Escherichia coli</p>
                  </c>
                  <c ca="left">
                     <p>8</p>
                  </c>
                  <c ca="left">
                     <p>Rhodococcus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Flavobacterium</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Rhodopseudomonas sphaeroides</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Flavobacterium sp.</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Salmonella typhimurium</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Klebsiella aerogenes</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Streptococcus faecalis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Micrococcus denitrificans</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Streptococcus mutans</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Microorganism</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Streptomyces virginiae</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mycobacterium tuberculosis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Thiobacillus thioparus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Nocardia (species undefined)</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Unknown</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>The total number of orphans is greater than the number of activities because a given activity may be present in more than one species. The exact species of some orphans can be unclear or unstated, in which case these are classified under a generic term ("species undefined", "unknown", etc).</p>
            </tblfn>
         </tbl>
         <tbl id="T4">
            <title>
               <p>Table 4</p>
            </title>
            <caption>
               <p>Species distribution of Eukaryotic validated orphans</p>
            </caption>
            <tblbdy cols="4">
               <r>
                  <c ca="left">
                     <p>
                        <b>Species</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>No. of Orphans</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Species</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>No. of Orphans</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Acrocylindrium sp.</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Nectria haematococca/Fusarium solani f. sp. phaseoli</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Arachis hypogaea</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Neurospora (subspecies undefined)</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Asparagus officinalis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Neurospora crassa</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Aspergillus niger</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Ochromonas malhamensis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Avena coleoptiles</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Ovis aries</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Bauhinia monandra</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pisum sativum var. Alaska</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Bos taurus</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
                  <c ca="left">
                     <p>Penicillium atrovenetum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Capra hircus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Penicillium chrysogenum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Catharanthus roseus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Penicillium patulum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cavia porcellus</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
                  <c ca="left">
                     <p>Phaseolus aureus</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Chlorella</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Phaseolus radiatus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Chrysosplenium americanum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pisum sativum (variety unspecified)</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cichorium endivia</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Pycnoporus coccineus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Citrus (subspecies undefined)</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Raphanus sativus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Corydalis cava</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Rat (subspecies undefined)</p>
                  </c>
                  <c ca="left">
                     <p>18</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cucurbita maxima</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Rat Sprague-Dawley</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Daucus carota</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Rhodotorula glutinis</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Entamoeba histolytica</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Saccharomyces cerevisiae</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Euglena gracilis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Saccharum officinarum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Flaveria spp.</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Secale cereale</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Fundulus heteroclitus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Sesamum indicum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Gallus gallus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Several</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Homo sapiens</p>
                  </c>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>Sorghum bicolor</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Hordeum (species undefined)</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Spinacia</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Hordeum vulgare subsp. vulgare</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Spinacia oleracea</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Lasallia pustulata</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Sus scrofa</p>
                  </c>
                  <c ca="left">
                     <p>7</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Lilium longiflorum</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Tecoma stans</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Lupinus albus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Thea sinensis</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Lycopersicon esculentum</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Trypanosoma brucei brucei</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Macaca mulatta</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Tulipa cv. Apeldoorn</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mentha piperita</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Unknown</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mesocricetus auratus</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Yeast (species undefined)</p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mold</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Zea mays</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mouse (species undefined)</p>
                  </c>
                  <c ca="left">
                     <p>3</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>The total number of orphans is greater than the number of activities because a given activity may be present in more than one species. The exact species of some orphans can be unclear or unstated, in which case these are classified under a generic term ("mold", "mouse", "unknown", etc).</p>
            </tblfn>
         </tbl>
         <tbl id="T5">
            <title>
               <p>Table 5</p>
            </title>
            <caption>
               <p>Validated orphan activities</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c ca="left">
                     <p><b>EC No.</b></p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Ranking</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><b>EC No.</b></p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Ranking</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><b>EC No.</b></p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Ranking</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.13</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>1.14.99.24</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.3.47</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.16</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>1.21.3.2</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.3.59</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.43</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>1.97.1.3</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.3.72</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.54</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.112</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.4.43</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.84</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.137</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.1.6.17</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.92</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.141</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.1.8.2</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.101</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.143</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.56</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.144</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.147</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.77</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.146</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.84</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.100</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.163</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.1.1.99</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.112</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.172</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.1.2.4</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.115</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.196</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.14</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.128</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.208</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.23</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.136</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.226</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.24</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.2.1.137</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.245</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.33</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.2.2.10</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.258</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.49</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.4.11.16</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.265</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.68</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>3.4.13.7</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.2.5</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.96</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.4.17.16</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.3.23</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.98</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>3.4.21.103</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.17.1.1</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.102</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.4.22.44</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.17.99.2</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.103</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.4.22.46</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.18</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.105</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.4.23.28</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.20</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.114</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.4.23.30</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.23</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.133</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>3.4.24.54</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.25</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.140</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.30</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.32</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.3.1.161</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.33</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.33</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.2.3</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.39</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.52</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.3.2.7</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.58</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.54</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.3.3.3</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.62</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.63</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.23</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.67</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.1.64</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.29</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.71</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.3.6</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.41</p>
                  </c>
                  <c ca="left">
                     <p>valid</p>
                  </c>
                  <c ca="left">
                     <p>3.5.1.79</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.3.7</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.43</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.5.2.13</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.2.3.8</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.57</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.5.2.16</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.1.4</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.66</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.5.3.2</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.1.5</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.73</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.5.5.2</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.1.6</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.97</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.6.1.18</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.1.11</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.110</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.6.1.2</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.1.37</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.125</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.6.1.52</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.7.1</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.126</p>
                  </c>
                  <c ca="left">
                     <p>valid</p>
                  </c>
                  <c ca="left">
                     <p>3.6.3.17</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.99.15</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.153</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>3.6.3.24</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.3.99.21</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.167</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.6.3.28</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.4.1.11</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.176</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.6.4.4</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.4.1.17</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.180</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>4.1.1.24</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.4.99.4</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.184</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.1.1.52</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.4.99.5</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.4.1.215</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>4.1.1.56</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.5.1.21</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.4.2.35</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.1.1.75</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.5.99.11</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.5.1.4</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.1.2.23</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.6.5.7</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.5.1.42</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.1.2.28</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.7.3.1</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.6.1.22</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.1.2.35</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.7.3.5</p>
                  </c>
                  <c ca="left">
                     <p>valid</p>
                  </c>
                  <c ca="left">
                     <p>2.6.1.27</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>4.1.3.35</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.8.1.5</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.6.1.32</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.5</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.10.1.1</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.6.1.33</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.43</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.10.3.4</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.6.1.75</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.62</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.12.98.2</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.43</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.77</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.11.14</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.54</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.81</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.11.24</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.64</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.93</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.11.25</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.77</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.97</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.11.35</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.106</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.1.101</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.12.9</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.131</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>4.2.2.14</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.11.10</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.134</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.3.19</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.11.6</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>2.7.1.142</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.2.99.19</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.10</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
                  <c ca="left">
                     <p>2.7.4.20</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.3.1.10</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.23</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>2.7.7.44</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.3.1.20</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.24</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>2.7.7.51</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>4.5.1.4</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.42</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.7.8.10</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.1.1.6</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.51</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.7.8.22</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.1.1.9</p>
                  </c>
                  <c ca="left">
                     <p>marginal</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.58</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>2.8.1.3</p>
                  </c>
                  <c ca="left">
                     <p>excellent</p>
                  </c>
                  <c ca="left">
                     <p>5.1.3.17</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.60</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>2.8.2.28</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.2.1.10</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.72</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.1.36</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.2.1.11</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.13.73</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.1.1.39</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.4.3.5</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.15.2</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>3.1.1.40</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>5.5.1.11</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.16.5</p>
                  </c>
                  <c ca="left">
                     <p>good</p>
                  </c>
                  <c ca="left">
                     <p>3.1.1.78</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>5.5.1.12</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.99.18</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.1.2.11</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>5.5.1.3</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.14.99.22</p>
                  </c>
                  <c ca="left">
                     <p>artifact</p>
                  </c>
                  <c ca="left">
                     <p>3.1.3.14</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
                  <c ca="left">
                     <p>6.3.1.6</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>3.1.3.38</p>
                  </c>
                  <c ca="left">
                     <p>poor</p>
                  </c>
                  <c ca="left">
                     <p>6.3.4.8</p>
                  </c>
                  <c ca="left">
                     <p>difficult</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>All 228 orphans reviewed in this study are listed. The salvageability of an orphan is ranked "difficult" when factors such as an unclear species of origin, a lack of molecular descriptors, or the lack of a comprehensive genome sequence hinder cloning of the cognate gene. Note that these rankings do not take into account cases where molecular descriptors enable the identification of a candidate gene in one species and, through orthology, the identification of a candidate gene in a second species for which such descriptors are not available.</p>
            </tblfn>
         </tbl>
         <tbl id="T6">
            <title>
               <p>Table 6</p>
            </title>
            <caption>
               <p>Domain distribution of validated orphans</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>
                        <b>Domain</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>No. Species</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Proportion</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Eukaryota</p>
                  </c>
                  <c ca="left">
                     <p>156</p>
                  </c>
                  <c ca="left">
                     <p>54.36%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Eubacteria</p>
                  </c>
                  <c ca="left">
                     <p>113</p>
                  </c>
                  <c ca="left">
                     <p>39.37%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Unknown</p>
                  </c>
                  <c ca="left">
                     <p>15</p>
                  </c>
                  <c ca="left">
                     <p>5.23%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Viruses</p>
                  </c>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>0.70%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Archaea</p>
                  </c>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>0.35%</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Orphans with "Unknown" listed for their domain tend to be microbes that were insufficiently characterized to place them in either the Eubacteria or Archaea domains.</p>
            </tblfn>
         </tbl>
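         <p>The Proportion column of Table 6 appears to be each domain's species count divided by the sum of all counts listed in the table. The following Python sketch recomputes the column under that assumption; the names <it>species_per_domain</it> and <it>proportions</it> are illustrative only and do not come from the study.</p>

```python
# Species counts copied from Table 6.
species_per_domain = {
    "Eukaryota": 156,
    "Eubacteria": 113,
    "Unknown": 15,
    "Viruses": 2,
    "Archaea": 1,
}


def proportions(counts):
    """Return each count as a percentage of the column total (assumed denominator)."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 2) for name, n in counts.items()}


if __name__ == "__main__":
    for name, pct in proportions(species_per_domain).items():
        print(f"{name}: {pct:.2f}%")
```

         <p>Under this assumption the recomputed values agree with the percentages listed in Table 6 to two decimal places.</p>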
         <tbl id="T7">
            <title>
               <p>Table 7</p>
            </title>
            <caption>
               <p>Four most represented Eubacterial genera</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>
                        <b>Genus</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>No. instances of orphans</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Fraction of all Eubacteria</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Pseudomonas</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>27</p>
                  </c>
                  <c ca="left">
                     <p>35.06%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Escherichia</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>8</p>
                  </c>
                  <c ca="left">
                     <p>10.39%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Clostridium</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>7</p>
                  </c>
                  <c ca="left">
                     <p>9.09%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>Arthrobacter</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>4</p>
                  </c>
                  <c ca="left">
                     <p>5.19%</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Because the eventual isolation of the cognate genes of these activities is greatly facilitated by comprehensive genome sequencing, we determined the fraction of validated orphans for which a full genome sequence is available (Table <tblr tid="T8">8</tblr>). Of the Eubacterial species in which orphans occurred, 43% were found to have such a sequence, either already available or due shortly. This figure rises to 83% when the genomes of related species are included, on the assumption that they might be sufficiently closely related to permit the identification of the cognate gene. For example, at the time of this study the completed genome sequence of <it>Pseudomonas fluorescens </it>was not available, but those of three other <it>Pseudomonas </it>species were.</p>
         <tbl id="T8">
            <title>
               <p>Table 8</p>
            </title>
            <caption>
               <p>Availability of completely sequenced genomes for Eubacterial validated orphans</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="left">
                     <p>
                        <b>Complete genome sequence</b>
                     </p>
                  </c>
                  <c cspan="2" ca="left">
                     <p>
                        <b>Ongoing genome sequencing</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Count</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Proportion</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Count</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Proportion</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Same species</p>
                  </c>
                  <c ca="left">
                     <p>23</p>
                  </c>
                  <c ca="left">
                     <p>31.9%</p>
                  </c>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>18.4%</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Same genus, related species</p>
                  </c>
                  <c ca="left">
                     <p>12</p>
                  </c>
                  <c ca="left">
                     <p>16.7%</p>
                  </c>
                  <c ca="left">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>32.6%</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>The number of available comprehensive genome sequences for validated Eubacterial orphans was tallied. Cases where the genome sequence of a species does not exist but where the sequence of a related species from the same genus is available are also listed, as are ongoing comprehensive genomic sequencing projects for genomes not currently available.</p>
            </tblfn>
         </tbl>
         <p>Oxidoreductases (EC1) and transferases (EC2) were the most frequently represented classes of enzymatic activity for validated orphans (Figure <figr fid="F3">3</figr>). On a per capita basis, oxidoreductases and transferases were overrepresented by ~20%, whereas hydrolases and ligases were underrepresented by 35% and 64%, respectively.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Distribution of enzymatic activities in validated orphans</p>
            </caption>
            <text>
               <p><b>Distribution of enzymatic activities in validated orphans</b>. The percentage of validated orphan activities belonging to each EC class is shown.</p>
            </text>
            <graphic file="1471-2105-8-244-3"/>
         </fig>
         <p>The original publication dates of all orphans were broadly distributed around a mean of 1977 (Figure <figr fid="F4">4</figr>), compared with a mean of 1975 for validated orphans.</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Publication year of original publications describing orphan activities</p>
            </caption>
            <text>
               <p><b>Publication year of original publications describing orphan activities</b>. The publication date associated with the original source articles of all instances of orphans surveyed here is plotted (286 instances of orphans, corresponding to 228 activities), based upon the IUBMB record. In a number of cases more than one instance of an orphan activity was evaluated because the activity was reported in more than one species.</p>
            </text>
            <graphic file="1471-2105-8-244-4"/>
         </fig>
         <sec>
            <st>
               <p>Causes of artifacts</p>
            </st>
            <p>A comprehensive list of artifactual orphans and the inferred nature of each artifact is available <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Although this study was not designed to determine conclusively the causes of artifactuality, incompleteness in database entries appears to be the predominant cause of the artifacts identified here. For example, the DNA sequence associated with reaction 3.5.1.79 is available in the EMBL database; however, the UniProt entry for this enzyme does not list any protein sequence (Table <tblr tid="T9">9</tblr>). Other representative artifactual orphans are listed in Table <tblr tid="T9">9</tblr>, along with a description of the cause of the artifact. In a small fraction of cases, a clear determination of the species in which the activity was characterized could not be made.</p>
            <tbl id="T9">
               <title>
                  <p>Table 9</p>
               </title>
               <caption>
                  <p>Example artifactual orphans</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p><b>EC No</b>.</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Enzyme Name</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Original Species</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Year</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><b>Swiss-Prot/TrEMBL Acc. No</b>.</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Cause of artifact</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Significance of error; importance of orphan activity</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.4.21.103</p>
                     </c>
                     <c ca="left">
                        <p>Physarolisin (a proteinase)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Physarum flavicomum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1982</p>
                     </c>
                     <c ca="center">
                        <p>Q8MZS4</p>
                     </c>
                     <c ca="left">
                        <p>IUBMB entry lists a 2003 paper describing a gene coding for a protein with this activity [28]. Sequence is in Swiss-Prot but ENZYME does not reference this sequence.</p>
                     </c>
                     <c ca="left">
                        <p>Lack of database cross-referencing, presumably due to the long interval between the initial characterization of the activity and the cloning of the gene.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.5.2.16</p>
                     </c>
                     <c ca="left">
                        <p>Maleimide hydrolase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Blastobacter sp. A17p-4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1997</p>
                     </c>
                     <c ca="center">
                        <p>Q93T25</p>
                     </c>
                     <c ca="left">
                        <p>ENZYME and IUBMB entries do not reference a Swiss-Prot entry from a 2002 paper describing the cloning of the gene coding for this enzyme [29].</p>
                     </c>
                     <c ca="left">
                        <p>Lack of database cross-referencing is not restricted to older orphans.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.5.1.79</p>
                     </c>
                     <c ca="left">
                        <p>Phthalyl amidase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Xanthobacter agilis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1995</p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>The sequence, listed in a patent associated with a 1996 paper [30] in <it>Journal of Molecular Catalysis B: Enzymatic</it>, is available from Entrez but not from TrEMBL. The paper itself is not available from PubMed.</p>
                     </c>
                     <c ca="left">
                        <p>Note: though the protein sequence is not available from the UniProt database, the DNA sequence is present in the EMBL database.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.1.8.2</p>
                     </c>
                     <c ca="left">
                        <p>Diisopropyl-fluoro-phosphatase</p>
                     </c>
                     <c ca="left">
                        <p><it>Alteromonas sp</it>.</p>
                     </c>
                     <c ca="left">
                        <p>1954</p>
                     </c>
                     <c ca="center">
                        <p>Q44238</p>
                     </c>
                     <c ca="left">
                        <p>ENZYME and IUBMB entries do not reference a Swiss-Prot entry associated with a 1996 paper describing the cloning of a gene coding for an enzyme with this activity [31].</p>
                     </c>
                     <c ca="left">
                        <p>This enzymatic activity detoxifies nerve gas. The gene is part of a widespread gene family with otherwise unknown function, with members in <it>Homo sapiens</it>.</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Extent of salvageability</p>
            </st>
            <p>Validated orphans were analyzed to determine whether their published characterization provides sufficient information that, when combined with other factors, could enable the rapid identification of at least one cognate gene. Overall, we determined that 57 validated orphans (25% of total) might be salvageable (Figure <figr fid="F5">5A</figr>; Table <tblr tid="T10">10</tblr>), distributed approximately equally across eukaryotes and bacteria. A markedly larger share of bacterial orphans than of eukaryotic orphans was judged to have "excellent" or "good" salvageability: 70% (7+12 out of 27) vs. 48% (5+11 out of 33), respectively (Figure <figr fid="F5">5B</figr>). This discrepancy is primarily due to factors such as the much greater difficulty of purifying an activity from higher eukaryotes, the difficulty of obtaining enough starting protein from lower eukaryotes such as multicellular fungi, and the absence of a comprehensive genome sequence for species such as <it>Bos taurus </it>and <it>Sus scrofa</it>.</p>
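            <p>The salvageability percentages quoted above are simple ratios of the counts given in the text. As a minimal sketch, the following Python snippet reproduces them from those counts; the helper names <it>percent</it> and <it>favourable_share</it> and the dictionary layout are illustrative assumptions, not part of the study's methods.</p>

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


# Counts quoted in the text: orphans ranked "excellent" or "good",
# and the group total used as the denominator.
bacterial = {"excellent": 7, "good": 12, "total": 27}
eukaryotic = {"excellent": 5, "good": 11, "total": 33}


def favourable_share(group):
    """Share of a group ranked either "excellent" or "good"."""
    return percent(group["excellent"] + group["good"], group["total"])


if __name__ == "__main__":
    print("bacterial:", favourable_share(bacterial))    # 19 of 27
    print("eukaryotic:", favourable_share(eukaryotic))  # 16 of 33
    print("salvageable overall:", percent(57, 228))     # 57 of 228
```

            <p>With the counts above, the sketch yields 70 and 48 for the bacterial and eukaryotic groups and 25 for the overall salvageable share, matching the rounded figures in the text.</p>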
            <tbl id="T10">
               <title>
                  <p>Table 10</p>
               </title>
               <caption>
                  <p>Example validated orphans that are salvageable</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p><b>EC No</b>.</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Ranking</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p><b>EC No</b>.</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Ranking</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.1.1.226</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>1.4.1.11</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.1.1.265</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>1.4.1.17</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.14.13.58</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>1.5.1.21</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2.4.1.180</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>2.3.1.98</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2.8.1.3</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>2.6.1.58</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.2.1.112</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>2.6.1.75</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.2.1.128</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>3.1.3.47</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.5.1.58</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>3.1.6.17</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3.6.1.18</p>
                     </c>
                     <c ca="left">
                        <p>excellent</p>
                     </c>
                     <c ca="left">
                        <p>3.2.1.100</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.1.1.172</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>3.4.17.16</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.1.1.196</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>3.5.1.30</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.13.12.9</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>3.5.3.2</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.14.13.23</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>4.2.1.43</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.14.16.5</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>4.2.1.62</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1.2.1.54</p>
                     </c>
                     <c ca="left">
                        <p>good</p>
                     </c>
                     <c ca="left">
                        <p>5.5.1.11</p>
                     </c>
                     <c ca="left">
                        <p>Good</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Validated orphans with a salvageability ranking of "good" or better are listed.</p>
               </tblfn>
            </tbl>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Salvageability ranking of validated orphans</p>
               </caption>
               <text>
                  <p><b>Salvageability ranking of validated orphans</b>. The suitability of validated orphans for eventual cloning of at least one cognate gene was evaluated according to the ranking system described in the text. Out of 228 orphans, 57 were judged to be salvageable. A: Overall salvageability ranking (percentage out of 57); B: Domain distribution of salvageable orphans (number of orphans). Note that the total is greater than 57 because some orphans have different evaluations in the different species in which they have been reported. One orphan is also shared between Eubacteria and Eukaryotes.</p>
               </text>
               <graphic file="1471-2105-8-244-5"/>
            </fig>
            <p>Overall, more than half of the salvageable orphans ranked "good" or "excellent", with oxidoreductases (EC1) and hydrolases (EC3) being overrepresented in that set. All other enzymatic classes were significantly underrepresented (Figure <figr fid="F6">6</figr>).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Distribution of enzymatic activities for salvageable orphans ranked "good" and "excellent"</p>
               </caption>
               <text>
                  <p>Distribution of enzymatic activities for salvageable orphans ranked "good" and "excellent".</p>
               </text>
               <graphic file="1471-2105-8-244-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This survey demonstrates that ~80% of orphan enzymatic activities are <it>bona fide</it>; therefore, we conclude that of the 1,356 putative orphans extant at the time of this study, more than 1,000 are highly likely to constitute true information deficits since their lack of sequence information is not the result of a database error.</p>
         <p>The absence of DNA or protein sequences encoding such well-characterized enzymatic activities is particularly consequential because these activities were often identified decades ago, and many have been the focus of significant research activity (Table <tblr tid="T1">1</tblr>). Without the cognate sequences for these activities, the quality of annotation of all sequenced genomes in terms of both coverage (fraction of genes that can be recognized) and accuracy (fraction of predicted gene functions that are correct) is diminished. Many of these activities may go for years without being sequenced &#8211; for example, 1-acylglycerophosphocholine O-acyltransferase (Table <tblr tid="T1">1</tblr>) was finally purified and sequenced nearly forty years after it was first characterized <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Perhaps more troubling is the unknown pool of "false positive" annotations. Phosphogluconate 2-dehydrogenase (Table <tblr tid="T1">1</tblr>), an orphan at the time of this analysis, has since been assigned to a sequence in the human genome with no experimental evidence linking it to that or any homologous sequence, but apparently instead on the basis of the gene in question already being assigned a similar activity. This kind of "hidden orphan" would have been missed by most orphan analyses, and can be expected to propagate a potentially incorrect assignment to other genomes in the future. Computational metabolic pathway prediction <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and metabolic engineering also depend on sequence information and are thus similarly compromised.</p>
         <p>Conversely, ~20% of orphans surveyed were observed to be artifacts, such that ~270 orphans out of 1,356 putative orphans examined should be resolvable entirely via literature research and database cleanup. As a result of this process as it was carried out on our sampling of orphans, we have reported 11 artifactual orphan activities to public sequence repositories for correction (see Table <tblr tid="T8">8</tblr> for examples).</p>
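         <p>The extrapolation from the sample to the full list can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: it assumes the rounded fractions quoted in the text (20% and 80%), and it reuses the finite-population formula of Equation 1 to estimate a margin of error for the sample of 228. The function names are hypothetical.</p>

```python
import math

def extrapolate(fraction, population):
    """Scale a fraction observed in the sample to the whole population."""
    return fraction * population

def margin_of_error(p, n, population, z=1.96):
    """Half-width of the interval implied by Equation 1 for a sample of size n."""
    se = math.sqrt(p * (1 - p) / n * (population - n) / population)
    return z * se

N, n = 1356, 228                      # putative orphans, orphans surveyed
artifacts = extrapolate(0.20, N)      # assumed rounded artifact fraction
validated = extrapolate(0.80, N)      # assumed rounded validated fraction
moe = margin_of_error(0.20, n, N)

print(round(artifacts), round(validated))  # 271 1085
print(round(moe, 3))
```

         <p>With these rounded inputs the estimate is about 271 artifacts and about 1,085 validated orphans, with a margin of error of roughly five percentage points on either fraction.</p>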
         <p>In addition to validating orphans, the survey was useful in capturing information from the literature to assess their salvageability: more than half of validated orphans were found to be salvageable (Figure <figr fid="F5">5</figr>). Examples of salvageable orphan activities with the traits that make them salvageable are listed in Table <tblr tid="T11">11</tblr>.</p>
         <tbl id="T11">
            <title>
               <p>Table 11</p>
            </title>
            <caption>
               <p>Selected salvageable orphans</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c ca="left">
                     <p><b>EC No</b>.</p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Ranking</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Pathways</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Activity</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Species</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Full Genome Sequence?</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Ongoing Genomic Sequencing?</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Mr (kDa)</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>pI (pH units)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>3.5.1.30</p>
                  </c>
                  <c ca="left">
                     <p>Good</p>
                  </c>
                  <c ca="left">
                     <p>None</p>
                  </c>
                  <c ca="left">
                     <p>5-aminopentanamidase</p>
                  </c>
                  <c ca="left">
                     <p><it>Pseudomonas putida </it>P2, <it>Pseudomonas fluorescens</it></p>
                  </c>
                  <c ca="left">
                     <p>Yes (<it>P. putida</it>)*</p>
                  </c>
                  <c ca="left">
                     <p>Several <it>Pseudomonas </it>species</p>
                  </c>
                  <c ca="left">
                     <p>67</p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>5.5.1.11</p>
                  </c>
                  <c ca="left">
                     <p>Good</p>
                  </c>
                  <c ca="left">
                     <p>None</p>
                  </c>
                  <c ca="left">
                     <p>Dichloromuconate cycloisomerase</p>
                  </c>
                  <c ca="left">
                     <p><it>Alcaligenes eutrophus </it>JMP 134 (<it>Ralstonia eutropha </it>JMP134)</p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
                  <c ca="left">
                     <p>Yes</p>
                  </c>
                  <c ca="left">
                     <p>40 &#177; 10</p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>4.2.1.97</p>
                  </c>
                  <c ca="left">
                     <p>Marginal</p>
                  </c>
                  <c ca="left">
                     <p>None</p>
                  </c>
                  <c ca="left">
                     <p>Phaseollidin hydratase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Fusarium solani </it>f. sp. <it>phaseoli</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>No</p>
                  </c>
                  <c ca="left">
                     <p>Different species (<it>Fusarium sporotrichioides</it>)</p>
                  </c>
                  <c ca="left">
                     <p>monomer 1: 47; monomer 2: 49</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.3.1.103</p>
                  </c>
                  <c ca="left">
                     <p>Poor</p>
                  </c>
                  <c ca="left">
                     <p>None</p>
                  </c>
                  <c ca="left">
                     <p>Sinapoylglucose&#8211;sinapoylglucose O-sinapoyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Raphanus sativus</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
                  <c ca="left">
                     <p>55</p>
                  </c>
                  <c ca="left">
                     <p>N/A</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>A selection of orphans with different salvageability rankings is listed. Pathway names are those used in the MetaCyc database. *: The genomes of several strains of <it>P. fluorescens </it>are in the final stages of assembly and are essentially fully sequenced. N/A: not available.</p>
            </tblfn>
         </tbl>
         <p>As abundantly noted elsewhere, such database cleansing is essential to maximize the existing research investment and prevent the propagation of mistakes <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> (see Table <tblr tid="T12">12</tblr> for examples of artifacts that have been resolved). This necessity has not eluded the field of enzymology <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, and the present survey demonstrates the usefulness of correlating biological databases and mining the literature to enhance the value of existing research and facilitate the identification of the remaining orphan-associated genes. Until recently, there were no general repositories of orphan activity data, although some species-specific databases and pages were maintained, such as EchoBase <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and a web page listing unidentified <it>E. coli </it>enzymes maintained by the EcoCyc project <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Consequently, we updated the MetaCyc <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> database to identify reactions that have been analyzed by this survey, and annotated them and associated database objects with results such as the validity of their orphan status, links to their cognate protein in the case of artifacts, and the properties of the protein copurifying with the activity in the case of validated orphans. Recently, Lespinet and Labedan created ORENZA <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, a database dedicated to maintaining an up-to-date listing of all enzyme activities for which no sequences are available in major sequence databases <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. We are contributing our updated orphan information to ORENZA as well. 
These data, captured in MetaCyc and ORENZA, should facilitate the work of enzymologists interested in identifying the cognate genes of orphan activities. For instance, the work of Melnick <it>et al</it>. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> is an excellent example of the combined application of modern laboratory and bioinformatics techniques that would benefit from the data described here.</p>
         <tbl id="T12">
            <title>
               <p>Table 12</p>
            </title>
            <caption>
               <p>Examples of artifactual orphans resolved by this survey</p>
            </caption>
            <tblbdy cols="4">
               <r>
                  <c ca="left">
                     <p><b>EC No</b>.</p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Enzyme Name</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Species</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><b>TrEMBL/Swiss-Prot Accession No</b>.</p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.1.1.163</p>
                  </c>
                  <c ca="left">
                     <p>Cyclopentanol dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p><it>Comamonas </it>sp.</p>
                  </c>
                  <c ca="left">
                     <p>Q8GAV9</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.13.11.24</p>
                  </c>
                  <c ca="left">
                     <p>Quercetin 2,3-dioxygenase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Bacillus subtilis</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>P42106</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>3.6.3.24</p>
                  </c>
                  <c ca="left">
                     <p>Nickel-transporting ATPase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Escherichia coli</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>P33593</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.1.1.143</p>
                  </c>
                  <c ca="left">
                     <p>24-methylenesterol C-methyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Arabidopsis thaliana</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Q94JS4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2.1.1.143</p>
                  </c>
                  <c ca="left">
                     <p>24-methylenesterol C-methyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Arabidopsis thaliana</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Q39227</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>All Swiss-Prot entries listed here have been updated with the corresponding EC number.</p>
            </tblfn>
         </tbl>
         <p>Several proposals have recently been made to produce a complete catalog of biochemical activities, biological functions, and their cognate genes <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Many of these proposals recommend that such a project begin with prokaryotes because of the general ease of gene cloning from these species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Our data support this notion: we find substantially more orphans with a salvageability ranking of "good" or "excellent" in prokaryotes than in eukaryotes. The comprehensive review of the problem achieved by this survey, combined with broad genomic sequencing and powerful computational tools, leads us to conclude that the field is in an excellent position to rectify the information gap associated with the orphan activity phenomenon.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>More than one third of enzyme activities with assigned EC numbers are orphan activities, having no associated gene or protein sequence. We carried out a literature-based survey of a representative sample of presumed orphans intended to further validate and characterize these orphan activities. We have also assessed the practicability of identifying the genes associated with these orphans. In doing so, we captured data from the literature that should assist in future identification of cognate genes for the orphan activities we examined.</p>
         <p>This survey confirmed that about 80% of sampled orphan activities have no sequence information associated with them, either in databases or in the literature. Weaknesses in database integration and failure to capture information from the literature account for most of the remaining 20%.</p>
         <p>This survey points toward the significant scientific cost of having such a large fraction of characterized enzyme activities disconnected from sequence data. It also suggests that a larger effort, beginning with a comprehensive survey of all putative orphan activities, would resolve nearly 300 artifactual orphans and reconnect a wealth of enzyme research with modern genomics. For these reasons, we propose that a systematic effort to identify the cognate genes of orphan enzymes be undertaken.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Literature survey process</p>
            </st>
            <p>This survey was performed from June through August 2004 and relied on enzyme activities described by the NC-IUBMB. This enzyme classification and nomenclature system is hierarchical and is based upon the reaction catalyzed. It assigns a specific numerical identifier, an EC number, to each distinct enzymatic activity. The first digit represents the class of reaction catalyzed (e.g., oxidoreductases are EC1; transferases are EC2). The second digit of the EC number refers to the subclass, which generally contains information about the type of compound or group involved (e.g., an enzyme acting on the CH-OH group of donors, or acting on the aldehyde or oxo group of donors). The third digit defines the sub-subclass, which specifies the nature of the reaction. The fourth digit is a serial number that is used to identify the individual enzyme within a sub-subclass (see <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> for a description of the classification system).</p>
            <p>It is important to bear in mind that distinct proteins catalyzing the same reaction are assigned the same EC number. Since the EC system is based upon the reaction catalyzed, when applied to a protein it describes a biochemical function of this protein. That function can also be shared by several proteins (isozymes) that can be coded by genes in the same or different species.</p>
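            <p>The four-level structure of an EC number lends itself to a simple programmatic representation. The following is a minimal sketch, not part of the survey's tooling: the names <it>ECNumber</it>, <it>parse_ec</it> and <it>EC_CLASSES</it> are hypothetical, and the class table lists the six top-level classes in use at the time of the survey.</p>

```python
from typing import NamedTuple

# Top-level EC classes in use at the time of the survey (first digit).
EC_CLASSES = {
    1: "oxidoreductases",
    2: "transferases",
    3: "hydrolases",
    4: "lyases",
    5: "isomerases",
    6: "ligases",
}

class ECNumber(NamedTuple):
    enzyme_class: int   # first digit: class of reaction catalyzed
    subclass: int       # second digit: type of compound or group involved
    sub_subclass: int   # third digit: nature of the reaction
    serial: int         # fourth digit: serial number within the sub-subclass

def parse_ec(text: str) -> ECNumber:
    """Split a fully specified EC number such as '3.5.1.30' into its four levels."""
    parts = text.strip().split(".")
    if len(parts) != 4:
        raise ValueError(f"expected four dot-separated fields, got {text!r}")
    return ECNumber(*(int(part) for part in parts))

ec = parse_ec("3.5.1.30")
print(ec.enzyme_class, EC_CLASSES[ec.enzyme_class])  # 3 hydrolases
```

            <p>Because the identifier describes a reaction rather than a protein, several distinct proteins may map to the same <it>ECNumber</it> value.</p>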
            <p>Presumed orphan EC numbers were identified using the BioWarehouse database system <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. BioWarehouse <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> is an integrated database that enables cross-database queries using the structured query language (SQL). SRI's BioWarehouse instance was queried for enzymatic activities with no matching sequences in any major protein sequence databases, including TrEMBL, PIR, SWISS-PROT, CMR, ENZYME, and BioCyc (the selection of these databases is described in <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>). This query returned an initial list of 1,356 EC numbers that had not been retired or merged at the time of the survey.</p>
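            <p>Conceptually, this kind of query is an anti-join: keep every active EC number to which no protein record refers. The sketch below illustrates the idea with an in-memory SQLite database. The two-table schema and the retired entry '9.9.9.9' are invented for illustration and are not the BioWarehouse schema or the query actually used in the survey.</p>

```python
import sqlite3

# Toy schema and rows invented for illustration; NOT the BioWarehouse schema.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE ec_number (ec TEXT PRIMARY KEY, retired INTEGER NOT NULL);
    CREATE TABLE protein_ec (accession TEXT NOT NULL, ec TEXT NOT NULL);
    INSERT INTO ec_number VALUES
        ('1.13.11.24', 0), ('3.5.1.30', 0), ('5.5.1.11', 0), ('9.9.9.9', 1);
    INSERT INTO protein_ec VALUES ('P42106', '1.13.11.24');
    """
)

def putative_orphans(connection):
    """Return active EC numbers that no protein record refers to."""
    rows = connection.execute(
        """
        SELECT e.ec
        FROM ec_number AS e
        LEFT JOIN protein_ec AS p ON p.ec = e.ec
        WHERE e.retired = 0 AND p.accession IS NULL
        ORDER BY e.ec
        """
    )
    return [ec for (ec,) in rows]

print(putative_orphans(conn))  # ['3.5.1.30', '5.5.1.11']
```

            <p>In this toy example, 1.13.11.24 is excluded because a protein record carries that EC number, and the retired entry is excluded by the status filter.</p>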
            <p>This list was randomized, and the primary literature associated with a sample of these putative orphans was processed successively in that random order. The sample size needed to represent the total pool of EC numbers accurately was calculated using Equation 1. Approximately 180 orphans are required to achieve better than 95% confidence, given the total number of EC numbers. Since a sample of 228 orphans was ultimately surveyed, the size required for the 95% confidence level was exceeded.</p>
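            <p>Randomizing the list and then working through it in order is equivalent to drawing a simple random sample without replacement. A minimal sketch follows, assuming a fixed seed for reproducibility. The seed value, the function name <it>survey_order</it> and the five-entry list are hypothetical and serve only as an illustration.</p>

```python
import random

def survey_order(ec_numbers, seed):
    """Return a reproducibly shuffled copy of the putative-orphan list."""
    rng = random.Random(seed)      # private generator; global state untouched
    shuffled = list(ec_numbers)    # copy so the input list is not modified
    rng.shuffle(shuffled)
    return shuffled

putative = ["1.1.1.172", "2.6.1.58", "3.5.1.30", "4.2.1.43", "5.5.1.11"]
order = survey_order(putative, seed=2004)
sample = order[:3]  # review the first entries of the randomized list
print(sample)
```

            <p>Taking a prefix of the shuffled list means the sample can be extended later without biasing the selection.</p>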
            <sec>
               <st>
                  <p>Equation 1: sample size estimation</p>
               </st>
               <p indent="1"><it>SE </it>is the standard error associated with the survey, and is derived by dividing the sampling error by 1.96, such that for a sampling error of 5% (95% confidence interval), the standard error is 0.0255102. <it>p </it>is the probability that the EC number is a true positive, that is, there is truly no sequence information for that EC number; this value is 0.85 based on data from a preliminary survey. <it>N </it>is the universe of orphan activities. Solving for <it>n </it>provides the sample size.</p>
               <p>
                  <display-formula>SE<sup>2 </sup>= [(p(1-p)/n)] [(N-n)/N]</display-formula>
               </p>
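               <p>Equation 1 can be solved for <it>n </it>in closed form: dividing both sides by p(1-p) gives 1/n = SE<sup>2</sup>/(p(1-p)) + 1/N. The sketch below evaluates this rearrangement. It assumes that <it>N </it>is the 1,356 putative orphans returned by the query; a different choice of <it>N </it>gives a different result. The function name is hypothetical.</p>

```python
import math

def required_sample_size(sampling_error, p, population, z=1.96):
    """Solve SE^2 = [p(1-p)/n] * [(N-n)/N] for n."""
    se = sampling_error / z
    # Rearranged form of Equation 1: 1/n = SE^2 / (p(1-p)) + 1/N
    inverse_n = se ** 2 / (p * (1 - p)) + 1 / population
    return 1 / inverse_n

# Assumed inputs: 5% sampling error, p = 0.85, N = 1,356 putative orphans.
n = required_sample_size(sampling_error=0.05, p=0.85, population=1356)
print(math.ceil(n))  # 172
```

               <p>Under these assumptions the formula yields about 171 orphans, of the same order as the approximately 180 quoted above; a larger <it>N </it>moves the result upward. In either case the 228 orphans surveyed exceed the requirement.</p>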
               <p>A comprehensive manual analysis of the literature associated with this sample of 228 orphans drawn from the randomized list was performed as outlined in Figure <figr fid="F2">2</figr>. Various databases (Table <tblr tid="T13">13</tblr>) were consulted to extract the data elements listed in Table <tblr tid="T14">14</tblr>. For each selected putative orphan in the sample, the text search engine ExPASy Proteomics Server <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> was used to search TrEMBL, ENZYME, and IUBMB database records to confirm the absence of sequence data. For each orphan, all protein names, author names, reaction names, substrate names, and product names listed in the IUBMB record for that orphan were used as query arguments.</p>
               <tbl id="T13">
                  <title>
                     <p>Table 13</p>
                  </title>
                  <caption>
                     <p>Main data sources used by the orphan survey</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="left">
                           <p>
                              <b>Database name</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Content</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Source</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Accessed via...</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>TrEMBL [32]</p>
                        </c>
                        <c ca="left">
                           <p>Comprehensive protein and DNA sequence data</p>
                        </c>
                        <c ca="left">
                           <p>Swiss Institute of Bioinformatics</p>
                        </c>
                        <c ca="left">
                           <p>Web</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Comprehensive Microbial Resource (CMR [33])</p>
                        </c>
                        <c ca="left">
                           <p>Extensive genomic data for microbial species</p>
                        </c>
                        <c ca="left">
                           <p>The Institute for Genomic Research</p>
                        </c>
                        <c ca="left">
                           <p>BioWarehouse</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BioCyc databases</p>
                        </c>
                        <c ca="left">
                           <p>Collection of pathway/genome databases primarily concerned with microbial species</p>
                        </c>
                        <c ca="left">
                           <p>Bioinformatics Research Group, SRI International</p>
                        </c>
                        <c ca="left">
                           <p>BioWarehouse</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>IUBMB Enzyme Nomenclature [34]</p>
                        </c>
                        <c ca="left">
                           <p>Description of enzymes that have been assigned an EC number by the Enzyme Commission</p>
                        </c>
                        <c ca="left">
                           <p>Nomenclature Committee of the International Union of Biochemistry and Molecular Biology</p>
                        </c>
                        <c ca="left">
                           <p>Web and BioWarehouse</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ENZYME [35]</p>
                        </c>
                        <c ca="left">
                           <p>Repository of information relative to the nomenclature of enzymes</p>
                        </c>
                        <c ca="left">
                           <p>Swiss Institute of Bioinformatics</p>
                        </c>
                        <c ca="left">
                           <p>Web and BioWarehouse</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NCBI Taxonomy [36]</p>
                        </c>
                        <c ca="left">
                           <p>Taxonomy database</p>
                        </c>
                        <c ca="left">
                           <p>National Center for Biotechnology Information</p>
                        </c>
                        <c ca="left">
                           <p>Web and BioWarehouse</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PubMed</p>
                        </c>
                        <c ca="left">
                           <p>Literature database</p>
                        </c>
                        <c ca="left">
                           <p>National Library of Medicine</p>
                        </c>
                        <c ca="left">
                           <p>Web</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>The primary literature associated with each orphan's entry in the IUBMB database <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B26">26</abbr></abbrgrp>, which describes the isolation and characterization of the activity, was reviewed for the presence of sequence information. We also examined these papers for molecular descriptors that might be useful in cloning the associated genes, particularly M<sub>r</sub>, pI, and details of the purification scheme (Table <tblr tid="T14">14</tblr>). Systematic searches of PubMed were performed to ascertain whether publications other than those cited by IUBMB contained relevant sequence and molecular descriptor data. A total of 331 publications (1.45 papers per orphan) were examined for such descriptors. Data obtained from these publications were assembled into a database.</p>
               <tbl id="T14">
                  <title>
                     <p>Table 14</p>
                  </title>
                  <caption>
                     <p/>
                  </caption>
                  <tblbdy cols="1">
                     <r>
                        <c ca="left">
                           <p>Name of enzyme activity</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Is lack of sequence confirmed?</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Bibliographical data (publication dates, authors, institutions)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Name of species</p>
                        </c>
                     </r>
                     <r>
                        <c indent="1" ca="left">
                           <p>Can the species associated with the original publications be unambiguously identified?</p>
                        </c>
                     </r>
                     <r>
                        <c indent="1" ca="left">
                           <p>Is a comprehensive genome sequence available for those species?</p>
                        </c>
                     </r>
                     <r>
                        <c indent="2" ca="left">
                           <p>Are comprehensive genome sequences available from closely-related species?</p>
                        </c>
                     </r>
                     <r>
                        <c indent="1" ca="left">
                           <p>Is there ongoing genomic sequencing for those species or from closely-related species?</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Are molecular data such as M<sub>r</sub> and pI available?</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Does the purification and characterization procedure suggest that purifying this enzyme should be reasonably straightforward?</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>All artifactual orphans (orphans for which sequence information was found during the literature review process) were reported promptly to the Swiss Institute of Bioinformatics, the European Bioinformatics Institute, the ORENZA database, and the Nomenclature Committee of the IUBMB so that the relevant database entries could be updated.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Data sources and database analyses</p>
            </st>
            <p>The initial searches for presumed orphan activities were performed using BioWarehouse version 3.0 (SRI International) running under Oracle 10g (Oracle Corporation, Redwood Shores, California). BioWarehouse is a bioinformatics data warehousing environment developed under the Bio-SPICE program <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Identification and ranking of salvageable orphans</p>
            </st>
            <p>Salvageable orphans are orphan activities for which it is likely that at least one cognate gene can be identified and confirmed in a practical manner. Salvageability was assessed by ranking validated orphans according to how likely and how practical it is to identify at least one cognate gene, isolate the gene product, and demonstrate that it catalyzes the enzymatic activity.</p>
            <p>Orphans were ranked based on data in the original literature, combined with the availability of complete genome sequences for the species in which an orphan was first elucidated. The principal ranking factors are (1) clear identification of the species involved and its ease of growth; (2) the availability of molecular descriptors, most importantly the molecular mass (M<sub>r</sub>), but also the isoelectric point (pI); (3) the types of purification and analytical techniques used in the original literature; and (4) evidence that the protein can be purified with reasonable effort using current techniques, based on factors such as specific activity, purification yield, number of steps involved, and availability of substrate and of alternate purification procedures. "Excellent" and "Good" ratings indicate an activity that is associated with a sequenced organism and whose purification and assay are likely to be straightforward to replicate. "Difficult" activities are those with challenging purifications or complex assays, but with a sequenced target organism or a sequenced related organism. "Marginal" activities are those for which sequencing is in progress in the target organism or in a related organism. "Poor" activities are those for which no genome sequence is available, or for which sequencing is in progress in a related organism, and for which assay or purification conditions are likely to be hard to replicate.</p>
            <sec>
               <st>
                  <p>Data availability</p>
               </st>
               <p>Information about validated orphan activities has been entered into the MetaCyc database <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Other data generated by our survey can be found at <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>Enzyme Commission (EC), Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB), Structured Query Language (SQL)</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>PK conceived the study. PK and YP jointly devised the methodology. YP performed the literature research and drafted the manuscript. PK revised the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was funded by grant MCB-0438571 from the U.S. National Science Foundation. The BioWarehouse is funded by contract F30602-01-C-0153 from the Defense Advanced Research Projects Agency. This material is based upon work supported by DARPA and the Air Force Research Laboratory under Contract No. F30602-01-C-0153. We gratefully acknowledge Dr. Tadhg P. Begley, Department of Chemistry and Chemical Biology, Cornell University, for help in analyzing biochemical purification protocols; Dr. Ron Caspi, Bioinformatics Research Group, SRI International, for support with the MetaCyc database and analysis of purification protocols; and Dr. Alexander Shearer, Bioinformatics Research Group, SRI International, for assistance with manuscript revision and resubmission.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Identifying protein function&#8211;a call for community action</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <issue>3</issue>
            <fpage>E42</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">368155</pubid>
                  <pubid idtype="pmpid" link="fulltext">15024411</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020042</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>'Conserved hypothetical' proteins: prioritization of targets for experimental study</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>18</issue>
            <fpage>5452</fpage>
            <lpage>5463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524295</pubid>
                  <pubid idtype="pmpid" link="fulltext">15479782</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh885</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Call for an enzyme genomics initiative</p>
            </title>
            <aug>
               <au>
                  <snm>Karp</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>8</issue>
            <fpage>401</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">507876</pubid>
                  <pubid idtype="pmpid" link="fulltext">15287973</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-8-401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Orphan enzymes?</p>
            </title>
            <aug>
               <au>
                  <snm>Lespinet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Labedan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <issue>5706</issue>
            <fpage>42a</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.307.5706.42a</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). Enzyme Nomenclature. Recommendations 1992. Supplement 4: corrections and additions (1997)</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Eur J Biochem</source>
            <pubdate>1997</pubdate>
            <volume>250</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1432-1033.1997.001_1.x</pubid>
                  <pubid idtype="pmpid">9431984</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>ORENZA: a web resource for studying ORphan ENZyme activities</p>
            </title>
            <aug>
               <au>
                  <snm>Lespinet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Labedan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>436</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1609188</pubid>
                  <pubid idtype="pmpid" link="fulltext">17026747</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-436</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Orphan enzymes could be an unexplored reservoir of new drug targets</p>
            </title>
            <aug>
               <au>
                  <snm>Lespinet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Labedan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Drug Discov Today</source>
            <pubdate>2006</pubdate>
            <volume>11</volume>
            <issue>7&#8211;8</issue>
            <fpage>300</fpage>
            <lpage>305</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.drudis.2006.02.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">16580971</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>An Experimental Approach to Genome Annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Linn</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buckley</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <publisher>American Society for Microbiology</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Enzyme Genomics Site</p>
            </title>
            <url>http://bioinformatics.ai.sri.com/enzyme-genomics</url>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Identification and characterization of a lysophosphatidylcholine acyltransferase in alveolar type II cells</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hyatt</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Mucenski</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>31</issue>
            <fpage>11724</fpage>
            <lpage>11729</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1544237</pubid>
                  <pubid idtype="pmpid" link="fulltext">16864775</pubid>
                  <pubid idtype="doi">10.1073/pnas.0604946103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Computational prediction of human metabolic pathways from the complete human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Romero</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wagg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Kaiser</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Krummenacker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>R2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">549063</pubid>
                  <pubid idtype="pmpid" link="fulltext">15642094</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-6-1-r2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Predicting functions from protein sequences&#8211;where are the bottlenecks?</p>
            </title>
            <aug>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1998</pubdate>
            <volume>18</volume>
            <issue>4</issue>
            <fpage>313</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0498-313</pubid>
                  <pubid idtype="pmpid" link="fulltext">9537411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>What we do not know about sequence analysis and sequence databases</p>
            </title>
            <aug>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>753</fpage>
            <lpage>754</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.753</pubid>
                  <pubid idtype="pmpid" link="fulltext">10366280</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Keeping Genome Databases Clean and Up to Date</p>
            </title>
            <aug>
               <au>
                  <snm>Pennisi</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <issue>5439</issue>
            <fpage>447</fpage>
            <lpage>450</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.286.5439.447</pubid>
                  <pubid idtype="pmpid" link="fulltext">10577208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Retrieving sequences of enzymes experimentally characterized but erroneously annotated : the case of the putrescine carbamoyltransferase</p>
            </title>
            <aug>
               <au>
                  <snm>Naumoff</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Glansdorff</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Labedan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>1</issue>
            <fpage>52</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514541</pubid>
                  <pubid idtype="pmpid" link="fulltext">15287962</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-52</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Puzzling over orphan enzymes</p>
            </title>
            <aug>
               <au>
                  <snm>Lespinet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Labedan</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>2006</pubdate>
            <volume>63</volume>
            <issue>5</issue>
            <fpage>517</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00018-005-5520-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">16465439</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>EchoBASE: an integrated post-genomic database for Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Misra</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Horler</snm>
                  <fnm>RSP</fnm>
               </au>
               <au>
                  <snm>Reindl</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Goryanin</snm>
                  <fnm>II</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>suppl 1</issue>
            <fpage>D329</fpage>
            <lpage>333</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p>EcoCyc orphan page</p>
            </title>
            <url>http://ecocyc.org/enzymes.shtml</url>
         </bibl>
         <bibl id="B19">
            <title>
               <p>MetaCyc: a multiorganism database of metabolic pathways and enzymes</p>
            </title>
            <aug>
               <au>
                  <snm>Caspi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Foerster</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fulcher</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hopkinson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ingraham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kaipa</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krummenacker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Tissier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D511</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347490</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381923</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj128</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>ORENZA</p>
            </title>
            <url>http://www.orenza.u-psud.fr</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Identification of the two missing bacterial genes involved in thiamine salvage: thiamine pyrophosphokinase and thiamine kinase</p>
            </title>
            <aug>
               <au>
                  <snm>Melnick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>J-H</fnm>
               </au>
               <au>
                  <snm>Kinsland</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Perkins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schyns</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vassieva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Osterman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Begley</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>11</issue>
            <fpage>3660</fpage>
            <lpage>3662</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">415752</pubid>
                  <pubid idtype="pmpid" link="fulltext">15150256</pubid>
                  <pubid idtype="doi">10.1128/JB.186.11.3660-3662.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>History of the enzyme nomenclature system</p>
            </title>
            <aug>
               <au>
                  <snm>Tipton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Boyce</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>34</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.1.34</pubid>
                  <pubid idtype="pmpid" link="fulltext">10812475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>BioWarehouse: a bioinformatics database warehouse toolkit</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Pouliot</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Stringer-Calvert</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Tenenbaum</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>170</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1444936</pubid>
                  <pubid idtype="pmpid" link="fulltext">16556315</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-170</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>BioWarehouse</p>
            </title>
            <url>http://bioinformatics.ai.sri.com/biowarehouse/</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>ExPASy Proteomics Server</p>
            </title>
            <url>http://www.expasy.org/cgi-bin/sprot-search-ful</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>IUBMB database</p>
            </title>
            <url>http://www.chem.qmul.ac.uk/iubmb/enzyme/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>MetaCyc</p>
            </title>
            <url>http://www.metacyc.org</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Structural and enzymatic characterization of physarolisin (formerly physaropepsin) proves that it is a unique serine-carboxyl proteinase</p>
            </title>
            <aug>
               <au>
                  <snm>Nishii</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ueki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Miyashita</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kojima</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Murakami-Murofushi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2003</pubdate>
            <volume>301</volume>
            <issue>4</issue>
            <fpage>1023</fpage>
            <lpage>1029</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0006-291X(03)00083-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">12589815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Cloning, sequence analysis of imidase gene from Alcaligenes eutrophus and its expression in <it>E. coli</it></p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Wei Sheng Wu Xue Bao</source>
            <pubdate>2002</pubdate>
            <volume>42</volume>
            <issue>2</issue>
            <fpage>153</fpage>
            <lpage>162</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12557390</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Discovery, purification, and properties of o-phthalyl amidase from Xanthobacter agilis</p>
            </title>
            <aug>
               <au>
                  <snm>Briggs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kreuzman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Whitesitt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yeh</snm>
                  <fnm>W-K</fnm>
               </au>
               <au>
                  <snm>Zmijewski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Catalysis B: Enzymatic</source>
            <pubdate>1996</pubdate>
            <volume>2</volume>
            <issue>1</issue>
            <fpage>53</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/1381-1177(96)00011-2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Cloning and expression of a gene encoding a bacterial enzyme for decontamination of organophosphorus nerve agents and nucleotide sequence of the enzyme</p>
            </title>
            <aug>
               <au>
                  <snm>Cheng</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Harvey</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>1996</pubdate>
            <volume>62</volume>
            <issue>5</issue>
            <fpage>1636</fpage>
            <lpage>1641</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">167937</pubid>
                  <pubid idtype="pmpid" link="fulltext">8633861</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The Universal Protein Resource (UniProt)</p>
            </title>
            <aug>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Boeckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gasteiger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Magrane</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Redaschi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yeh</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>suppl 1</issue>
            <fpage>D154</fpage>
            <lpage>D159</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The Comprehensive Microbial Resource</p>
            </title>
            <aug>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Dickinson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>1</issue>
            <fpage>123</fpage>
            <lpage>125</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29848</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125067</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.123</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB). Enzyme Nomenclature. Recommendations 1992. Supplement 5: corrections and additions (1999)</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Eur J Biochem</source>
            <pubdate>1999</pubdate>
            <volume>250</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>6</lpage>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The ENZYME database in 2000</p>
            </title>
            <aug>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>1</issue>
            <fpage>304</fpage>
            <lpage>305</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102465</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592255</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Database resources of the National Center for Biotechnology Information</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>DiCuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Helmberg</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kenton</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pontius</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Schriml</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Suzek</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yaschenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>suppl 1</issue>
            <fpage>D39</fpage>
            <lpage>D45</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
