<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1745-6150-2-33</ui>
   <ji>1745-6150</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Makarova</snm>
               <mi>S</mi>
               <fnm>Kira</fnm>
               <insr iid="I1"/>
               <email>makarova@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A2">
               <snm>Sorokin</snm>
               <mi>V</mi>
               <fnm>Alexander</fnm>
               <insr iid="I1"/>
               <email>sorokin@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A3">
               <snm>Novichkov</snm>
               <mi>S</mi>
               <fnm>Pavel</fnm>
               <insr iid="I1"/>
               <email>novichko@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A4">
               <snm>Wolf</snm>
               <mi>I</mi>
               <fnm>Yuri</fnm>
               <insr iid="I1"/>
               <email>wolf@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Koonin</snm>
               <mi>V</mi>
               <fnm>Eugene</fnm>
               <insr iid="I1"/>
               <email>koonin@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA</p>
            </ins>
         </insg>
         <source>Biology Direct</source>
         <issn>1745-6150</issn>
         <pubdate>2007</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>33</fpage>
         <url>http://www.biology-direct.com/content/2/1/33</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18042280</pubid>
               <pubid idtype="doi">10.1186/1745-6150-2-33</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>02</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>27</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>27</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Makarova et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: <url>ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/</url>.</p>
            </sec>
            <sec>
               <st>
                  <p>Reviewers</p>
               </st>
               <p>This article was reviewed by Peer Bork, Patrick Forterre, and Purificacion Lopez-Garcia.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>A robust classification of genes based on accurately deciphered evolutionary relationships is the cornerstone of comparative and evolutionary genomics. Such a classification is indispensable both for the functional annotation of sequenced genomes and for any genome-wide evolutionary reconstruction. The construction of an evolutionary classification of genes is a non-trivial task because of the complexity of homologous relationships between genes. The two principal classes of homologs are orthologs and paralogs. Orthologs are homologous genes that evolved via vertical descent from a single ancestral gene in the last common ancestor of the compared species. Paralogs are homologous genes, which, at some stage of evolution, have evolved by duplication of an ancestral gene <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Orthology and paralogy are intimately linked because, if a duplication (or a series of duplications) occurs after the speciation event that separated the compared species, orthology becomes a relationship between sets of paralogs, rather than individual genes (in which case, such genes are called co-orthologs).</p>
         <p>Correct identification of orthologs and paralogs is of central importance for both the functional and the evolutionary aspects of comparative genomics. Orthologs typically occupy the same functional niche in different organisms; by contrast, paralogs evolve to functional diversification as they diverge after the duplication <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Therefore, the accuracy of genome annotation critically depends on the accurate identification of orthologs <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. A clear demarcation of orthologs and paralogs is also required for constructing evolutionary scenarios which include, along with vertical inheritance, lineage-specific gene loss and horizontal gene transfer (HGT) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>In principle, orthologs, including co-orthologs, should be identified by means of phylogenetic analysis of entire families of homologous proteins in the compared genomes, which is expected to define orthologous protein sets as clades. However, for genome-wide protein sets, such analysis remains extremely labor-intensive, and error-prone as well <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Accordingly, procedures have been developed for identification of sets of likely orthologs without an explicit referral to phylogenetic analysis. These procedures are based on the notion of a genome-specific best hit (BeT), i.e., the protein from a target genome that is most similar (typically, in terms of similarity scores computed using BLAST or another sequence comparison method) to a given protein from the query genome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The assumption central to this approach is that orthologs have a greater similarity to each other than to any other protein from the respective genomes. When multiple genomes are analyzed, pairs of probable orthologs detected on the basis of BeTs are combined into orthologous clusters represented in all or a subset of the analyzed genomes. This approach, amended with additional procedures for detecting co-orthologous protein sets and for treating multidomain proteins, was implemented in the database of Clusters of Orthologous Groups (COGs) of proteins <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. The latest COG set released in 2003 includes ~70% of the proteins encoded in 69 genomes of prokaryotes and unicellular eukaryotes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The COGs have been employed for functional annotation of newly sequenced genomes (e.g. <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, comparative analysis of gene neighborhoods <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> and other types of connections between genes, as implemented in the widely used STRING tool <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, target selection in structural genomics (e.g. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and various genome-wide evolutionary analyses <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Independently, other groups have developed similar methodologies for identification of orthologs and paralogs in pairwise or multiple genome comparisons <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Very recently, a major effort on automatic construction of sets of orthologous genes has culminated in the EggNOG database which employed the COGs as a prototype and a seed <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
         <p>The methods for the construction of COGs were developed and originally applied to small sets of genomes; these and other related methods do not guarantee correct identification of the paralogous and orthologous relationships, due to the variability of domain architectures of proteins, differential loss of paralogs in different lineages, extreme divergence of some orthologous and paralogous genes, and other complications <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The computational cost of exhaustive genome comparisons also grows almost prohibitively with the steep increase in the number of sequenced genomes which approached 500 in the beginning of 2007 <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Thus, several smaller scale studies have been conducted in which COGs were constructed for compact groups of bacteria including the <it>Thermus-Deinococcus </it>group <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, <it>Cyanobacteria </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and <it>Lactobacillales </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In each of these analyses, a considerably better resolution of the homologous relationship than in the overall COG set has been achieved.</p>
         <p>In the previous comparative-genomic analyses of archaea, we delineated COGs for this domain of life and used them to partition archaeal genes into the evolutionarily stable, conserved core and the "shell" of genes that are often lost during evolution or are characteristic of a narrow group of species <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>; we further traced the dynamics of drop in the number of the core genes with sequencing of additional archaeal genomes <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>.</p>
         <p>Here we present the updated set of COGs that includes 41 sequenced archaeal genomes and delineate the core sets of genes that are represented in all archaea or in the major archaeal divisions, Euryarchaeota and Crenarchaeota. We further describe evolutionary reconstructions aimed at inferring the nature of the Last Archaeal Common Ancestor (LACA) and other ancestral forms, and uncovering the trends of gene loss and gain during archaeal evolution.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>The archaeal genomic data set and construction of archaeal COGs</p>
            </st>
            <p>Table <tblr tid="T1">1</tblr> lists the basic features of the analyzed archaeal genomes. The now available set of genomes represents reasonably well the genomic, taxonomic, and ecological diversity of archaea. The genome span the range from ~0.58 Mb (the parasite <it>Nanoarchaeum equitans</it>) to ~5.8 Mb (the mesophilic euryarchaeon <it>Methanosarcina acetivorum</it>); there are 20 hyperthermophiles and 21 mesophiles and moderate thermophiles; 27 genomes represent the Euryrchaeota, 13 belong to the Crenarchaeota, and the remaining one is <it>N. equitans </it>whose taxonomic position is considered uncertain <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The 41 archaeal genomes included in the arCOGs</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c ca="left">
                        <p>
                           <it>Species</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Division</p>
                     </c>
                     <c ca="left">
                        <p>Lineage</p>
                     </c>
                     <c ca="left">
                        <p>Abbreviation</p>
                     </c>
                     <c ca="left">
                        <p>Genome size, Mb</p>
                     </c>
                     <c ca="left">
                        <p>Number of annotated protein-coding genes</p>
                     </c>
                     <c ca="left">
                        <p>OGT<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Life style and other features</p>
                     </c>
                     <c ca="left">
                        <p>Ref<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>GenBank accession</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aeropyrum pernix</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Desulfurococcales</p>
                     </c>
                     <c ca="left">
                        <p>Aerpe</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1700</p>
                     </c>
                     <c ca="left">
                        <p>90&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Aerobic chemorganotroph, sulfur enhances growth</p>
                     </c>
                     <c ca="left">
                        <p>[60]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="BA000002.3">BA000002.3</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Caldivirga maquilingensis </it>IC-167</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Calma</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1943</p>
                     </c>
                     <c ca="left">
                        <p>90&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Moderate acidophile, heterotroph, anaerobe or microaerophyle</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AAXQ00000000">AAXQ00000000</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Cenarchaeum symbiosum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Cenarchaeales</p>
                     </c>
                     <c ca="left">
                        <p>Censy</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2017</p>
                     </c>
                     <c ca="left">
                        <p>~10&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Moderate psychrophile, uncultivated symbiont of sponges</p>
                     </c>
                     <c ca="left">
                        <p>[33]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="DP000238">DP000238</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Hyperthermus butylicus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Desulfurococcales</p>
                     </c>
                     <c ca="left">
                        <p>Hypbu</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1602</p>
                     </c>
                     <c ca="left">
                        <p>>100&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Hyperthermophilic neutrophile, anaerobe</p>
                     </c>
                     <c ca="left">
                        <p>[61]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000493.1">CP000493.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrobaculum aerophilum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrae</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2605</p>
                     </c>
                     <c ca="left">
                        <p>100&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Facultative nitrate-reducing anaerobe</p>
                     </c>
                     <c ca="left">
                        <p>[62]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE009441.1">AE009441.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Pyrobaculum calidifontis </it>JCM 11548</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrca</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2149</p>
                     </c>
                     <c ca="left">
                        <p>100&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Pyrae</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000561.1">CP000561.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Pyrobaculum islandicum </it>DSM 4184</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Pyris</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1978</p>
                     </c>
                     <c ca="left">
                        <p>100&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Pyrae</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000504.1">CP000504.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Staphylothermus marinus </it>F1</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Desulfurococcales</p>
                     </c>
                     <c ca="left">
                        <p>Stama</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                     <c ca="left">
                        <p>1570</p>
                     </c>
                     <c ca="left">
                        <p>80&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Anaerobic submarine heterotroph</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000575.1">CP000575.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Sulfolobus acidocaldarius </it>DSM 639</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Sulfolobales</p>
                     </c>
                     <c ca="left">
                        <p>Sulac</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2223</p>
                     </c>
                     <c ca="left">
                        <p>80&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Aerobic thermoacidophile</p>
                     </c>
                     <c ca="left">
                        <p>[63]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000077.1">CP000077.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sulfolobus solfataricus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Sulfolobales</p>
                     </c>
                     <c ca="left">
                        <p>Sulso</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2977</p>
                     </c>
                     <c ca="left">
                        <p>80&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Sulfur-metabolizing chemorganotroph, thermoacidophilic, motile aerobe</p>
                     </c>
                     <c ca="left">
                        <p>[64]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE006641.1">AE006641.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sulfolobus tokodaii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Sulfolobales</p>
                     </c>
                     <c ca="left">
                        <p>Sulto</p>
                     </c>
                     <c ca="left">
                        <p>2.7</p>
                     </c>
                     <c ca="left">
                        <p>2825</p>
                     </c>
                     <c ca="left">
                        <p>80&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Sulso</p>
                     </c>
                     <c ca="left">
                        <p>[65]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="BA000023.2">BA000023.2</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Thermofilum pendens </it>Hrk 5</p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Thepe</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1876</p>
                     </c>
                     <c ca="left">
                        <p>92&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Acidophilic anaerobe</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000505.1">CP000505.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermoproteus tenax</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Thete</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>2021</p>
                     </c>
                     <c ca="left">
                        <p>96&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Facultative hydrogen-sulfur authotroph, anaerobe</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>n/a</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Archaeoglobus fulgidus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Archaeoglobales</p>
                     </c>
                     <c ca="left">
                        <p>Arcfu</p>
                     </c>
                     <c ca="left">
                        <p>2.2</p>
                     </c>
                     <c ca="left">
                        <p>2420</p>
                     </c>
                     <c ca="left">
                        <p>83&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Motile, anaerobic, sulfate-reducing chemolito- or chemorgano- autothroph</p>
                     </c>
                     <c ca="left">
                        <p>[66]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE000782.1">AE000782.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Haloarcula marismortui </it>ATCC 43049</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Halma</p>
                     </c>
                     <c ca="left">
                        <p>4.3</p>
                     </c>
                     <c ca="left">
                        <p>4240</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemoorganotrophic obligate halophile</p>
                     </c>
                     <c ca="left">
                        <p>[67]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AY596297.1">AY596297.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Halobacterium sp</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Halsp</p>
                     </c>
                     <c ca="left">
                        <p>2.6</p>
                     </c>
                     <c ca="left">
                        <p>2622</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Aerobic chemorganotroph, obligate halophile, proteolytic, motile, with cell envelope; 2 extrachromosomal elements</p>
                     </c>
                     <c ca="left">
                        <p>[68]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE004437.1">AE004437.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Haloquadratum walsbyi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Halwa</p>
                     </c>
                     <c ca="left">
                        <p>3.2</p>
                     </c>
                     <c ca="left">
                        <p>2646</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Halophilic, aerobic heterotroph</p>
                     </c>
                     <c ca="left">
                        <p>[69]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AM180088.1">AM180088.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methano-thermobacter thermo-autotrophicus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Metth</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1873</p>
                     </c>
                     <c ca="left">
                        <p>65&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemolitoautothroph, strict anaerobe, nitrogen-fixing methanogen</p>
                     </c>
                     <c ca="left">
                        <p>[70]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE000666.1">AE000666.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanococcoides burtonii </it>DSM 6242</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metbu</p>
                     </c>
                     <c ca="left">
                        <p>2.6</p>
                     </c>
                     <c ca="left">
                        <p>2273</p>
                     </c>
                     <c ca="left">
                        <p>23&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Psychrotolerant, strictly anaerobic, slightly halophilic methylotroph</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000300.1">CP000300.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanocaldo-coccus jannaschii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanococcales</p>
                     </c>
                     <c ca="left">
                        <p>Metja</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1786</p>
                     </c>
                     <c ca="left">
                        <p>85&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemolito-autothrophic, strictly anaerobic, motile methanogen, 2 extrachromosomal elements</p>
                     </c>
                     <c ca="left">
                        <p>[71]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="L77117.1">L77117.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanococcus maripaludis </it>C5</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanococcales</p>
                     </c>
                     <c ca="left">
                        <p>MetmC</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1822</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Mesophilic hydrogenotrophic, nitrogen-fixing methanogen</p>
                     </c>
                     <c ca="left">
                        <p>[72]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000609.1">CP000609.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanococcus maripaludis </it>S2</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanococcales</p>
                     </c>
                     <c ca="left">
                        <p>Metmp</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1722</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>same as MetmC</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="BX950229.1">BX950229.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanocorpusculum labreanum </it>Z</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanomicrobiales</p>
                     </c>
                     <c ca="left">
                        <p>Metla</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1739</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Strictly anaerobic, CO<sub>2 </sub>fixing methanogen</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000559.1">CP000559.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanoculleus marisnigri </it>JR1</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanomicrobiales</p>
                     </c>
                     <c ca="left">
                        <p>Metcu</p>
                     </c>
                     <c ca="left">
                        <p>2.5</p>
                     </c>
                     <c ca="left">
                        <p>2489</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Strictly anaerobic methanogen</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000562.1">CP000562.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanopyrus kandleri</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanopyrales</p>
                     </c>
                     <c ca="left">
                        <p>Metka</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1687</p>
                     </c>
                     <c ca="left">
                        <p>110&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemolito-autothrophic, strictly anaerobic, methanogen, high intracellular salt concentration</p>
                     </c>
                     <c ca="left">
                        <p>[41]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE009439.1">AE009439.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanosaeta thermophila </it>PT</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metsa</p>
                     </c>
                     <c ca="left">
                        <p>1.9</p>
                     </c>
                     <c ca="left">
                        <p>1696</p>
                     </c>
                     <c ca="left">
                        <p>60&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Strictly anaerobic methanogen</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000477.1">CP000477.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanosarcina acetivorans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metac</p>
                     </c>
                     <c ca="left">
                        <p>5.8</p>
                     </c>
                     <c ca="left">
                        <p>4540</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemolito-autothrophic, anaerobic, nitrogen-fixing, versatile methanogen, motile, forms multicellular structures</p>
                     </c>
                     <c ca="left">
                        <p>[73]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE010299.1">AE010299.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanosarcina barkeri fusaro</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metba</p>
                     </c>
                     <c ca="left">
                        <p>4.8</p>
                     </c>
                     <c ca="left">
                        <p>3624</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Mac</p>
                     </c>
                     <c ca="left">
                        <p>[74]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000099.1">CP000099.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanosarcina mazei</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metma</p>
                     </c>
                     <c ca="left">
                        <p>4.1</p>
                     </c>
                     <c ca="left">
                        <p>3370</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Mac</p>
                     </c>
                     <c ca="left">
                        <p>[75]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE008384.1">AE008384.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanosphaera stadtmanae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Metst</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1534</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Methanogen, human intestinal inhabitant</p>
                     </c>
                     <c ca="left">
                        <p>[76]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000102.1">CP000102.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Methanospirillum hungatei </it>JF-1</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Methanomicrobiales</p>
                     </c>
                     <c ca="left">
                        <p>Methu</p>
                     </c>
                     <c ca="left">
                        <p>3.5</p>
                     </c>
                     <c ca="left">
                        <p>3139</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Strictly anaerobic methanogen</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000254.1">CP000254.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Natronomonas pharaonis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Natph</p>
                     </c>
                     <c ca="left">
                        <p>2.8</p>
                     </c>
                     <c ca="left">
                        <p>2822</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Extreme haloalkaliphile</p>
                     </c>
                     <c ca="left">
                        <p>[77]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CR936257.1">CR936257.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Picrophilus torridus </it>DSM 9790</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoplasmales</p>
                     </c>
                     <c ca="left">
                        <p>Picto</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                     <c ca="left">
                        <p>1535</p>
                     </c>
                     <c ca="left">
                        <p>65&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Extremely acidophilic moderate thermophile</p>
                     </c>
                     <c ca="left">
                        <p>[78]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE017261.1">AE017261.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrococcus abyssi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermococcales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrab</p>
                     </c>
                     <c ca="left">
                        <p>1.8</p>
                     </c>
                     <c ca="left">
                        <p>1898</p>
                     </c>
                     <c ca="left">
                        <p>96&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Pho</p>
                     </c>
                     <c ca="left">
                        <p>[79]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AL096836.1">AL096836.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrococcus furiosus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermococcales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrfu</p>
                     </c>
                     <c ca="left">
                        <p>1.9</p>
                     </c>
                     <c ca="left">
                        <p>2125</p>
                     </c>
                     <c ca="left">
                        <p>96&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Pho</p>
                     </c>
                     <c ca="left">
                        <p>[80]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE009950.1">AE009950.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrococcus horikoshii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermococcales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrho</p>
                     </c>
                     <c ca="left">
                        <p>1.7</p>
                     </c>
                     <c ca="left">
                        <p>1955</p>
                     </c>
                     <c ca="left">
                        <p>96&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Anaerobic, motile heterotroph</p>
                     </c>
                     <c ca="left">
                        <p>[81]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="BA000001.2">BA000001.2</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Thermococcus kodakaraensis </it>KOD1</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermococcales</p>
                     </c>
                     <c ca="left">
                        <p>Theko</p>
                     </c>
                     <c ca="left">
                        <p>2.1</p>
                     </c>
                     <c ca="left">
                        <p>2306</p>
                     </c>
                     <c ca="left">
                        <p>85&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Anaerobic heterotroph</p>
                     </c>
                     <c ca="left">
                        <p>[82]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AP006878.1">AP006878.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermoplasma acidophilum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoplasmales</p>
                     </c>
                     <c ca="left">
                        <p>Theac</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                     <c ca="left">
                        <p>1482</p>
                     </c>
                     <c ca="left">
                        <p>59&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Chemorganotrophic, thermoacidophilic, motile facultative anaerobe</p>
                     </c>
                     <c ca="left">
                        <p>[83]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AL139299.1">AL139299.1</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermoplasma volcanium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>Thermoplasmales</p>
                     </c>
                     <c ca="left">
                        <p>Thevo</p>
                     </c>
                     <c ca="left">
                        <p>1.6</p>
                     </c>
                     <c ca="left">
                        <p>1499</p>
                     </c>
                     <c ca="left">
                        <p>60&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Same as Tac</p>
                     </c>
                     <c ca="left">
                        <p>[84]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="BA000011.4">BA000011.4</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Uncultured methanogenic archaeon</p>
                     </c>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>?</p>
                     </c>
                     <c ca="left">
                        <p>Uncme</p>
                     </c>
                     <c ca="left">
                        <p>3.2</p>
                     </c>
                     <c ca="left">
                        <p>3085</p>
                     </c>
                     <c ca="left">
                        <p>37&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Methanogen isolated from rice rhizosphere</p>
                     </c>
                     <c ca="left">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AM114193.2">AM114193.2</ext-link>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nanoarchaeum equitans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Nanoarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>?</p>
                     </c>
                     <c ca="left">
                        <p>Naneq</p>
                     </c>
                     <c ca="left">
                        <p>0.5</p>
                     </c>
                     <c ca="left">
                        <p>536</p>
                     </c>
                     <c ca="left">
                        <p>80&#176;C</p>
                     </c>
                     <c ca="left">
                        <p>Obligate symbiont of the crenarchaeon <it>Ignicoccus</it></p>
                     </c>
                     <c ca="left">
                        <p>[30]</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AE017199.1">AE017199.1</ext-link>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>OGT, optimal growth temperature</p>
                  <p><sup>b</sup>Only the references that report the complete genome of the respective species and its initial analysis are cited</p>
               </tblfn>
            </tbl>
            <p>The archaeal COGs (arCOGs) were constructed using a new computational pipeline (Fig. <figr fid="F1">1</figr>) which is a substantial modification of the previously published procedures <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp>. Briefly, the pipeline consists of the initial step of the all-against-all comparison of the protein sequences encoded in the analyzed genomes; preliminary clustering to identify lineage-specific expansions (LSEs) of paralogs, genes that are inferred to have evolved by duplication after the divergence of the compared species; delineation of clusters of bidirectional best hits to form initial, crude COGs; iterative search of the rest of the archaeal protein databases to accrue potential diverged members of the COGs; minimum linkage clustering of the COGs; merge of related COGs with supplementary phyletic patterns to avoid oversplitting of fast evolving COGs; splitting of potentially overclumped COGs; the details of each of these procedures are given under Materials and Methods.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>A flow chart of the procedure employed for the construction of the arCOGs</p>
               </caption>
               <text>
                  <p>A flow chart of the procedure employed for the construction of the arCOGs. See Materials and Methods for the description of each step.</p>
               </text>
               <graphic file="1745-6150-2-33-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Coverage of archaeal genomes with arCOGs</p>
            </st>
            <p>Altogether, the process of arCOGs construction started with 91,951 proteins encoded in 41 archaeal genomes and ended with 80,963 of these proteins being included in 7,538 arCOGs (the arCOGs and accompanying materials are available online <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>). The fraction of the proteins encoded in a genome that belong to the COGs is a crucial number in the comparative-genomic analysis as it characterizes both the level of conservation and coherence between the analyzed genomes, and the potential for genome annotation by inference from homology. Already in the early COG analyses, with a small number of genomes included, it has been noticed that a substantial majority of the genes had orthologs in other genomes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. With the growth of the genome collection and the new, refined procedure for COG construction, the coverage of archaeal genomes further increased. Figure <figr fid="F2">2</figr> shows that, on average, the arCOGs described here cover 88% of the genes in an archaeal genome as compared to 76% with the previous release of COGs which included 69 genomes, among these, 13 archaea (for this comparison, the proteins from the 41 analyzed archaeal genomes were fit into the old COGs using the COGNITOR program). Predictably, the extra coverage was most pronounced for genomes that had close relatives within the analyzed set such as <it>Halobacteria</it>, <it>Pyrobaculi</it>, and <it>Methanosarcina</it>, but a substantial increase was seen across the entire set of genomes, with two notable exceptions, <it>Nanoarchaeum equitans </it>and, particularly, <it>Cenarchaeum symbiosum </it>(Fig. <figr fid="F2">2</figr>). In the case of <it>C. symbiosum</it>, somewhat paradoxically, the coverage with the old COGs was even somewhat greater than with the new arCOGs (Fig. <figr fid="F2">2</figr>). The reasons behind the poor coverage of these two genomes are clear. Both <it>N. equitans </it>and <it>C. symbiosum </it>have no close relatives in the current collection of archaeal genomes; in addition, <it>N. equitans </it>appears to be a very fast-evolving lineage <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> whereas <it>C. symbiosum </it>is a symbiotic crenarchaeon that seems to have acquired lots of bacterial genes and possesses a number of unique gene families <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, which leads to a poor representation in ar COGs (see also below). In addition to the increased coverage, the new arCOGs appear to provide a better resolution of orthologous relationships than the existing COG set: 719 of the old COGs <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> were split into two or more arCOGs, and the mean number of genes from an archaeal genome per COG (species-specific paralogs) dropped from 1.65 in the COGs (36% clusters with no paralogs) to 1.34 in arCOGs (58% clusters with no paralogs).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Coverage of archaeal genomes with arCOGs and COGs</p>
               </caption>
               <text>
                  <p>Coverage of archaeal genomes with arCOGs and COGs. Cyan, ArCOGs, purple, COGs. Abbreviations are as in Table 1.</p>
               </text>
               <graphic file="1745-6150-2-33-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phyletic patterns, conserved cores and variable shells of archaeal genomes</p>
            </st>
            <p>In an early comparative-genomic study of the archaea, we developed the notion of a conserved core of genes that are shared by all or the substantial majority of the genomes and, by inference, are likely to be essential for the cell function, as opposed to the variable "shell" of genes that show diverse distributions among species and, accordingly, appear to be subject to lineage-specific gene loss and horizontal gene transfer (HGT) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The current analysis of a much larger collection of archaeal genomes provides for a refinement of these concepts. Figure <figr fid="F3">3</figr> shows the distribution of the number of archaeal species in the arCOGs. Obviously, in quantitative terms, arCOGs with a small number of species (&lt;6) dominate the collection; the distribution is, essentially, an exponential decay curve, with a rise at the left end (40 or 41 species), which corresponds to the archaeal genomic core (see below), and a bump at 15 species which correspond to the 15 available genomes of methanogens. More formally, assuming that the distribution is described by an exponent(s), the best approximation is achieved with a sum of three exponential functions (Fig. <figr fid="F3">3</figr>). The first exponent can be construed to represent the conserved gene core (~230 arCOGs), the second one describes the "shell" of moderately common genes (~2200 arCOGs), and the third one corresponds to the "ORFans" (~5200 arCOGs), which include a small number of (typically, but not necessarily, closely related) species.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Distribution of the number of species in arCOGs: three classes of archaeal genes</p>
               </caption>
               <text>
                  <p>Distribution of the number of species in arCOGs: three classes of archaeal genes. A semi-logarithmic plot fitted with a sum of 3 exponents</p>
               </text>
               <graphic file="1745-6150-2-33-3"/>
            </fig>
            <p>The notion of a phyletic pattern which is, simply, the pattern of presence-absence of a COG in the analyzed set of a species, has been developed in the original COG study <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>and, independently, by others <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Subsequently, phyletic patterns have been extensively employed both for functional prediction and as starting material for evolutionary reconstruction (e.g. <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>). Figure <figr fid="F4">4</figr> shows the distribution of the phyletic patterns in the new set of arCOGs. The decay of the curve is remarkably steep, i.e., a substantial majority of the patterns (2654 of 3192) are unique, that is, represented by one arCOG only. Examination of the list of the top 10 widespread arCOGs is particularly instructive (Table <tblr tid="T2">2</tblr>). In this list, 9 patterns are "trivial", i.e., represented in multiple species of a compact monophyletic group, such as Methanosarcinales or Halobacteriales. The single exception is the "all" pattern which describes the strictly defined core of 165 archaeal genes represented in all currently sequenced genomes. The most common relatively "non-trivial" pattern is the one that includes arCOGs represented in all species except for <it>N. equitans </it>(50 arCOGs); again, this is hardly unexpected given the small number of genes in <it>N. equitans</it>, suggesting massive gene loss. Although phyletic patterns provide only a crude assessment of the relationship between the compared genomes and caution is due, such that too sweeping conclusions on evolution are not drawn solely from the inspection of these patterns, some conjectures from the trend seen in Figure <figr fid="F4">4</figr> and in Table <tblr tid="T2">2</tblr> appear straightforward. The uniqueness of most of the phyletic patterns suggests that emergence of new families in individual lineages, lineage-specific gene loss, and HGT are all major forces of archaeal evolution. However, the absence of common non-trivial patterns suggests that distinct "highways" of HGT <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>do not shape archaeal evolution.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The 10 most common phyletic patterns in the arCOGs</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Lineage</p>
                     </c>
                     <c ca="left">
                        <p>Species<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Number of arCOGs</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mathanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metac, Metba, Metma</p>
                     </c>
                     <c ca="left">
                        <p>239</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Halma, Halsp, Halwa, Netph</p>
                     </c>
                     <c ca="left">
                        <p>204</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sulfolobales</p>
                     </c>
                     <c ca="left">
                        <p>Sulac, Sulso, Sulto</p>
                     </c>
                     <c ca="left">
                        <p>192</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>All</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>All 41</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>166</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Thermoproteales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrae, Pyrca, Pyris, Thete</p>
                     </c>
                     <c ca="left">
                        <p>162</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Thermococcales</p>
                     </c>
                     <c ca="left">
                        <p>Pyrab, Pyrfu, Pyrho, Theko</p>
                     </c>
                     <c ca="left">
                        <p>142</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Methanosarcinales</p>
                     </c>
                     <c ca="left">
                        <p>Metac, Metba</p>
                     </c>
                     <c ca="left">
                        <p>126</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Methanococcales</p>
                     </c>
                     <c ca="left">
                        <p>MetmC, Metmp</p>
                     </c>
                     <c ca="left">
                        <p>105</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Halobacteriales</p>
                     </c>
                     <c ca="left">
                        <p>Halma, Halwa</p>
                     </c>
                     <c ca="left">
                        <p>99</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Thermoplasmales</p>
                     </c>
                     <c ca="left">
                        <p>Picto, Theac, Thevo</p>
                     </c>
                     <c ca="left">
                        <p>96</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Abbreviations are as in Table 1.</p>
               </tblfn>
            </tbl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Distribution of phyletic patterns by the number of arCOGs</p>
               </caption>
               <text>
                  <p>Distribution of phyletic patterns by the number of arCOGs. A log-log plot.</p>
               </text>
               <graphic file="1745-6150-2-33-4"/>
            </fig>
            <p>The 166 arCOGs that comprise the strictly defined core of the archaeal genomes, as well as the core gene sets of Euryarchaeota and Crenarchaeota, are, for obvious reasons, of special interest. First, it has to be noticed that the euryarchaeal core (282 arCOGs) and the crenarchaeal core (336 arCOGs) are not dramatically larger than the pan-archaeal core, emphasizing the high prevalence of gene loss and gain (on many occasions, via HGT) during the evolution of archaeal genomes. Along the same lines, but more unexpectedly, the euryarchaeal and crenarchaeal genomic signatures, i.e., the sets of arCOGs that are represented in all species in one group but not found in any species of the other group, consist of only one and three arCOGs, respectively. In agreement with previous observations, a breakdown of the functional assignments (according to the broad functional categories associated with the COGs <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>) reveals dramatic differences between the overall set of arCOGs and the core sets (Fig. <figr fid="F5">5</figr>). Indeed, each of the core sets is dominated by proteins functioning in the translation system, ribosome biogenesis, and tRNA modification, with additional significant contributions from other information-processing systems (basal transcription and replication). Moreover, even for the few core archaeal genes that remain experimentally uncharacterized, roles in translation and RNA modification could be predicted on the basis of the analysis of domain organization and operonic context (see <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>; KSM and EVK, unpublished). In a stark contrast, in the overall arCOG set, the informational functions are quantitatively minor whereas metabolic functions are dominant (Fig. <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Functional breakdown of the entire set of arCOGs and the three core sets</p>
               </caption>
               <text>
                  <p>Functional breakdown of the entire set of arCOGs and the three core sets. EA, Euryarchaea, CA, Crenarchaea.</p>
               </text>
               <graphic file="1745-6150-2-33-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Applications of arCOGs for evolutionary genomics of archaea: gene-content tree, evolutionary reconstructions, and putative phylogenetic of core and shell genes</p>
            </st>
            <p>From the inception of the COG methodology, it had been realized that COGs have potential for straightforward evolutionary-genomic applications. One of these is the construction of gene-content trees whereby the phyletic patterns of COGs are converted into a distance matrix between the analyzed genomes, with an appropriate normalization for genome size <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B40">40</abbr></abbrgrp>(see Materials and Methods).</p>
            <p>We used the phyletic patterns of the arCOGs as the input to produce a gene-content tree for the 41 analyzed archaeal species. Gene-content trees are known to reflect combination of bona fide phylogenetic relationships, horizontal gene flows, and life style differences between organisms leading to parallel gene loss. It appears that the archaeal gene-content tree carries a substantial phylogenetic signal (Fig. <figr fid="F6">6</figr>). The tree supports the major phylogenetic divisions within the archaea, i.e., the monophyly of Euryarchaeota and Crenarchaeota, and most of the branches within each of these divisions. However, at least three aspects of this tree deserve special attention. Firstly, the tree has methanogens as a clade within the Euryarchaeota. Regular, sequence-based phylogenetic analyses tend to break the methanogens into two or three clades, namely, methanococcales-methanothermobacteriales, methanosarcinales (typically, joined with halobacteriales), and <it>Methanopyrus kandleri</it>. The phylogenetic position of <it>M. kandleri </it>remains uncertain although monophyly with <it>Methanococcales </it>and <it>Methanobacteriales </it>is likely <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>, the placement of <it>Methanosarcinales </it>apart from the rest of the methanogens appears to be solidly supported <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Most likely, the aggregation of the methanogens in the gene-content tree in Fig. <figr fid="F6">6</figr> is caused by the shared genes encoding proteins involved in methanogenesis which might have spread both vertically and horizontally. Secondly, <it>N. equitans </it>is placed deeply within the <it>Thermococcales </it>branch. The initial phylogenetic analysis has been interpreted to indicate that this unusual organism was a basal, ancestral archaeal branch <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. However, a subsequent reappraisal suggested that the basal position of <it>N. equitans </it>was a long branch attraction artifact, and the correct placement of <it>N. equitans </it>should be with the <it>Thermococcales </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>; remarkably, the tree content analysis is best compatible with this hypothesis. Thirdly, the single available genome of a mesophilic crenarchaeon, <it>C. symbiosum</it>, falls within the euryarchaeal part of the tree; the definitive resolution of the phylogenetics affinities of mesophilic Crenarchaeota requires a representative collection of genomes but the present observations already indicate that <it>C. symbiosum </it>is not a typical crenarchaeon. At least, in part, this position of <it>C. symbiosum </it>in the gene-content tree could be explained by acquisition of euryarchaeal genes via HGT <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>The gene-content tree of archaea constructed on the basis of the phyletic patterns of arCOGs</p>
               </caption>
               <text>
                  <p>The gene-content tree of archaea constructed on the basis of the phyletic patterns of arCOGs. The species abbreviations are as in Table 1. Cren, Crenarchaeota; Eury, Euryarchaeota.</p>
               </text>
               <graphic file="1745-6150-2-33-6"/>
            </fig>
            <p>We then addressed the reverse problem, namely, reconstruction of the history of gene gain and loss in archaea given a particular phylogenetic tree topology. On the account of the uncertainty of the deep branches in archaeal phylogeny, we chose to use a partially unresolved tree in which the relationship between several clades is presented as a multifurcation (Fig. <figr fid="F7">7</figr>). The reconstruction was, then, performed using a modification of the weighted parsimony method <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> that has been previously applied to the analysis of the evolution of several major groups of bacteria <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. With all the caveats in mind, the results of this reconstruction reveal notable trends of gene gain and loss in different lineages of archaea as well as features of the inferred ancestral forms. Predictably, the most massive gene loss is seen in <it>N. equitans </it>(a parasite with the smallest known genome among archaea), closely followed by <it>Thermoplasmales </it>(another group of heterotrophic archaea with small genomes) and <it>C. symbiosum </it>(a symbiotic archaeon that might have undergone a major life style shift). The lineages with the most gene gain include those with the largest genomes, namely, <it>Halobacteriales </it>and <it>Methanosarcinales</it>, and <it>Sulfolobales-Desulfurococcales</it>. More unexpectedly, substantial gene gain, along with major gene loss, was inferred also for <it>Thermococcales </it>and <it>Thermoplasmales</it>; apparently, these are groups with highly dynamic genomes.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>A reconstruction of gene gain and loss in archaea</p>
               </caption>
               <text>
                  <p>A reconstruction of gene gain and loss in archaea. Each branch is labeled by 3 numbers: black, the (inferred) number of arCOGs in the node to which the given branch leads; blue, number of arCOGs lost along the branch; red, number of arCOGs gained along the branch. The red circles on branches denote hyperthermophiles, and blue circles denote mesophiles and moderate thermophiles.</p>
               </text>
               <graphic file="1745-6150-2-33-7"/>
            </fig>
            <p>The present reconstruction maps almost 1000 archaeal genes to LACA and ~1300 and ~1400 genes to the ancestors of Crenarchaeota and Euryarchaeota, respectively (Fig. <figr fid="F7">7</figr>). These numbers are notable in that the ancestral gene sets appear to be not much smaller than the smallest genomes of the extant free-living archaea, such as <it>Thermoplasma </it>(Figs. <figr fid="F7">7</figr> and <figr fid="F8">8</figr>). It must be kept in mind that these numbers are low bounds for the gene content of the ancestral forms inasmuch as parsimony has a fundamental bias toward underestimating the amount of gene loss <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Thus, perhaps, unexpectedly, it appears that ancestral forms, including LACA, were not much simpler, at least in terms of genomic complexity but, most likely, also in their cellular organization than some modern forms. This conjecture is further supported by a more detailed examination of the genes (arCOGs) assigned to LACA (Table <tblr tid="T3">3</tblr>). In particular, the results of the reconstruction suggest that LACA was a hyperthermophile that possessed reverse gyrase, the principal hallmark of the hyperthermophilic lifestyle <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> and most of the other genes characteristic of hyperthermophiles (<abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and unpublished observations), and a chemoautotroph that had the genes to support membrane-based redox bioenergetics and all central biosyntheses (Table <tblr tid="T3">3</tblr>). Notably, the reconstruction also indicates that LACA already possessed some widespread functional systems of archaea that are not normally thought of as being ancestral including the CASS system of antiviral defense <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> and the predicted toxin-antitoxin system centered around the "minimal" nucleotidyltransferases (<abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and KSM, YIW, and EVK, unpublished).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Major features of the reconstructed gene set of LACA</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>COG class</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>No. of arCOGs</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Implication for LACA</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>J</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>152</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Translation</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Complete translation system and essentially complete set of enzymes for tRNA and rRNA modification</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>61</p>
                     </c>
                     <c ca="left">
                        <p>Ribosomal proteins</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>aaRS and related enzymes</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>K</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>55</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Transcription</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Moderately sophisticated transcription control</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>Transcription regulators</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>RNA polymerase subunits</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>L</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>61</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Replication, recombination and repair</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Advanced DNA replication and repair system</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>Topoisomerases</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>DNA polymerase subunits</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>C</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>84</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Energy production and conversion</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Membrane-based redox bioenergetics; partial TCA cycle</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>Pyruvate oxidation</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>TCA cycle</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>NADH dehydrogenase or Na+/H+ antiporter</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>V-type ATPase-ATP synthase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>G</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>33</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Carbohydrate transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Moderately sophisticated sugar metabolism</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>Glycolysis/Gluconeogenesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>E</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>108</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Amino acid transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Enzymes for the biosynthesis of all amino acids</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>72</p>
                     </c>
                     <c ca="left">
                        <p>Amino acid biosynthesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>F</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>49</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Nucleotide transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Enzymes for the biosynthesis of all nucleotides</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>Nucleotide biosynthesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>Nucleotide salvage</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>H</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>67</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Coenzyme transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Enzymes for the biosynthesis of all essential cofactors</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>Cofactor biosynthesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>I</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>25</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Lipid transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Fully developed membrane</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>Lipid biosynthesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>M</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>26</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Cell wall, membrane and envelope biogenesis</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Fully developed cell wall</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>P</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>48</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Inorganic ion transport and metabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Sophisticated ion uptake system</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Q</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>8</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Secondary metabolites biosynthesis, transport and catabolism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Limited or unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>N</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Cell motility</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Limited motility and/or conjugation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>O</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>47</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Posttranslational modification, protein turnover, chaperones</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Sophisticated system of protein fate control</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>Proteasome</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>D</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Cell cycle control</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Limited or unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>T</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>10</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Signal transduction mechanisms</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Limited use of bacterial type signal transduction system; original signal transduction</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>Serine/threonine kinase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>U</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>10</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Intracellular trafficking and secretion</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Fully developed secretion system</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>Preprotein translocase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>V</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>20</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Defense mechanisms</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Viruses abundant at LACA times</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>including:</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>CASS proteins</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>R, S</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>183</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Poorly characterized or unknown</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Low-bound reconstructions for ancestral archaeal forms: genomes close in size to modern hyperthermophiles</p>
               </caption>
               <text>
                  <p>Low-bound reconstructions for ancestral archaeal forms: genomes close in size to modern hyperthermophiles. Each column shows the total number of annotated protein-coding genes in the respective archaeal species; the colored portions (green for Crenarchaeota, blue for Euryarchaeota, and cyan for Nanoarchaeota) show genes included in arCOGs. The hatched columns show the number of arCOGs assigned to LACA, the Last CrenArchaeal Common Ancestor (LCACA) and the Last EuryArchaeal Common Ancestor (LEACA).</p>
               </text>
               <graphic file="1745-6150-2-33-8"/>
            </fig>
            <p>Finally, we attempted to obtain a crude breakdown of the phylogenetic affinities of the arCOGs, in particular, those that form the archaeal gene core (see above) and those that were assigned to LACA. Because a comprehensive phylogenomic analysis is beyond the scope of this paper (and has major potential for its own share of artifacts), this was done by analyzing the taxonomic breakdown of the proteins that are most similar to the representatives of a given arCOG as detected in BLAST searches. In order to eliminate potential effects of HGT, a special protocol was developed to identify coherent affinities, e.g., a bacterial affinity was assigned only when the proteins from a given arCOG had best hits in multiple bacterial species (see Materials and Methods). We are fully aware of the limitations of such methodology that, at best, gives a crude approximation of the true phylogeny <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> but we also note that this type of analysis can reveal highly meaningful patterns in comparative-genomic data (e.g. <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>). This analysis revealed striking differences between the overall set of arCOGs, LACA, and the core sets (Fig. <figr fid="F9">9</figr>). Clearly, the overall set is dominated by "bacterial" and archaea-specific genes, with a small fraction of "eukaryotic" genes; this fraction is somewhat greater in the inferred gene set of LACA but the dominance of "bacterial" genes remains obvious. The core sets, especially, the 166 genes shared by all sequenced archaeal genomes, present a stark contrast in that they are dominated by "eukaryotic" genes (Fig. <figr fid="F9">9</figr>).</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Taxonomic affinities of ArCOGs with bacteria and eukaryotes</p>
               </caption>
               <text>
                  <p>Taxonomic affinities of ArCOGs with bacteria and eukaryotes. For the criteria of taxonomic assignments, see Materials and Methods.A, archaea, B, bacteria, E, eukaryotes.</p>
               </text>
               <graphic file="1745-6150-2-33-9"/>
            </fig>
            <p>Comparing these observations with those presented in Figs. <figr fid="F3">3</figr> and <figr fid="F5">5</figr>, one comes to the conclusion that, quantitatively, archaeal genomes are dominated by the relatively mobile "shell" genes that belong to the common prokaryotic gene pool and encode the overwhelming majority of metabolic, structural, and signal transduction functions; a sharp contrast is presented by the stable, archaeo-eukaryotic core of information-processing genes. These quantitative conclusions, even if based on a crude analysis, are in a good agreement with the early observations on the bimodal distribution of the taxonomic affinities of archaeal genes <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, the subsequent observations on the affinities of eukaryotic genes <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B53">53</abbr></abbrgrp>, and the complexity hypothesis which posited distinct evolutionary fates of information and operational genes <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The arCOGs, which are expected to be updated as genome sequencing progresses, are a resource for genome annotation of the newly sequenced archaeal genomes and the refinement of the existing annotations, as well as evolutionary reconstructions. Crude reconstructions presented here indicate that the ancestral archaeal forms, including LACA, probably, were full-fledged prokaryotes, of approximately the same level of complexity as the simplest of the modern free-living archaea.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Construction of archaeal COGs</p>
            </st>
            <p>Protein sets for 40 completely sequenced genomes of Archaea were downloaded from the NCBI FTP site <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> or from the RefSeq section of GenBank (<it>Caldivirga maquilingensis </it>IC-167, <it>Cenarchaeum symbiosum </it>and Uncultured methanogenic archaeon). Protein sequences of <it>Thermoproteus tenax </it>were kindly provided by Bettina Siebers with permission from the sequencing consortium. The procedure of COG construction involved the following steps.</p>
            <p>1. All-against-all BLAST <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> search was used to establish the similarity relationships between the archaeal proteins. Lineage-specific expansions of paralogs were identified essentially as described previously <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. Initial clusters based on triangles of symmetrical best hits were constructed using a modified COG algorithm <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp>; the major difference in the current implementation was the strict symmetry requirement for the "best hit" relationship between proteins. This constraint lowers the number of false-positives but, in the presence of paralogs, leads to substantial underclustering <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>; this was rectified on the subsequent steps.</p>
            <p>2. Multiple alignments of the initial cluster members were constructed using the MUSCLE program <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>; alignments were used to construct PSSMs for a PSI-BLAST search <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> against the database of Archaea proteins with the e-value threshold of 0.01; proteins (domains) were added to the corresponding best-scoring original clusters resulting in a set of expanded clusters.</p>
            <p>3. Sequences of expanded cluster members were aligned using MUSCLE, and the PSSMs constructed from these alignment were used for a second round of PSI-BLAST search against the database of archaeal proteins. The search results were used to construct a similarity graph for the relationships between the expanded clusters. Formally, all statistically significant (e&lt;0.01) hits in a search with the PSSM for a particular cluster were classified according to the cluster they belong to; clusters in the hit list were ranked according to the mean score across their members (members missing from the hit list were assigned an arbitrary score 2 bits below the significance threshold). An edge between the <it>i</it>-th and the <it>j</it>-th clusters was given weight equal to the lowest rank among the <it>i</it>&#8594;<it>j </it>and <it>j</it>&#8594;<it>i </it>relationships (i.e., if cluster <it>j </it>is the top-ranking hit when cluster <it>i </it>is the query but cluster <it>i </it>is the third-ranking hit for cluster <it>j</it>, then the edge connecting <it>i </it>and <it>j </it>is given the rank of 3). Connected components were extracted from the graph; pairs of nodes within a connected component were assigned an edge with a rank of infinity if they were not connected directly. A minimum-linkage clustering procedure was applied to the connected sets of clusters (if cluster <it>i </it>and <it>j </it>are merged, the edge between cluster <it>k </it>and the node, representing the merged clusters, is given the rank equal to the lowest rank of <it>k</it>-<it>i </it>and <it>k</it>-<it>j </it>edges), resulting in a rooted dendrogram of relationships between the clusters. Then each node on on the tree was labeled with the number of species that were present in all descendant clusters. Two rules were used to determine if the descendant clusters should be merged: i) if species-coverage of the node is at least 50% greater than that of any of the descendant nodes and ii) if, among the descendants of a node, one is species-rich and the other one is species-poor (formally, if <it>s</it><sub><it>i</it></sub>>20<it>s</it><sub><it>j</it></sub>/(10-<it>s</it><sub><it>j</it></sub>) where <it>s</it><sub><it>i </it></sub>and <it>s</it><sub><it>j </it></sub>stand for the species-coverage of the species-rich and species-poor descendant nodes, respectively).</p>
            <p>4. In parallel to the above procedures, a BLAST search against the COG 2003 database was performed, followed by using a modified COGNITOR program <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp> to assign archaeal proteins to prokaryotic COGs. Merged clusters with proteins assigned to different COGs were split into COG-specific clusters to avoid clustering of paralogous proteins that previously have been assigned to different curated COGs.</p>
         </sec>
         <sec>
            <st>
               <p>Reconstruction of gene gain and loss events during the evolution of Archaea</p>
            </st>
            <p>Reconstruction of gene gain and loss during the evolution of Archaea was performed using a modified weighted parsimony approach <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> implemented in a two-pass algorithm. First, a coarse-resolution multifurcating species tree was compiled from several single-gene phylogenetic reconstructions and taxonomic data. For each arCOG, the phyletic pattern indicating the presense/absence of the respective gene in each analyzed species was mapped onto the leaves of the tree. The first pass is performed in the leaves-to-root direction, and the number of descendant nodes containing the given gene is counted for each internal tree node. If this number is greater than or equal to the first (generally, more stringent) threshold, which is set for each node individually, the node is assigned state "1" (presence of the gene), otherwise it is assigned state "0" (absence of the gene). In the second pass, which is performed in the opposite, root-to-leaves direction, if the gene is absent in the given node (state "0") but present in its ancestor and the number of descendant nodes carrying this gene is greater than or equal to the second (generally more relaxed) threshold, the node is assigned state "1". For the guide tree and the thresholds, see <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Reviewers' reports</p>
         </st>
         <sec>
            <st>
               <p>Reviewer 1: Peer Bork, European Molecular Biology Laboratory</p>
            </st>
            <p>The paper describes the construction of orthologous group for archea.</p>
            <p>Given the success of the COGs and KOGs (a subset for eukaryotes with higher resolution) and the inability of current purely automatic procedures to produce reliable orthologus groups and, very importantly, their reliable functional annotation, I see this as an important resource for various studies. Furthermore, it uses a semi-automatic procedure that includes some clever guiding principles e.g. it takes into account phylogenetic gene presence patterns. The average coverage of 88% at a higher resolution than the current 76% COG coverage of genes in archeal genomes is another noteworthy and useful feature. As far as I can see, the arCOGs are of high quality and I look forward to use them.</p>
            <p>There is no comparison to more recent orthology-built procedures, but I assume that this semi-automatic procedure presented here provides a more accurate picture than purely automatic methods.</p>
            <p>The only concerns I have are availability/formate issues and some minimalistic Figure captions. Both should be easy to solve.</p>
            <p>Taken together, I congratulate the authors for this nice, important and very useful piece of work.</p>
            <p>
               <b>Authors' response:</b>
               <it>The formats of the files on the ftp site were modified to increase transparency, and an extended README file was added. We hope this imporves accessibility which is, indeed, crucial. The figure captions were amended.</it>
            </p>
         </sec>
         <sec>
            <st>
               <p>Reviewer 2: Patrick Forterre, Institut Pasteur and Universit&#233; Paris-Sud</p>
            </st>
            <p>The &#171;easy to use&#187; COG database has been especially useful for the biological community. It has helped to improve the quality of genome annotation and has been widely adopted by non bioinformatic experts to perform preliminary rounds of comparative genomic analysis. The main problem with such popular database is the delay in their updating, a daunting task considering the current avalanche of completely sequenced genomes. The present paper by Kira Makarova and colleagues reports a much welcome update of the COG database that focus on archaea (arCOGs). The number of completely sequenced archaeal genomes remains quite low (compared to the situation with bacteria) allowing an exhaustive analysis that remains to be done for bacteria and eukarya. The arCOGs database will be for sure an extremely important source of information for the community working on archaea and for all scientists interested in comparative genomics and microbial evolution. The new analysis corresponds to a substantial increase in information compared to previous one, since around 40% of arCOGs are new.</p>
            <p>In addition to the description of the arCOGs database, the paper by Kira Makarova and co-workers present several analyses that bring new (or update) data and raise several interesting evolutionary questions. In particular, they have built a gene-content tree based on the presence-absence of arCOGs in archaeal genome and estimated the evolution of the archaeal genome content along the evolutionary tree based on a gene loss and gain analysis. They reported several intriguing observations that are worth to be discussed in the framework of current debates on archaeal phylogeny and on the nature of the last universal archaeal ancestor.</p>
            <p>Makarova and co-workers noticed that the number of strictly specific euryarchaeal and crenarchaeal proteins is very low (one and three, respectively). This seems to strongly argue in favour of the monophyly of Archaea (against the &#171;eocyte&#187; hypothesis). However, it should be interesting to present a slightly &#171;relaxed&#187; version of these cores, by allowing for the possibility for a protein to be missing in a group of related archaea (something quite frequently observed, for instance the lack of the euryarchaeal histone in Thermoplasmatales). More generally, it could be interesting in the future to define a category of conserved arCOGs (carCOGs?) present in all members of at least two archaeal orders in order to discriminate between ORFans arCOGs that are only present in one order (probably &#171;recently&#187; introduced by lateral gene transfer) and arCOGs of probable ancient origin that can tell us something about the evolutionary relationships between the diverse archaeal orders. It should be then interesting to determine if the distribution of such carCOGs correlate with the archaeal phylogeny based on various evolutionary markers.</p>
            <p>The parasitic archaeon <it>Nanoarchaeum equitans </it>lacks the larger number (50) of universal arCOG, confirming that this archaeon probably evolved by &#171;genome reduction&#187;. Some authors have suggested that <it>N. equitans </it>is a primitive organism. I suspect that there is a relatively high percentage of these 50 proteins that have homologues in Bacteria or Eukarya. This could be indicated as an argument in favour of the reduction scenario <it>versus </it>the "old nano" hypothesis! Interestingly, the gene content tree based on arCOGs groups <it>N. equitans </it>with Thermococcales among Euryarchaeota. Although gene-content trees can be sometimes highly biased by lateral gene transfer, this observation is in good agreement with a preliminary global analysis based on best BLAST-hits and refined phylogenies based on proteins of the small ribosomal subunits, reverse gyrase, Topo VI and elongation factors (Brochier et al.2005). This confirms that <it>N. equitans </it>should not be considered as a member of a new archaeal phylum (as already widely found in text-books!!) but as an odd member of the Euryarchaeota, probably, distantly related to Thermococcales.</p>
            <p>Another puzzling observation is the grouping of <it>Cenarchaeum symbiosum </it>with euryarchaea in the gene-content tree. Interestingly, the COG coverage is quite similar for all archaeal genomes (around 88%) except for <it>C. symbiosum </it>and <it>N. equitans</it>. This can be explained by genome reduction in the case of <it>N. equitans</it>, but not in the case of <it>C. symbiosum </it>whose genome has a &#171;normal&#187; size. Significantly, the authors reported that the coverage of <it>C. symbiosum </it>genome with the old COGs was greater than with the new arCOGs! This indicates that this genome contains COGs present in Bacteria or <it>Saccharomyces cerevisiae </it>but not in any other archaeon. The proposed explanation is that <it>C. symbiosum </it>is a symbiotic crenarchaeon that has acquired lots of bacterial genes. An alternative hypothesis is that <it>C. symbiosum </it>is not a crenarchaeon after all, but represents an early branching archaeal phylum that contains bacterial and archaeal homologues that have been lost in other archaea.</p>
            <p>From their reconstruction of gene loss and gain events, Makarova and co-workers suggest that the last Universal archaeal ancestor (LACA) was a hyperthermophile and a chemo-litoautotrophe with a minimal number of genes around 1000. They conclude that LACA might have been (nearly) as advanced as modern archaeal hyperthermophiles and found this conclusion quite &#171;unexpected&#187;. I am not so surprised. It's a prejudice to think that ancestors are always simpler than present-day organisms and that ancient evolution always occurred toward more "complexity". There is no reason why reductive evolution, which has occurred so often in the evolution of modern cells, was not as pervasive in ancient time (Forterre and Philippe, 1999). In fact, an in-depth analysis of ribosomal protein distribution by Poch and co-workers already suggested a few years ago that the ribosome of LACA was probably more complex that the ribosome of any modern archaea (Lecompte et al., 2002).</p>
            <p><b>Authors' response: </b><it>We do not, exactly, disagree and certainly realize the importance of reductive evolution. Still, whether or not we should consider the reconstruction of a complex LACA surprising or not, depends on the perspective. Considering that LACA is supposed to be the common ancestor of one of the 3 domains of life, there might be some element of surprise in this observation. After all, at the earliest stages of the evolution of life, there must have been a dramatic increase in complexity. That this complexification stage, apparently, was over by the time the domain of life became distinct (very likely, the same will hold for bacteria) is, certainly, of note. Alternatively, it is conceivable that LACA is actually not as ancient as one might think but represents a more recent bottleneck in archaeal evolution such that there was a complexification stage after the onset of the archaeal domain but it is inaccessible by comparative genomics</it>.</p>
            <p>My only criticism of this paper is that the authors have taken a quite conservative view of archaeal phylogeny (only based on 16S rRNA) to analyse gene loss and gain along the archaeal history and to estimate the genome content of LUCA. Indeed, several features of their unresolved multifurcation tree are dubious.</p>
            <p><it>N. equitans </it>appears as an isolated lineages (a third phylum)</p>
            <p><it>C. symbiosum </it>is grouped with hyperthermophilic Crenarchaeota.</p>
            <p><it>Methanopyrus kandleri </it>is shown as an isolated branch</p>
            <p>In all these cases, the authors have chosen to follow the 16S rRNA tree, whereas careful analyses based on ribosomal proteins have shown that <it>Methanopyrus kandleri </it>most likely groups with <it>methanococcales </it>and <it>methanomicrobiales </it>(Brochier et al. 2004) and that N. <it>equitans </it>is at least sister-group of euryarchaea (if not of Thermococcales). As previously indicated, the grouping of <it>C. symbiosum </it>with crenarchaea could be also highly misleading. It should have been interesting to compare the genome content of LACA based on the 16S rRNA phylogeny and the more robust phylogeny based on ribosomal proteins. My feeling is that the nature of LACA (chemo-litoautotroph or not, hyperthermophile or not?) is still a pending question.</p>
            <p><b>Authors' response: </b><it>We have not really followed the 16S RNA tree but rather deliberately chose a poorly resolved topology so as not to subscribe to any particular phylogenetic hypothesis with respect to issues that are still considered unresolved. We are well aware of the published work on archaeal phylogenies and the two important papers by Brochier et al. are cited. Out of fairness, the likely position of Methanopyrus with Methanococcales and Methanobacteriales, was first reported in Slesarev et al. in 2002, and this cited as well. The wording on Methanopyrus in the text was modified to reflect these reports but we did not modify the tree in </it>Fig. <figr fid="F7">7</figr>. <it>One has to keep in mind that the reconstruction here is by no means supposed to be the final word on the scenario of archaeal evolution but more of an exercise showcasing the utility of the arCOGs. We expect that there will be many more iterations with more genomes, better resolved trees, and better methods of reconstruction, and we certainly hope to be involved</it>.</p>
            <p>Finally, in the discussion of the gene-content tree, the authors wrote &#171;<it>methanogenesis which are spread both vertically and horizontally</it>&#187;. In fact, a detailed phylogenetic analysis of genes involved in methanogenesis by Bapteste and co-workers has shown that, surprisingly, although these proteins can be considered as &#171;operational&#187; they have been only transmitted by vertical inheritance in the archaeal domain (Bapteste et al., 2005).</p>
            <p><b>Authors' response: </b><it>We believe that the issue is not quite resolved yet. The wording in the paper was softened, nevertheless</it>.</p>
            <p>Bapteste E, Brochier C, Boucher Y.</p>
            <p>Higher-level classification of the Archaea: evolution of methanogenesis and methanogens.</p>
            <p>Archaea.1, 353&#8211;363 (2005).</p>
            <p>Brochier, C. Forterre P. and Gribaldo S.</p>
            <p>Archaeal phylogeny based on proteins of the transcription and translation machineries: tackling the <it>Methanopyrus kandleri </it>paradox</p>
            <p>Genome Biology, 5, R17 (2004).</p>
            <p>Brochier, C., Gribaldo, S., Zivanovic, Y. Confalonieri, F. and Forterre, P.</p>
            <p>Nanoarchaea: representative of a novel archaeal phylum or a fast evolving euryarchaeal lineage related to Thermococcales?</p>
            <p>Genome Biology, 6:R42 (2005).</p>
            <p>Forterre, P. and Philippe, H</p>
            <p>Where is the root of the universal tree of life?</p>
            <p>Bioessays, 21, 871&#8211;879 (1999).</p>
            <p>Lecompte O, Ripp R, Thierry JC, Moras D, Poch O.</p>
            <p>Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale.</p>
            <p>Nucleic Acids Res., 30, 5382&#8211;5390 (2002).</p>
         </sec>
         <sec>
            <st>
               <p>Reviewer 3: Purificaci&#243;n L&#243;pez-Garc&#237;a, CNRS, Universit&#233; Paris-Sud</p>
            </st>
            <p>This article describes the analysis of genes present in most of the currently available archaeal genome sequences in view of their classification in clusters of orthologous genes specific to the archaea (arCOG). It represents an updated extension of previous comparative genomic analyses of COGs though exclusively devoted to the archaea. As a consequence, the arCOG database produced is more refined, resulting in an increased coverage and resolution. The latter is reflected in the numerical increase of specific archaeal COGs and the accompanying decrease in the number of clusters containing paralogs. The comparison of arCOGs thus defined allows to infer the presence of ~166 core arCOGs, which were likely present in the last archaeal common ancestor (LACA), while 282 and 336 arCOGs appear ancestral to the euryarchaeotal and crenarchaeotal branches, respectively. From the nature of the core arCOGs, the authors conclude that the LACA was a rather complex hyperthermophilic chemoautotroph possessing ~1000 genes. Differential gene gain and loss are predicted to have occurred in the two major archaeal branches. The pattern of arCOG distribution in the different archaeal genomes is used to reconstruct a gene-content tree. Despite biases that may be associated to this approach, which are cautiously recognized by the authors, the tree obtained is largely congruent with widely accepted archaeal molecular phylogenies. Interestingly, <it>Nanoarchaeum equitans </it>is placed within the Thermococcales in agreement with recent detailed phylogenetic analyses, reinforcing the idea that the basal placement of <it>N. equitans </it>in some trees was due to long-branch attraction artifacts. The two major differences of this gene-content tree with respect to previous accepted molecular phylogenies for the archaea are that all methanogenic euryarchaeota, normally split in at least two large groups in molecular phylogenies, cluster together as they share a large number of methanogenesis-related genes, and that <it>Cenarchaeum symbiosum </it>is placed within the Euryarchaota, in disagreement with its expected position within the Crenarchaeota. Although the type of analyses carried out is not innovative, the new arCOG database presented here will certainly be very useful to improve future genome annotations.</p>
            <p>I have only a few minor comments or suggestions, as follows:</p>
            <p>-<it>First, it has to be noticed that the euryarchaeal core (282 arCOGs) and the crenarchaeal core (336 arCOGs) are not dramatically larger than the pan-archaeal core, emphasizing the general volatility of archaeal genomes</it>.</p>
            <p>The affirmation that 282 and 336 arCOGs <it>are not dramatically larger </it>than the 166 core arCOGs appears quite subjective. It is roughly twice the size. How does this compare with the situation in bacteria? It would be nice to include this information here, and even better, to relate/normalize this information to the average genetic distance in a reference conserved genetic marker, such as the 16S rRNA gene.</p>
            <p><b>Authors' response: </b><it>"Dramatic", certainly, is in the eye of the beholder. We believe the reader will see it that way, so no changes. Comparing to bacteria is dubious because there are no two major groups of bacteria emulating Euryarchaeota and Crenarchaeota. Calibration &#8211; complex exercise that goes beyond the scope of this paper</it>.</p>
            <p>Defining genome volatility would also be useful. Genome volatility has been defined in the literature as the mean volatility of all codons weighted by their frequency within the genome, codon volatility being a measurement related to the non-synonymous versus synonymous mutations (e.g. Dagan and Graur, Mol Biol Evol 2004, 22:496). I believe the meaning is more informal and vague here, and also subjective. Can you provide a reference showing that archaeal genomes are "volatile"?</p>
            <p><b>Authors' response: </b><it>Good point, we changed the wording to avoid any wrong connotations, "volatility" is not used anymore</it>.</p>
            <p>Horizontal gene transfer from bacteria has apparently contributed to shape the <it>C. symbiosum </it>genome. In page 14, it is mentioned that <it>C. symbiosum </it>falls within the euryarchaeotal part of the gene-content tree. Would you predict that HGT from euryarchaeota may partly explain this observation as some (although very limited) environmental genomic studies appear to suggest (Lopez-Garcia, Brochier et al, Environ Microbiol 2004, 6:19?</p>
            <p><b>Authors' response: </b><it>Yes, a valid point, we included this possibility in the revision and cite the paper</it>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>KSM performed the bulk of comparative genome analysis and contributed to the design of computational procedures and algorithms; YIW designed the algorithms and computational procedures, and contributed to genome analysis and software development; AS contributed to software development; EVK initiated the project, contributed to the design of the computational procedure, and wrote the manuscript. All authors read and approved the final version of the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the Intramural Research Program of the National Institutes of Health, National Library of Medicine.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Distinguishing homologous from analogous proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Systematic Zoology</source>
            <pubdate>1970</pubdate>
            <volume>19</volume>
            <fpage>99</fpage>
            <lpage>106</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2307/2412448</pubid>
                  <pubid idtype="pmpid">5449325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Orthologs, paralogs and evolutionary genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>39</volume>
            <fpage>309</fpage>
            <lpage>338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.39.073003.114725</pubid>
                  <pubid idtype="pmpid" link="fulltext">16285863</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evolution by gene duplication</p>
            </title>
            <aug>
               <au>
                  <snm>Ohno</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <publisher>Berlin-Heidelberg-New York , Springer-Verlag</publisher>
            <pubdate>1970</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The altered evolutionary trajectories of gene duplicates</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Katju</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>11</issue>
            <fpage>544</fpage>
            <lpage>549</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.09.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">15475113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>In Silico Biol</source>
            <pubdate>1998</pubdate>
            <volume>1</volume>
            <fpage>55</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11471243</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The balance of driving forces during genome evolution in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Kunin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>7</issue>
            <fpage>1589</fpage>
            <lpage>1594</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403731</pubid>
                  <pubid idtype="pmpid" link="fulltext">12840037</pubid>
                  <pubid idtype="doi">10.1101/gr.1092603</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Mirkin</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Fenner</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <issue>1</issue>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149225</pubid>
                  <pubid idtype="pmpid" link="fulltext">12515582</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-3-2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genomes in flux: the evolution of archaeal and proteobacterial gene content</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>1</issue>
            <fpage>17</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.176501</pubid>
                  <pubid idtype="pmpid" link="fulltext">11779827</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A phylogenomic approach to microbial evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>2</issue>
            <fpage>545</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29656</pubid>
                  <pubid idtype="pmpid" link="fulltext">11139625</pubid>
                  <pubid idtype="doi">10.1093/nar/29.2.545</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Metabolism and evolution of Haemophilus influenzae deduced from a whole- genome comparison with Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Borodovsky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rudd</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>1996</pubdate>
            <volume>6</volume>
            <issue>3</issue>
            <fpage>279</fpage>
            <lpage>291</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(02)00478-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">8805245</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A genomic perspective on protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>278</volume>
            <issue>5338</issue>
            <fpage>631</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.278.5338.631</pubid>
                  <pubid idtype="pmpid" link="fulltext">9381173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The COG database: new developments in phylogenetic classification of proteins from complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Garkavtsev</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Shankavaram</snm>
                  <fnm>UT</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>22</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29819</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125040</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.22</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The COG database: an updated version includes eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genome sequence and comparative analysis of the solvent-producing bacterium Clostridium acetobutylicum</p>
            </title>
            <aug>
               <au>
                  <snm>Nolling</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Breton</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Omelchenko</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Dubois</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hitti</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Sabathe</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Doucette-Stamm</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Soucaille</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>GN</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <issue>16</issue>
            <fpage>4823</fpage>
            <lpage>4838</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99537</pubid>
                  <pubid idtype="pmpid" link="fulltext">11466286</pubid>
                  <pubid idtype="doi">10.1128/JB.183.16.4823-4838.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Comparative genomics of the lactic acid bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Slesarev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mirkin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pavlov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pavlova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Karamychev</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Polouchine</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shakhova</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Grigoriev</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rohksar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lucas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Goodstein</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Hawkins</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Plengvidhya</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Welker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Diaz-Muniz</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dosti</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Smeianov</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wechter</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Barabote</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lorca</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Altermann</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Barrangou</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ganesan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rawsthorne</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tamir</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Breidt</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Broadbent</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hutkins</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>O'Sullivan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Steele</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Unlu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Saier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Klaenhammer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kozyavkin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weimer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mills</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>42</issue>
            <fpage>15611</fpage>
            <lpage>15616</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1622870</pubid>
                  <pubid idtype="pmpid" link="fulltext">17030793</pubid>
                  <pubid idtype="doi">10.1073/pnas.0607117103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>18</issue>
            <fpage>3442</fpage>
            <lpage>3444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">110752</pubid>
                  <pubid idtype="pmpid" link="fulltext">10982861</pubid>
                  <pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Genome alignment, evolution of prokaryotic genome organization and prediction of gene function using genomic context</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>356</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1619R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Connected gene neighborhoods in prokaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Murvai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Czabarka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Szekely</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>10</issue>
            <fpage>2212</fpage>
            <lpage>2223</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">115289</pubid>
                  <pubid idtype="pmpid" link="fulltext">12000841</pubid>
                  <pubid idtype="doi">10.1093/nar/30.10.2212</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>STRING 7--recent developments in the integration and prediction of protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chaffron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D358</fpage>
            <lpage>62</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669762</pubid>
                  <pubid idtype="pmpid" link="fulltext">17098935</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl825</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A phylogenetic approach to target selection for structural genomics: solution structure of YciH</p>
            </title>
            <aug>
               <au>
                  <snm>Cort</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Bash</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <issue>20</issue>
            <fpage>4018</fpage>
            <lpage>4027</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148669</pubid>
                  <pubid idtype="pmpid" link="fulltext">10497266</pubid>
                  <pubid idtype="doi">10.1093/nar/27.20.4018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Automatic clustering of orthologs and in-paralogs from pairwise species comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Storm</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>314</volume>
            <issue>5</issue>
            <fpage>1041</fpage>
            <lpage>1052</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.5197</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>OrthoMCL: identification of ortholog groups for eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stoeckert</snm>
                  <fnm>CJ</fnm>
                  <suf>Jr.</suf>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>9</issue>
            <fpage>2178</fpage>
            <lpage>2189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403725</pubid>
                  <pubid idtype="pmpid" link="fulltext">12952885</pubid>
                  <pubid idtype="doi">10.1101/gr.1224503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>eggNOG: automated construction and annotation of orthologous groups of genes</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Julien</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <inpress/>
            <note>[Epub ahead of print]</note>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Database resources of the National Center for Biotechnology Information</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chetvernin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>DiCuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Kapustin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Landsman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Souvorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yaschenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D5</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781113</pubid>
                  <pubid idtype="pmpid" link="fulltext">17170002</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl1031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Comparative genomics of Thermus thermophilus and Deinococcus radiodurans: divergent routes of adaptation to thermophily and radiation resistance</p>
            </title>
            <aug>
               <au>
                  <snm>Omelchenko</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Gaidamakova</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Matrosova</snm>
                  <fnm>VY</fnm>
               </au>
               <au>
                  <snm>Vasilenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhai</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>57</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1274311</pubid>
                  <pubid idtype="pmpid" link="fulltext">16242020</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-5-57</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The cyanobacterial genome core and the origin of photosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Mulkidjanian</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Burd</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kaznadzey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Haselkorn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>35</issue>
            <fpage>13126</fpage>
            <lpage>13131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1551899</pubid>
                  <pubid idtype="pmpid" link="fulltext">16924101</pubid>
                  <pubid idtype="doi">10.1073/pnas.0605709103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Comparative genomics of the Archaea (Euryarchaeota): evolution of conserved protein families, the stable core, and the variable shell</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <issue>7</issue>
            <fpage>608</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10413400</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Comparative genomics of Archaea: how much have we learned in six years, and what's next?</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>115</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193635</pubid>
                  <pubid idtype="pmpid" link="fulltext">12914651</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-8-115</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Evolutionary and functional genomics of the Archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <issue>5</issue>
            <fpage>586</fpage>
            <lpage>594</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.mib.2005.08.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">16111915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The genome of Nanoarchaeum equitans: insights into early archaeal evolution and derived parasitism</p>
            </title>
            <aug>
               <au>
                  <snm>Waters</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hohn</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Ahel</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Barnstead</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beeson</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Bibbs</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bolanos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kretz</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Mathur</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Podar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stetter</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Short</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Noordewier</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>22</issue>
            <fpage>12984</fpage>
            <lpage>12988</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">240731</pubid>
                  <pubid idtype="pmpid" link="fulltext">14566062</pubid>
                  <pubid idtype="doi">10.1073/pnas.1735403100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Nanoarchaea: representatives of a novel archaeal phylum or a fast-evolving euryarchaeal lineage related to Thermococcales?</p>
            </title>
            <aug>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gribaldo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Confalonieri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Forterre</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>R42</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1175954</pubid>
                  <pubid idtype="pmpid" link="fulltext">15892870</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-5-r42</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Archaeal Clusters of Orthologous Genes</p>
            </title>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Genomic analysis of the uncultivated marine crenarchaeote Cenarchaeum symbiosum</p>
            </title>
            <aug>
               <au>
                  <snm>Hallam</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Konstantinidis</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schleper</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sugahara</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Preston</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>de la Torre</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>DeLong</snm>
                  <fnm>EF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>48</issue>
            <fpage>18296</fpage>
            <lpage>18301</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1643844</pubid>
                  <pubid idtype="pmpid" link="fulltext">17114289</pubid>
                  <pubid idtype="doi">10.1073/pnas.0608549103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Microb Comp Genomics</source>
            <pubdate>1998</pubdate>
            <volume>3</volume>
            <issue>4</issue>
            <fpage>199</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10027190</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Who's your neighbor? New computational approaches for functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <issue>6</issue>
            <fpage>609</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/76443</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Potential genomic determinants of hyperthermophily</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>4</issue>
            <fpage>172</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00047-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">12683966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Genome phylogeny based on gene content</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>108</fpage>
            <lpage>110</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/5052</pubid>
                  <pubid idtype="pmpid" link="fulltext">9916801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Genome trees and the tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>9</issue>
            <fpage>472</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(02)02744-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12175808</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Highways of gene sharing in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Beiko</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Harlow</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>40</issue>
            <fpage>14332</fpage>
            <lpage>14337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242295</pubid>
                  <pubid idtype="pmpid" link="fulltext">16176988</pubid>
                  <pubid idtype="doi">10.1073/pnas.0504068102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Genome trees constructed using five different approaches suggest new major bacterial clades</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <issue>8</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60490</pubid>
                  <pubid idtype="pmpid" link="fulltext">11734060</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens</p>
            </title>
            <aug>
               <au>
                  <snm>Slesarev</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Mezhevaya</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Polushin</snm>
                  <fnm>NN</fnm>
               </au>
               <au>
                  <snm>Shcherbinina</snm>
                  <fnm>OV</fnm>
               </au>
               <au>
                  <snm>Shakhova</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Belova</snm>
                  <fnm>GI</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Stetter</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Malykh</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Kozyavkin</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>7</issue>
            <fpage>4644</fpage>
            <lpage>4649</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123701</pubid>
                  <pubid idtype="pmpid" link="fulltext">11930014</pubid>
                  <pubid idtype="doi">10.1073/pnas.032671499</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Archaeal phylogeny based on proteins of the transcription and translation machineries: tackling the Methanopyrus kandleri paradox</p>
            </title>
            <aug>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Forterre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gribaldo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>3</issue>
            <fpage>R17</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395767</pubid>
                  <pubid idtype="pmpid" link="fulltext">15003120</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-3-r17</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Comparative analysis of a genome fragment of an uncultivated mesopelagic crenarchaeote reveals multiple horizontal gene transfers</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Garcia</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Valera</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Environ Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>19</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1462-2920.2003.00533.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14686938</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Dollo parsimony and reconstruction of genome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Babenko</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Parsimony, Phylogeny, and Genomics</source>
            <publisher>Oxford , Oxford University Press</publisher>
            <editor>Albert VA</editor>
            <pubdate>2005</pubdate>
            <fpage>190</fpage>
            <lpage>200</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>A hot story from comparative genomics: reverse gyrase is the only hyperthermophile-specific protein</p>
            </title>
            <aug>
               <au>
                  <snm>Forterre</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>5</issue>
            <fpage>236</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(02)02650-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12047940</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Shabalina</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2006</pubdate>
            <volume>1</volume>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462988</pubid>
                  <pubid idtype="pmpid" link="fulltext">16545108</pubid>
                  <pubid idtype="doi">10.1186/1745-6150-1-7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>CRISPR provides acquired resistance against viruses in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Barrangou</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fremaux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Deveau</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boyaval</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moineau</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Romero</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>315</volume>
            <issue>5819</issue>
            <fpage>1709</fpage>
            <lpage>1712</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1138140</pubid>
                  <pubid idtype="pmpid" link="fulltext">17379808</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>DNA polymerase beta-like nucleotidyltransferase superfamily: identification of three new families, classification and evolutionary history</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <issue>7</issue>
            <fpage>1609</fpage>
            <lpage>1618</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148363</pubid>
                  <pubid idtype="pmpid" link="fulltext">10075991</pubid>
                  <pubid idtype="doi">10.1093/nar/27.7.1609</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The closest BLAST hit is often not the nearest neighbor</p>
            </title>
            <aug>
               <au>
                  <snm>Koski</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <issue>6</issue>
            <fpage>540</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11443357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>442</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(98)01553-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9825671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes</p>
            </title>
            <aug>
               <au>
                  <snm>Esser</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ahmadinejad</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wiegand</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sebastiani</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gelius-Dietrich</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Henze</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kretschmann</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Richly</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Leister</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Steel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>1643</fpage>
            <lpage>1660</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh160</pubid>
                  <pubid idtype="pmpid" link="fulltext">15155797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>4</issue>
            <fpage>619</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1997.4821861.x</pubid>
                  <pubid idtype="pmpid">9379893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The tree of one percent</p>
            </title>
            <aug>
               <au>
                  <snm>Dagan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>10</issue>
            <fpage>118</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1794558</pubid>
                  <pubid idtype="pmpid" link="fulltext">17081279</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-10-118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Horizontal gene transfer among genomes: the complexity hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Jain</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rivera</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Lake</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <issue>7</issue>
            <fpage>3801</fpage>
            <lpage>3806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">22375</pubid>
                  <pubid idtype="pmpid" link="fulltext">10097118</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.7.3801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>NCBI genomes</p>
            </title>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Lineage-specific gene expansions in bacterial and archaeal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Jordan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Spouge</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>555</fpage>
            <lpage>565</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311027</pubid>
                  <pubid idtype="pmpid" link="fulltext">11282971</pubid>
                  <pubid idtype="doi">10.1101/gr.GR-1660R</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The role of lineage-specific gene family expansion in the evolution of eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Lespinet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>7</issue>
            <fpage>1048</fpage>
            <lpage>1059</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186617</pubid>
                  <pubid idtype="pmpid" link="fulltext">12097341</pubid>
                  <pubid idtype="doi">10.1101/gr.174302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>5</issue>
            <fpage>1792</fpage>
            <lpage>1797</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390337</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034147</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Complete genome sequence of an aerobic hyper-thermophilic crenarchaeon, Aeropyrum pernix K1</p>
            </title>
            <aug>
               <au>
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haikawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jin-no</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ankai</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>1999</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>83</fpage>
            <lpage>101, 145-52</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/6.2.83</pubid>
                  <pubid idtype="pmpid" link="fulltext">10382966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>The genome of Hyperthermus butylicus: a sulfur-reducing, peptide fermenting, neutrophilic Crenarchaeote growing up to 108 degrees C</p>
            </title>
            <aug>
               <au>
                  <snm>Brugger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zibat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Redder</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ruepp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Awayez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>She</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
            </aug>
            <source>Archaea</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <issue>2</issue>
            <fpage>127</fpage>
            <lpage>135</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17350933</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Genome sequence of the hyperthermophilic crenarchaeon Pyrobaculum aerophilum</p>
            </title>
            <aug>
               <au>
                  <snm>Fitz-Gibbon</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Ladner</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>UJ</fnm>
               </au>
               <au>
                  <snm>Stetter</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>2</issue>
            <fpage>984</fpage>
            <lpage>989</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117417</pubid>
                  <pubid idtype="pmpid" link="fulltext">11792869</pubid>
                  <pubid idtype="doi">10.1073/pnas.241636498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>The genome of Sulfolobus acidocaldarius, a model organism of the Crenarchaeota</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Brugger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Skovgaard</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Redder</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>She</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Torarinsson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Greve</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Awayez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zibat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <issue>14</issue>
            <fpage>4992</fpage>
            <lpage>4999</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1169522</pubid>
                  <pubid idtype="pmpid" link="fulltext">15995215</pubid>
                  <pubid idtype="doi">10.1128/JB.187.14.4992-4999.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>The complete genome of the crenarchaeon Sulfolobus solfataricus P2</p>
            </title>
            <aug>
               <au>
                  <snm>She</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Confalonieri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Allard</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Awayez</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chan-Weiher</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Clausen</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Curtis</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>De Moors</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Erauso</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Heikamp-de Jong</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jeffries</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Kozera</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Medina</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Thi-Ngoc</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Redder</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schenk</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Theriault</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tolstrup</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Duguet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Sensen</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Van der Oost</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>14</issue>
            <fpage>7835</fpage>
            <lpage>7840</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">35428</pubid>
                  <pubid idtype="pmpid" link="fulltext">11427726</pubid>
                  <pubid idtype="doi">10.1073/pnas.141222098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Complete genome sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7</p>
            </title>
            <aug>
               <au>
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jin-no</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ankai</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kosugi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hosoyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fukui</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nishijima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Otsuka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakazawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takamiya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshizawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kudoh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kushida</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oguchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aoki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Masuda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yanagii</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishimura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamagishi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <issue>4</issue>
            <fpage>123</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/8.4.123</pubid>
                  <pubid idtype="pmpid" link="fulltext">11572479</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>The complete genome sequence of the hyperthermophilic, sulphate- reducing archaeon Archaeoglobus fulgidus [published erratum appears in Nature 1998 Jul 2;394(6688):101]</p>
            </title>
            <aug>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>McKenney</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Loftus</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>390</volume>
            <issue>6658</issue>
            <fpage>364</fpage>
            <lpage>370</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/37052</pubid>
                  <pubid idtype="pmpid" link="fulltext">9389475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Genome sequence of Haloarcula marismortui: a halophilic archaeon from the Dead Sea</p>
            </title>
            <aug>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Bonneau</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Facciotti</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Glusman</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Deutsch</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Gan</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Date</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>WV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>2221</fpage>
            <lpage>2234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">525680</pubid>
                  <pubid idtype="pmpid" link="fulltext">15520287</pubid>
                  <pubid idtype="doi">10.1101/gr.2700304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Genome sequence of Halobacterium species NRC-1</p>
            </title>
            <aug>
               <au>
                  <snm>Ng</snm>
                  <fnm>WV</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Mahairas</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Berquist</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shukla</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Lasky</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Thorsson</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sbrogna</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Swartzell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weir</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dahl</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Welti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goo</snm>
                  <fnm>YA</fnm>
               </au>
               <au>
                  <snm>Leithauser</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cruz</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Danson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hough</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Maddocks</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Jablonski</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Krebs</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Angevine</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Dale</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Isenbarger</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Peck</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Pohlschroder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spudich</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Alam</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Freitas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Daniels</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Dennis</snm>
                  <fnm>PP</fnm>
               </au>
               <au>
                  <snm>Omer</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Ebhardt</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>DasSarma</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <issue>22</issue>
            <fpage>12176</fpage>
            <lpage>12181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17314</pubid>
                  <pubid idtype="pmpid" link="fulltext">11016950</pubid>
                  <pubid idtype="doi">10.1073/pnas.190337797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>The genome of the square archaeon Haloquadratum walsbyi : life at the limits of water activity</p>
            </title>
            <aug>
               <au>
                  <snm>Bolhuis</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Palm</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wende</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Falb</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rampp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Valera</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pfeiffer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Oesterhelt</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>169</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1544339</pubid>
                  <pubid idtype="pmpid" link="fulltext">16820047</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-169</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Complete genome sequence of Methanobacterium thermoautotrophicum deltaH: functional analysis and comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Doucette-Stamm</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Deloughery</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dubois</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aldredge</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bashirzadeh</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Blakely</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hoang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Keagle</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lumm</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pothier</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spadafora</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Vicaire</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wierzbowski</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jiwani</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Caruso</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bush</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Reeve</snm>
                  <fnm>JN</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <issue>22</issue>
            <fpage>7135</fpage>
            <lpage>7155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">179657</pubid>
                  <pubid idtype="pmpid" link="fulltext">9371463</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii [see comments]</p>
            </title>
            <aug>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>FitzGerald</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Reich</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Merrick</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Glodek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Geoghagen</snm>
                  <fnm>NSM</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>273</volume>
            <issue>5278</issue>
            <fpage>1058</fpage>
            <lpage>1073</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.273.5278.1058</pubid>
                  <pubid idtype="pmpid" link="fulltext">8688087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Complete genome sequence of the genetically tractable hydrogenotrophic methanogen Methanococcus maripaludis</p>
            </title>
            <aug>
               <au>
                  <snm>Hendrickson</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Kaul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bovee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chung</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Conway de Macario</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dodsworth</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Gillett</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Hackett</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haydock</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lie</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Major</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Porat</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Palmeiri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rouse</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Saenphimmachak</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Van Dien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Whitman</snm>
                  <fnm>WB</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Leigh</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>20</issue>
            <fpage>6956</fpage>
            <lpage>6969</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522202</pubid>
                  <pubid idtype="pmpid" link="fulltext">15466049</pubid>
                  <pubid idtype="doi">10.1128/JB.186.20.6956-6969.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>The genome of M. acetivorans reveals extensive metabolic and physiological diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Endrizzi</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Macdonald</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Engels</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Atnoor</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>DeArellano</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>McEwan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McKernan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Talamas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tirrell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Cann</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Grahame</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Guss</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Hedderich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ingram-Smith</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kuettner</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Krzycki</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Leigh</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mukhopadhyay</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Reeve</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Conway de Macario</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ferry</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Jarrell</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Jing</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Macario</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pritchett</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sowers</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Swanson</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Zinder</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Metcalf</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>4</issue>
            <fpage>532</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187521</pubid>
                  <pubid idtype="pmpid" link="fulltext">11932238</pubid>
                  <pubid idtype="doi">10.1101/gr.223902</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>The Methanosarcina barkeri genome: comparative analysis with Methanosarcina acetivorans and Methanosarcina mazei reveals extensive rearrangement within methanosarcinal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Maeder</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Brettin</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Bruce</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Gilna</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Lapidus</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Metcalf</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tapia</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sowers</snm>
                  <fnm>KR</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2006</pubdate>
            <volume>188</volume>
            <issue>22</issue>
            <fpage>7922</fpage>
            <lpage>7931</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636319</pubid>
                  <pubid idtype="pmpid" link="fulltext">16980466</pubid>
                  <pubid idtype="doi">10.1128/JB.00810-06</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>The genome of Methanosarcina mazei: evidence for lateral gene transfer between bacteria and archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Deppenmeier</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Johann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hartsch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Merkl</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Martinez-Arias</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Henne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wiezer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baumer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jacobi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bruggemann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lienard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Christmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bomeke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Steckel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bhattacharyya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lykidis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Gunsalus</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Fritz</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Gottschalk</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Mol Microbiol Biotechnol</source>
            <pubdate>2002</pubdate>
            <volume>4</volume>
            <issue>4</issue>
            <fpage>453</fpage>
            <lpage>461</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12125824</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>The genome sequence of Methanosphaera stadtmanae reveals why this human intestinal archaeon is restricted to methanol and H2 for methane formation and ATP synthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Fricke</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Seedorf</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Henne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kruer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liesegang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hedderich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gottschalk</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thauer</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2006</pubdate>
            <volume>188</volume>
            <issue>2</issue>
            <fpage>642</fpage>
            <lpage>658</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347301</pubid>
                  <pubid idtype="pmpid" link="fulltext">16385054</pubid>
                  <pubid idtype="doi">10.1128/JB.188.2.642-658.2006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Living with two extremes: conclusions from the genome sequence of Natronomonas pharaonis</p>
            </title>
            <aug>
               <au>
                  <snm>Falb</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pfeiffer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Palm</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rodewald</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hickmann</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Tittor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oesterhelt</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>10</issue>
            <fpage>1336</fpage>
            <lpage>1343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1240075</pubid>
                  <pubid idtype="pmpid" link="fulltext">16169924</pubid>
                  <pubid idtype="doi">10.1101/gr.3952905</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Genome sequence of Picrophilus torridus and its implications for life around pH 0</p>
            </title>
            <aug>
               <au>
                  <snm>Futterer</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Angelov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liesegang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gottschalk</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schleper</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schepers</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dock</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Antranikian</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liebl</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>24</issue>
            <fpage>9091</fpage>
            <lpage>9096</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">428478</pubid>
                  <pubid idtype="pmpid" link="fulltext">15184674</pubid>
                  <pubid idtype="doi">10.1073/pnas.0401356101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>An integrated analysis of the genome of the hyperthermophilic archaeon Pyrococcus abyssi</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>GN</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Flament</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heilig</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lecompte</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Poch</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Prieur</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Querellou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ripp</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thierry</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Van der Oost</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Forterre</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>47</volume>
            <issue>6</issue>
            <fpage>1495</fpage>
            <lpage>1512</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2003.03381.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12622808</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Divergence of the hyperthermophilic archaea Pyrococcus furiosus and P. horikoshii inferred from complete genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Maeder</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Gonzalez</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>DiRuggiero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Robb</snm>
                  <fnm>FT</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>152</volume>
            <issue>4</issue>
            <fpage>1299</fpage>
            <lpage>1305</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460691</pubid>
                  <pubid idtype="pmpid" link="fulltext">10430560</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Complete sequence and gene organization of the genome of a hyper- thermophilic archaebacterium, Pyrococcus horikoshii OT3 (supplement)</p>
            </title>
            <aug>
               <au>
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sawada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Haikawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kosugi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hosoyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ogura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Otsuka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakazawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takamiya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ohfuku</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Funahashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kudoh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kushida</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oguchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aoki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>1998</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>147</fpage>
            <lpage>155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/5.2.147</pubid>
                  <pubid idtype="pmpid" link="fulltext">9679203</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Complete genome sequence of the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1 and comparison with Pyrococcus genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Fukui</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Atomi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kanai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Matsumi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Imanaka</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>352</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">551561</pubid>
                  <pubid idtype="pmpid" link="fulltext">15710748</pubid>
                  <pubid idtype="doi">10.1101/gr.3003105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum [In Process Citation]</p>
            </title>
            <aug>
               <au>
                  <snm>Ruepp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Graml</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Santos-Martinez</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Koretke</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Volker</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stocker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lupas</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Baumeister</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <issue>6803</issue>
            <fpage>508</fpage>
            <lpage>513</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35035069</pubid>
                  <pubid idtype="pmpid" link="fulltext">11029001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Archaeal adaptation to higher temperatures revealed by genomic sequence of Thermoplasma volcanium</p>
            </title>
            <aug>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Amano</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Koike</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Higuchi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima-Ohya</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanehori</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kawamoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nunoshiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Aramaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <issue>26</issue>
            <fpage>14257</fpage>
            <lpage>14262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18905</pubid>
                  <pubid idtype="pmpid" link="fulltext">11121031</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.26.14257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
