<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-5-research0024</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Evolution of gene fusions: horizontal transfer versus independent events</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Yanai</snm>
               <fnm>Itai</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Wolf</snm>
               <mi>I</mi>
               <fnm>Yuri</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3" ca="yes">
               <snm>Koonin</snm>
               <mi>V</mi>
               <fnm>Eugene</fnm>
               <insr iid="I2"/>
               <email>koonin@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Graduate Program and Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA</p>
            </ins>
            <ins id="I2">
               <p>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MA 20894, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>5</issue>
         <fpage>research0024.1</fpage>
         <lpage>research0024.13</lpage>
         <url>http://genomebiology.com/2002/3/5/research/0024</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-5-research0024</pubid>
               <pubid idtype="pmpid">12049665</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>12</day>
               <month>11</month>
               <year>2001</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>7</day>
               <month>2</month>
               <year>2002</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>26</day>
               <month>3</month>
               <year>2002</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>4</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>Yanai et al., licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shorttitle>
         <p>Evolution of gene fusions: horizontal transfer versus independent events</p>
      </shorttitle>
      <shortabs>
         <p>The evolutionary history of gene fusions was studied by phylogenetic analysis. Of the 51 gene fusions studied,  31 were most probably disseminated by cross-kingdom horizontal gene transfer, 14 appeared to have evolved independently in different kingdoms and two were probably inherited from a common ancestor.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Gene fusions can be used as tools for functional prediction and also as evolutionary markers. Fused genes often show a scattered phyletic distribution, which suggests a role for processes other than vertical inheritance in their evolution.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The evolutionary history of gene fusions was studied by phylogenetic analysis of the domains in the fused proteins and the orthologous domains that form stand-alone proteins. Clustering of fusion components from phylogenetically distant species was construed as evidence of dissemination of the fused genes by horizontal transfer. Of the 51 examined gene fusions that are represented in at least two of the three primary kingdoms (Bacteria, Archaea and Eukaryota), 31 were most probably disseminated by cross-kingdom horizontal gene transfer, whereas 14 appeared to have evolved independently in different kingdoms and two were probably inherited from the common ancestor of modern life forms. On many occasions, the evolutionary scenario also involves one or more secondary fissions of the fusion gene. For approximately half of the fusions, stand-alone forms of the fusion components are encoded by juxtaposed genes, which are known or predicted to belong to the same operon in some of the prokaryotic genomes. This indicates that evolution of gene fusions often, if not always, involves an intermediate stage, during which the future fusion components exist as juxtaposed and co-regulated, but still distinct, genes within operons.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>These findings suggest a major role for horizontal transfer of gene fusions in the evolution of protein-domain architectures, but also indicate that independent fusions of the same pair of domains in distant species is not uncommon, which suggests positive selection for the multidomain architectures.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Gene fusion leading to the formation of multidomain proteins is one of the major routes of protein evolution. Gene fusions characteristically bring together proteins that function in a concerted manner, such as successive enzymes in metabolic pathways, enzymes and the domains involved in their regulation, or DNA-binding domains and ligand-binding domains in prokaryotic transcriptional regulators [<abbr bid="B1">1</abbr>,<abbr bid="B2">2</abbr>,<abbr bid="B3">3</abbr>]. The selective advantage of domain fusion lies in the increased efficiency of coupling of the corresponding biochemical reaction or signal transduction step [<abbr bid="B1">1</abbr>] and in the tight co-regulation of expression of the fused domains. In signal transduction systems, such as prokaryotic two-component regulators and sugar phosphotransferase (PTS) systems, or eukaryotic receptor kinases, domain fusion is the main principle of functional design [<abbr bid="B4">4</abbr>,<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>]. Furthermore, accretion of multiple domains appears to be one of the important routes for increasing functional complexity in the evolution of multicellular eukaryotes [<abbr bid="B7">7</abbr>,<abbr bid="B8">8</abbr>,<abbr bid="B9">9</abbr>].</p>
         <p>Pairs of distinct genes that are fused in at least one genome have been termed fusion-linked [<abbr bid="B3">3</abbr>]. A gene fusion is presumably fixed during evolution only when the partners cooperate functionally and, by inference, a functional link can be predicted to exist between fusion-linked genes. Recently, this simple concept has been used by several groups as a means of systematic prediction of the functions of uncharacterized genes [<abbr bid="B1">1</abbr>,<abbr bid="B2">2</abbr>,<abbr bid="B3">3</abbr>,<abbr bid="B10">10</abbr>,<abbr bid="B11">11</abbr>].</p>
         <p>In addition to their utility for functional prediction, analysis of gene fusions may help in addressing fundamental evolutionary issues. Gene fusions often show scattered phyletic patterns, appearing in several species from different lineages. By investigating the phylogenies of each of the two fusion-linked genes, it may be possible to determine the evolutionary scenario for the fusion itself. A recent study provided evidence that the fission of fused genes occurred during evolution at a rate comparable to that of fusion [<abbr bid="B12">12</abbr>]. Here, we address another central aspect of the evolution of gene fusions, namely, do fusions of the same domains in different phylogenetic lineages reflect vertical descent, possibly accompanied by multiple lineage-specific fission events, or independent fusion events, or horizontal transfer of the fused gene? In other words, is a fusion of a given pair of genes extremely rare and, once formed, is it spread by horizontal gene transfer (HGT) perhaps also followed by fissions in some lineages? Alternatively, are independent fusions of the same gene pair in distinct lineages relatively common during evolution? Among fusions that are found in at least two of the three primary kingdoms of life (Bacteria, Archaea and Eukaryota), we detected both modes of evolution, but horizontal transfer of a fused gene appeared to be more common than independent fusion events or vertical inheritance with multiple fissions.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>To distinguish between a single fusion event followed by HGT and/or fission of the fused gene and multiple, independent fusion events in distinct organisms, we analyzed phylogenetic trees that were constructed separately for each of the fusion-linked domains (proteins). The fusion was split into the individual component domains and phylogenetic trees were built for each of the corresponding orthologous sets from 32 complete microbial genomes (Figure <figr fid="F1">1</figr>, and see Materials and methods), including both fusion components and products of stand-alone genes. The topologies of the resulting trees were compared to each other and to the topology of a phylogenetic tree constructed on the basis of a concatenated alignment of ribosomal proteins, which was chosen as the (hypothetical) species tree of the organisms involved [<abbr bid="B13">13</abbr>]. If the fusion events either occurred independently of each other or were vertically inherited, perhaps followed by fission in some lineages, the distribution of the fusion components in the phylogenetic trees for the orthologous clusters to which they belong is expected to mimic the distribution of the species carrying the fusion in the species tree. In contrast, if the fusion gene has been disseminated by HGT, fusion components will form odd clusters different from those in the species tree.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Phyletic patterns of fusion-linked COGs</p>
            </caption>
            <text>
               <p>Phyletic patterns of fusion-linked COGs. Each pair of COGs is represented by a double column. The dark-gray rectangles indicate fusions, the light-gray rectangles indicate that the fusion components are represented by stand-alone genes in the given genomes, and the white rectangles indicate that there is no representative of the given COG in the given genome. Where one rectangle in a double column is light gray and the other is white, the genome in question has a representative of only one of the pair of fusion-linked COGs. Species abbreviations are as listed in Materials and methods.</p>
            </text>
            <graphic file="gb-2002-3-5-research0024-1"/>
         </fig>
         <p>This could be a straightforward approach to reconstructing the evolutionary history of gene fusions, if only the topology of the species trees was well resolved. However, this is not necessarily the case for bacteria or archaea, where relationships between major lineages remain uncertain [<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>], although a recent detailed analysis suggested some higher-level evolutionary affinities [<abbr bid="B13">13</abbr>]. Because the distinction between the three primary kingdoms is widely recognized [<abbr bid="B14">14</abbr>,<abbr bid="B16">16</abbr>] and is clear in the trees for most protein families [<abbr bid="B17">17</abbr>], <it>trans</it>-kingdom horizontal transfers of fused genes can be more reliably detected with the proposed approach. Therefore, we concentrated on the evolutionary histories of gene fusions that are shared by at least two of the three primary kingdoms.</p>
         <p>As the framework for this analysis, we used the database of clusters of orthologous groups (COGs) of proteins [<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>], which contains sets of orthologous proteins and domains from complete microbial genomes (32 genomes at the time of this analysis; see Materials and methods). Domain fusions represented in some genomes by stand-alone versions of the fusion components are split in the COG database so that each fusion component can be assigned to a different COG. Whenever distinct domains of a fusion protein belong to separate COGs, the corresponding COGs are said to be fusion-linked [<abbr bid="B3">3</abbr>]. A search of the COGs database revealed 405 pairs of fusion-linked COGs. The vast majority (87%) of fusion links include fusion present in only one primary kingdom (Table <tblr tid="T1">1</tblr>). Only 52 pairs of fusion-linked COGs included fusions represented in two or three kingdoms (Table <tblr tid="T1">1</tblr>), and for reasons discussed above, we chose these pairs of COGs for an evolutionary analysis of gene fusions.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Phyletic patterns of gene fusions</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="left">
                     <p>Kingdom profile<sup>*</sup></p>
                  </c>
                  <c ca="center">
                     <p>Number of fusion links between COGs</p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>abe</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="center">
                     <p>27</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="center">
                     <p>20</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>a-e</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>a--</p>
                  </c>
                  <c ca="center">
                     <p>82</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>-b-</p>
                  </c>
                  <c ca="center">
                     <p>215</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>--e</p>
                  </c>
                  <c ca="center">
                     <p>56</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Total</p>
                  </c>
                  <c ca="center">
                     <p>405</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>*</sup>a, Archaea; b, Bacteria; e, Eukaryota.</p>
            </tblfn>
         </tbl>
         <p>Figure <figr fid="F1">1</figr> shows a genome-COG matrix that reveals the phyletic (phylogenetic) patterns of the presence or absence of the orthologs across the spectrum of the sequenced genomes [<abbr bid="B18">18</abbr>] for each of the 52 pairs of fusion-linked COGs containing cross-kingdom fusions. When assessed against the topology of the tentative species tree based on the concatenated alignments of ribosomal proteins [<abbr bid="B13">13</abbr>], fusions showed a scattered distribution in phyletic patterns (depicted by columns in Figure <figr fid="F1">1</figr>). For example, the fusion between COG1788 and COG2057 (&#945; and &#946; subunits of acyl-CoA: acetate CoA transferase) is seen in the bacteria <it>Escherichia coli, Deinococcus radiodurans</it> and <it>Bacillus halodurans</it>, and in the archaea <it>Aeropyrum pernix, Thermophilus acidophilum</it> and <it>Halobacterium</it> sp. Similarly, the fusion between COG1683 and COG3272 (uncharacterized, conserved domains) was found in the bacteria <it>Pseudomonas aeruginosa</it> and <it>Vibrio cholerae</it>, and in the archaeon <it>Methanobacterium thermoautotrophicum</it>. In each of these cases, with the species tree used as a reference, the bacteria involved are phylogenetically distant from each other and more so from the archaea, and non-fused versions of the two domains exist within the same bacterial lineages and in archaea (Figure <figr fid="F1">1</figr>). These observations emphasize the central question of this work: are the fusions between the same pair of domains in different species independent or are they best explained by HGT?</p>
         <p>Figure <figr fid="F2">2</figr> shows the pair of phylogenetic trees for the fusion-linked COGs 1788 and 2057. In both trees, the fusion components from <it>E. coli</it> and <it>B. halodurans</it> (YdiF and BH3898, respectively) confidently group with the archaeal fusion components, to the exclusion of the non-fused orthologs. This position of the <it>E. coli</it> and <it>B. halodurans</it> fusion components is unexpected and is in contrast to the placement of the orthologs from other gamma-proteobacteria and Gram-positive bacteria, as well as non-fused paralogs from the same species (AtoA/D and BH2258/2259, respectively) within the bacterial cluster. These observations strongly suggest that the gene for fused subunits of acyl-CoA: acetate CoA transferase was disseminated horizontally between <it>E. coli, B. halodurans</it>, and archaea. The presence of non-fused paralogs in both these bacterial species appears to be best compatible with gene transfer from archaea to bacteria. In contrast, the fusion of the pair of domains from the same COGs seen in <it>D. radiodurans</it> seems to be an independent event because, in both trees, the <it>D. radiodurans</it> branch is in the middle of the bacterial cluster (Figure <figr fid="F2">2a,2b</figr>). Thus, the history of this pair of fusion-linked COGs appears to involve horizontal transfer of the fused gene between bacteria and archaea (and possibly also within kingdoms), as well as at least one additional, independent fusion event in bacteria.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Phylogenetic trees for fusion-linked COGs: &#945; and &#946; subunits of acyl-CoA:acetate CoA transferase</p>
            </caption>
            <text>
               <p>Phylogenetic trees for fusion-linked COGs: &#945; and &#946; subunits of acyl-CoA:acetate CoA transferase. Fusion components are denoted by shading and by a number after an underline (_1 for the amino-terminal domain and _2 for the carboxy-terminal domain). The three primary kingdoms are color-coded as indicated in the figure. The RELL bootstrap values are indicated for each internal branch. <b>(a)</b> &#945; subunit (domain) (COG1788); <b>(b)</b> &#946; subunit (domain) (COG2057). The proteins are designated using the corresponding systematic gene names followed (after the underline) by the abbreviated species names. Species abbreviations are as in Materials and methods and Figure <figr fid="F1">1</figr>.</p>
            </text>
            <graphic file="gb-2002-3-5-research0024-2"/>
         </fig>
         <p>Figure <figr fid="F3">3</figr> shows the phylogenetic trees for the two domains of phosphoribosylformylglycinamidine (FGAM) synthase, a purine biosynthesis enzyme. The components of this fusion, which is found in proteobacteria and eukaryotes, form a tight cluster separated by a long internal branch from the non-fused bacterial and archaeal orthologs. This tree topology suggests HGT between bacteria and eukaryotes, possibly a relocation of the fused gene from the pro-mitochondrion to the eukaryotic nuclear genome or, alternatively, gene transfer from eukaryotes to proteobacteria. An additional aspect of the evolution of this gene is the apparent acceleration of evolution upon gene fusion, which is manifest in the long branch that separates the proteobacterial-eukaryotic cluster from the rest of the bacterial and archaeal species (Figure <figr fid="F3">3a,3b</figr>).</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Phylogenetic trees for fusion-linked COGs: phosphoribosylformylglycinamidine (FGAM) synthase</p>
            </caption>
            <text>
               <p>Phylogenetic trees for fusion-linked COGs: phosphoribosylformylglycinamidine (FGAM) synthase. <b>(a)</b> Synthetase domain (subunit) (COG0046); <b>(b)</b> glutamine amidotransferase domain (subunit) (COG0047). Protein designations are as in Figure <figr fid="F2">2</figr>.</p>
            </text>
            <graphic file="gb-2002-3-5-research0024-3"/>
         </fig>
         <p>The fusion-linked COGs 1605 and 0077 (chorismate mutase and prephenate dehydratase, respectively) show a more complicated history, with distinct fusion events resulting in different domain architectures (see legend to Figure <figr fid="F4">4</figr>). The presence, in both trees, of two distinct clusters of fusion components and the isolated fusion in <it>Campylobacter jejuni</it> suggest at least three independent fusion events, two of which apparently were followed by horizontal dissemination of the fused gene (Figure <figr fid="F4">4a,4b</figr>). The single archaeal fusion, the <it>Arachaeoglobus fulgidus</it> protein AF0227, belongs to one of these clusters and shows a strongly supported affinity with the ortholog from the hyperthermophilic bacterium <it>Thermotoga maritima</it>. (Figure <figr fid="F4">4a,4b</figr>). Given the broad distribution of this fusion in bacteria, horizontal transfer of the bacterial fused gene to archaea is the most likely scenario.</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Phylogenetic trees for fusion-linked COGs: chorismate mutase and prephenate dehydratase</p>
            </caption>
            <text>
               <p>Phylogenetic trees for fusion-linked COGs: chorismate mutase and prephenate dehydratase. <b>(a)</b> Chorismate mutase (COG1605); <b>(b)</b> prephenate dehydratase (COG0077). Protein designations are as in Figure <figr fid="F2">2</figr>. The protein AF0227 contains a prephenate dehydrogenase domain in addition to the chorismate mutase and prephenate dehydratase domains.</p>
            </text>
            <graphic file="gb-2002-3-5-research0024-4"/>
         </fig>
         <p>The pair of fusion-linked COGs 0777 and 0825 (&#945; and &#946; subunits of acetyl-CoA carboxylase, respectively) shows unequivocal clustering of the fusion components from numerous archaeal and bacterial species, which indicates a prevalent role for HGT in the evolution of this fusion (Figure <figr fid="F5">5a,5b</figr>). Moreover, archaea are scattered among bacteria, suggesting multiple HGT events. However, an apparent independent fusion is seen in <it>Mycobacterium tuberculosis</it> (Figure <figr fid="F5">5a,5b</figr>). It could be argued that, in cases like those in Figure <figr fid="F5">5</figr>, where there is a sharp separation (a long, strongly supported internal branch in each of the trees) between the fusion components and stand-alone proteins, the COGs involved needed to be reorganized, to form one COG consisting of fusion proteins only and two separate COGs consisting of stand-alone proteins. Formally, this would eliminate the need for HGT as an explanation of the tree topology for any of these new COGs. However, this solution (even if attractive from the point of view of classification) does not seem to be correct in light of the principle of orthology that underlies the COG system: it appears that, in both of the COGs involved, the fusion components and stand-alone proteins are <it>bona fide</it> orthologs, as judged by the high level of sequence conservation and by the fact that, in the majority of species involved, they are the only versions of this key enzyme.</p>
         <fig id="F5">
            <title>
               <p>Figure 5</p>
            </title>
            <caption>
               <p>Phylogenetic trees for fusion-linked COGs: &#945; and &#946; subunits of acetyl-CoA carboxylase</p>
            </caption>
            <text>
               <p>Phylogenetic trees for fusion-linked COGs: &#945; and &#946; subunits of acetyl-CoA carboxylase. <b>(a)</b> &#946; subunit (domain) (COG0777); <b>(b)</b> &#945; subunit (domain) (COG0825). Protein designations are as in Figure <figr fid="F2">2</figr>. The proteins DRA0310 and PA1400, in addition to the domains corresponding to the &#945; and &#946; subunits of acetyl-CoA carboxylase, contain a biotin carboxylase domain and a biotin carboxyl carrier protein domain. The clustering of these proteins in phylogenetic trees almost certainly reflects HGT between the respective bacterial lineages.</p>
            </text>
            <graphic file="gb-2002-3-5-research0024-5"/>
         </fig>
         <p>The results of phylogenetic analyses of the 51 cross-kingdom fusion links are summarized in Tables <tblr tid="T2">2</tblr> and <tblr tid="T3">3</tblr> and the Additional data. In 31 of the 51 links, an inter-kingdom horizontal transfer of the fused gene appeared to be the evolutionary mechanism by which the fusion entered one of the kingdoms. In contrast, only 14 fusion-linked pairs of COGs show evidence of independent fusion in two kingdoms, and in just two cases, the fusion seems to have been inherited from the last universal common ancestor. The latter two scenarios were distinguished on the basis of the parsimony principle, that is, by counting the number of evolutionary events (fusions or fissions) that were required to produce the observed distribution of fusion components and stand-alone versions of the domains involved across the tree branches. Accordingly, it needs to be emphasized that we can only infer the most likely scenario under the assumption that the probabilities of fusion and fission are comparable. It cannot be ruled out that some of the scenarios we classify as independent fusions in reality reflect the existence of an ancestral fused gene and subsequent multiple, independent fissions. The detection of ancestral domain fusions may call for the unification of the respective COG pairs in a single COG, with the species in which fission occurred represented by two distinct proteins.</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Evolutionary history of <it>trans</it>-kingdom gene fusions</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c ca="left">
                     <p>COG A</p>
                  </c>
                  <c ca="left">
                     <p>Protein function</p>
                  </c>
                  <c ca="left">
                     <p>COG B</p>
                  </c>
                  <c ca="left">
                     <p>Protein function</p>
                  </c>
                  <c ca="left">
                     <p>Kingdom pattern<sup>*</sup></p>
                  </c>
                  <c ca="left">
                     <p>Principal mode of evolution<sup>&#8224;</sup></p>
                  </c>
                  <c ca="left">
                     <p>Fusion</p>
                  </c>
                  <c ca="left">
                     <p>Gene juxtaposition<sup>&#8225;</sup></p>
                  </c>
                  <c ca="left">
                     <p>Evolutionary scenario</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0046</p>
                  </c>
                  <c ca="left">
                     <p>Phospho-ribosyl-formylglycinamidine (FGAM) synthase, synthetase domain</p>
                  </c>
                  <c ca="left">
                     <p>COG0047</p>
                  </c>
                  <c ca="left">
                     <p>Phospho-ribosyl-formyl-glycinamidine (FGAM) synthase glutamine Amidotransferase domain</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Ecol, Paer, Vcho, Hinf, Xfas, Nmen</p>
                  </c>
                  <c ca="left">
                     <p>Pyro, Paby, Tmar, Drad, Bsub, Bhal</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and proteobacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0067</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 1</p>
                  </c>
                  <c ca="left">
                     <p>COG0069</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 2</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Mjan, Tmar</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0067</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 1</p>
                  </c>
                  <c ca="left">
                     <p>COG0070</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 3</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0069</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 2</p>
                  </c>
                  <c ca="left">
                     <p>COG0070</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 3</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Mjan, Mthe</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0139</p>
                  </c>
                  <c ca="left">
                     <p>Phospho-ribosyl-AMP cyclohydrolase (histidine biosynthesis)</p>
                  </c>
                  <c ca="left">
                     <p>COG0140</p>
                  </c>
                  <c ca="left">
                     <p>Phospho-ribosyl-ATP pyrophospho-hydrolase (histidine biosynthesis)</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Uncertain</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0145</p>
                  </c>
                  <c ca="left">
                     <p>N-methylhydaintoinase A</p>
                  </c>
                  <c ca="left">
                     <p>COG0146</p>
                  </c>
                  <c ca="left">
                     <p>N-methylhydaintoinase B</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Syne, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Mjan, Aero, Hpyl</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and (the ancestor of) Cyanobacteria and Actinomycetes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0147</p>
                  </c>
                  <c ca="left">
                     <p>Anthranilate/para-aminobenzoate synthase component I</p>
                  </c>
                  <c ca="left">
                     <p>COG0512</p>
                  </c>
                  <c ca="left">
                     <p>Anthranilate/para-aminobenzoate synthase component II</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Nmen, Cjej, Paer, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Mthe, Taci, Aero, Tmar, Drad, Bsub, Bhal, Ecol, Vcho, Xfas</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0169</p>
                  </c>
                  <c ca="left">
                     <p>Shikimate 5-dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p>COG0710</p>
                  </c>
                  <c ca="left">
                     <p>3-dehydro-quinate dehydratase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Ctra, Cpne, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Paby<sup>&#182;</sup>Ecol</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0294</p>
                  </c>
                  <c ca="left">
                     <p>Dihydropteroate synthase</p>
                  </c>
                  <c ca="left">
                     <p>COG0801</p>
                  </c>
                  <c ca="left">
                     <p>7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Ctra, Cpne, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Llac<sup>&#182;</sup>, Tmar, Drad, Bsub, Bhal</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0304</p>
                  </c>
                  <c ca="left">
                     <p>3-oxoacyl-(acyl-carrier-protein) synthase</p>
                  </c>
                  <c ca="left">
                     <p>COG0331</p>
                  </c>
                  <c ca="left">
                     <p>(acyl-carrier-protein) S-malonyl-transferase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Drad, Ecol, Vcho</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0331</p>
                  </c>
                  <c ca="left">
                     <p>3-oxoacyl-(acyl-carrier-protein) synthase</p>
                  </c>
                  <c ca="left">
                     <p>COG2030</p>
                  </c>
                  <c ca="left">
                     <p>Acyl dehydratase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Bsub, Scer</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer between eukaryotes and Actinomycetes; additional, independent fusions in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0337</p>
                  </c>
                  <c ca="left">
                     <p>3-dehydroquinate synthetase</p>
                  </c>
                  <c ca="left">
                     <p>COG0703</p>
                  </c>
                  <c ca="left">
                     <p>Shikimate kinase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Tmar, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Drad, Mtub, Proteo-bacteria, Ctra, Cpne</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria (with different domain organizations)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0403</p>
                  </c>
                  <c ca="left">
                     <p>Glycine cleavage system protein P (pyridoxal-binding), amino-terminal domain</p>
                  </c>
                  <c ca="left">
                     <p>COG1003</p>
                  </c>
                  <c ca="left">
                     <p>Glycine cleavage system protein P (pyridoxal-binding), carboxy-terminal domain</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Drad, Mtub, Syne, Ecol, Paer, Xfas, Nmen</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Pyro, Taci, Aero, Tmar, Bsub, Bhal</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and proteobacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0439</p>
                  </c>
                  <c ca="left">
                     <p>Biotin carboxylase</p>
                  </c>
                  <c ca="left">
                     <p>COG0511</p>
                  </c>
                  <c ca="left">
                     <p>Biotin carboxyl carrier protein</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Mtub, Rpxx, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Bhal, Ecol, Paer Vcho, Hinf, Xfas, Nmen, Hpyl, Ctra, Cpne</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria; additional, independent fusions in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0439</p>
                  </c>
                  <c ca="left">
                     <p>Biotin carboxylase</p>
                  </c>
                  <c ca="left">
                     <p>COG1038</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate carboxylase, carboxy-terminal domain/subunit</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Bsub, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Mjan</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria; subsequent domain accretion in eukaryotes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0439</p>
                  </c>
                  <c ca="left">
                     <p>Biotin carboxylase</p>
                  </c>
                  <c ca="left">
                     <p>COG0825</p>
                  </c>
                  <c ca="left">
                     <p>Acetyl-CoA carboxylase &#945;-subunit</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Rpxx</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and bacteria; subsequent domain accretion in eukaryotes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0476</p>
                  </c>
                  <c ca="left">
                     <p>Dinucleotide-utilizing enzyme involved in molybdopterin and thiamine biosynthesis</p>
                  </c>
                  <c ca="left">
                     <p>COG0607</p>
                  </c>
                  <c ca="left">
                     <p>Rhodanese-related sulfurtransferase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Syne, Paer, Scer</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in x sulfurtransferase</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0511</p>
                  </c>
                  <c ca="left">
                     <p>Biotin carboxyl carrier protein</p>
                  </c>
                  <c ca="left">
                     <p>COG0825</p>
                  </c>
                  <c ca="left">
                     <p>Acetyl-CoA carboxylase &#945;-subunit</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Drad, Paer, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Pyro, Tmar, Hbsp<sup>&#165;</sup></p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0664</p>
                  </c>
                  <c ca="left">
                     <p>cAMP-binding domain</p>
                  </c>
                  <c ca="left">
                     <p>COG1752</p>
                  </c>
                  <c ca="left">
                     <p>Esterase</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Ccre<sup>||</sup>, Scer</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between eukaryotes and actinomycetes; an additional, independent fusion event in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1984</p>
                  </c>
                  <c ca="left">
                     <p>Allophanate hydrolase subunit 2</p>
                  </c>
                  <c ca="left">
                     <p>COG2049</p>
                  </c>
                  <c ca="left">
                     <p>Allophanate hydrolase subunit 1</p>
                  </c>
                  <c ca="left">
                     <p>-be</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Bsub, Scer</p>
                  </c>
                  <c ca="left">
                     <p>Most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1155</p>
                  </c>
                  <c ca="left">
                     <p>Archaeal/vacuolar-type H<sup>+</sup>-ATPase subunit A</p>
                  </c>
                  <c ca="left">
                     <p>COG1372</p>
                  </c>
                  <c ca="left">
                     <p>Intein</p>
                  </c>
                  <c ca="left">
                     <p>a-e</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Taci, Pyro, Scer</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in eukaryotes and archaea</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0025</p>
                  </c>
                  <c ca="left">
                     <p>Na<sup>+</sup>/H<sup>+</sup> and K<sup>+</sup>/H<sup>+</sup> antiporters</p>
                  </c>
                  <c ca="left">
                     <p>COG0569</p>
                  </c>
                  <c ca="left">
                     <p>K<sup>+</sup> transport systems, NAD-binding component</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Bhal, Syne</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Uncertain</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0062</p>
                  </c>
                  <c ca="left">
                     <p>Uncharacterized, conserved protein</p>
                  </c>
                  <c ca="left">
                     <p>COG0063</p>
                  </c>
                  <c ca="left">
                     <p>Predicted sugar kinase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>AF</p>
                  </c>
                  <c ca="left">
                     <p>All archaea; all bacteria that have COG0062</p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>One ancestral fusion; fission in eukaryotes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0069</p>
                  </c>
                  <c ca="left">
                     <p>Glutamate synthase domain 2</p>
                  </c>
                  <c ca="left">
                     <p>COG1037</p>
                  </c>
                  <c ca="left">
                     <p>Ferredoxin-like domain</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Mjan, Mthe, Tmar; (all that have COG1037)</p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>One ancestral fusion; fused gene transfer from archaea to bacteria (<it>Thermotoga</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0077</p>
                  </c>
                  <c ca="left">
                     <p>Prephenate dehydratase</p>
                  </c>
                  <c ca="left">
                     <p>COG1605</p>
                  </c>
                  <c ca="left">
                     <p>Chorismate mutase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Aqua, Tmar, Ecol, Vcho, Paer, Hinf, Xfas, Nmen, Cjej</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer between bacteria and archaea (<it>Archaeoglobus</it> and <it>Thermotoga</it> lineages); additional, independent fusions in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0108</p>
                  </c>
                  <c ca="left">
                     <p>3,4-dihydroxy-2-butanone 4-phosphate synthase</p>
                  </c>
                  <c ca="left">
                     <p>COG0807</p>
                  </c>
                  <c ca="left">
                     <p>GTP cyclohydrolase II</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Aful, Aqua, Tmar, Drad, Mtub, Bsub, Bhal, Syne, Paer, Vcho, Xfas, Nmen, Hpyl, Cjej, Ctra, Cpne</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Uncertain</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0280</p>
                  </c>
                  <c ca="left">
                     <p>Phosphotransacetylase</p>
                  </c>
                  <c ca="left">
                     <p>COG0281</p>
                  </c>
                  <c ca="left">
                     <p>Malic enzyme</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Ecol, Hinf, Xfas, Rpxx</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from bacteria to archaea (<it>Halobacterium</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0287</p>
                  </c>
                  <c ca="left">
                     <p>Prephenate dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p>COG1605</p>
                  </c>
                  <c ca="left">
                     <p>Chorismate mutase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Ecol, Vcho, Hinf</p>
                  </c>
                  <c ca="left">
                     <p>Taci, Aero, Ccre</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in archaea and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0301</p>
                  </c>
                  <c ca="left">
                     <p>ATP pyrophosphatase (thiamine biosynthesis)</p>
                  </c>
                  <c ca="left">
                     <p>COG0607</p>
                  </c>
                  <c ca="left">
                     <p>Rhodanese-related sulfurtransferase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Taci, Ecol, Vcho, Paer, Hinf</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in archaea and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0340</p>
                  </c>
                  <c ca="left">
                     <p>Biotin-(acetyl-CoA carboxylase) ligase</p>
                  </c>
                  <c ca="left">
                     <p>COG1654</p>
                  </c>
                  <c ca="left">
                     <p>Biotin operon repressor</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Paby, Drad, Bsub, Bhal, Ecol, Paer, Vcho, Xfas; (all that have COG1654)</p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from bacteria to archaea (<it>Archaeoglobus</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0351</p>
                  </c>
                  <c ca="left">
                     <p>Hydroxymethyl-pyrimidine/phospho-methylpyrimidine kinase</p>
                  </c>
                  <c ca="left">
                     <p>COG1992</p>
                  </c>
                  <c ca="left">
                     <p>Uncharacterized conserved protein</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Mjan, Pyro, Aero, Tmar</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from archaea to bacteria (<it>Thermotoga</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0468</p>
                  </c>
                  <c ca="left">
                     <p>RecA/RadA recombinase</p>
                  </c>
                  <c ca="left">
                     <p>COG1372</p>
                  </c>
                  <c ca="left">
                     <p>Intein</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Pyro, Mtub</p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in archaea and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0475</p>
                  </c>
                  <c ca="left">
                     <p>Kef-type K<sup>+</sup> transport systems, membrane component</p>
                  </c>
                  <c ca="left">
                     <p>COG1226</p>
                  </c>
                  <c ca="left">
                     <p>Kef-type K<sup>+</sup> transport systems, NAD-binding component</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mthe, Ecol, Paer, Hinf, Xfas, Nmen, Cjej, Rpxx</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from bacteria to archaea (<it>Methanobacterium</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0550</p>
                  </c>
                  <c ca="left">
                     <p>Topoisomerase IA</p>
                  </c>
                  <c ca="left">
                     <p>COG0551</p>
                  </c>
                  <c ca="left">
                     <p>Zn-finger domain associated with topoisomerase type IA</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>AF</p>
                  </c>
                  <c ca="left">
                     <p>Most bacteria and archaea</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One ancestral fusion with subsequent fission in Aper, Aqua</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0558</p>
                  </c>
                  <c ca="left">
                     <p>Phosphatidyl-glycerophosphate synthase</p>
                  </c>
                  <c ca="left">
                     <p>COG1213</p>
                  </c>
                  <c ca="left">
                     <p>Predicted sugar nucleotidyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Pyro, Aqua</p>
                  </c>
                  <c ca="left">
                     <p>Aero</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from archaea to bacteria (AquIFEx)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0560</p>
                  </c>
                  <c ca="left">
                     <p>Phosphoserine phosphatase</p>
                  </c>
                  <c ca="left">
                     <p>COG2716</p>
                  </c>
                  <c ca="left">
                     <p>ACT-domain-containing protein</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Aful, Mtub, Paer</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Uncertain</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0649</p>
                  </c>
                  <c ca="left">
                     <p>NADH:ubiquinone oxidoreductase subunit 7</p>
                  </c>
                  <c ca="left">
                     <p>COG0852</p>
                  </c>
                  <c ca="left">
                     <p>NADH:ubiquinone oxidoreductase 27 kD subunit</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Aqua, Ecol, Paer</p>
                  </c>
                  <c ca="left">
                     <p>Most archaea and bacteria</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from bacteria to archaea (<it>Halobacterium</it>)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0662</p>
                  </c>
                  <c ca="left">
                     <p>Mannose-6-phosphate isomerase</p>
                  </c>
                  <c ca="left">
                     <p>COG0836</p>
                  </c>
                  <c ca="left">
                     <p>Mannose-1-phosphate guanylyltransferase</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Pyro, Aqua, Ecol, Paer, Vcho, Xfas, Hpyl, Cjej</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer from bacteria to archaea; a second, independent fusion event in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0674</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG1014</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Hbsp, Taci, Aero, Mtub, Bhal, Syne, Ecol, Vcho, Tpal</p>
                  </c>
                  <c ca="left">
                     <p>Mjan, Mthe, Aqua, Tmar, Hpyl, Cjej</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer from archaea to bacteria; a second, independent fusion event in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0777</p>
                  </c>
                  <c ca="left">
                     <p>Acetyl-CoA carboxylase &#946; subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG0825</p>
                  </c>
                  <c ca="left">
                     <p>Acetyl-CoA carboxylase &#945; subunit</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Hbsp, Pyro, Tmar, Drad, Mtub, Bsub, Bhal, Paer, Rpxx</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer from bacteria to archaea; a second, independent fusion event in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1013</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG1014</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Mthe, Syne, Ecol, Vcho, Tpal</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Taci, Aero, Mtub, Bhal</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in archaea and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1112</p>
                  </c>
                  <c ca="left">
                     <p>Superfamily I DNA and RNA helicases and helicase subunits</p>
                  </c>
                  <c ca="left">
                     <p>COG2251</p>
                  </c>
                  <c ca="left">
                     <p>Predicted metal-binding domain</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>IFE</p>
                  </c>
                  <c ca="left">
                     <p>Pyro, Mtub</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>Independent fusion events in archaea and bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1239</p>
                  </c>
                  <c ca="left">
                     <p>Mg-chelatase subunit ChlI</p>
                  </c>
                  <c ca="left">
                     <p>COG1240</p>
                  </c>
                  <c ca="left">
                     <p>Mg-chelatase subunit ChlD</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Mthe, Taci, Mtub, Syne</p>
                  </c>
                  <c ca="left">
                     <p>Mjan, Paer</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer between bacteria and archaea, with subsequent fissions</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1361</p>
                  </c>
                  <c ca="left">
                     <p>S-layer domain</p>
                  </c>
                  <c ca="left">
                     <p>COG1470</p>
                  </c>
                  <c ca="left">
                     <p>Predicted membrane protein</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Pyro, Bhal</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from archaea to bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1387</p>
                  </c>
                  <c ca="left">
                     <p>Histidinol phosphatase and related hydrolases of the PHP family</p>
                  </c>
                  <c ca="left">
                     <p>COG1796</p>
                  </c>
                  <c ca="left">
                     <p>DNA polymerase IV (family X)</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mthe, Taci, Drad, Bsub, Bhal; (all prokaryotes that have COG1796)</p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between archaea to bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1683</p>
                  </c>
                  <c ca="left">
                     <p>Uncharacterized conserved protein</p>
                  </c>
                  <c ca="left">
                     <p>COG3272</p>
                  </c>
                  <c ca="left">
                     <p>Uncharacterized conserved protein</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Mthe, Paer, Vcho</p>
                  </c>
                  <c ca="left">
                     <p>-</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer between archaea and bacteria (<it>Methanobacterium</it> and <it>Vibrio</it>/<it>Pseudomonas</it>, respectively)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG1788</p>
                  </c>
                  <c ca="left">
                     <p>Acyl-CoA:acetate CoA transferase alpha subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG2057</p>
                  </c>
                  <c ca="left">
                     <p>Acyl-CoA:acetate CoA transferase beta subunit</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Taci, Aero, Drad, Bhal, Ecol</p>
                  </c>
                  <c ca="left">
                     <p>Mtub, Bsub, Paer, Hinf, Hpyl</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer between bacteria and archaea; a second, independent fusion event in bacteria</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG3261</p>
                  </c>
                  <c ca="left">
                     <p>Ni, Fe-hydrogenase III large subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG3262</p>
                  </c>
                  <c ca="left">
                     <p>Ni, Fe-hydrogenase III component G</p>
                  </c>
                  <c ca="left">
                     <p>ab-</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Paby, Mtub, Ecol</p>
                  </c>
                  <c ca="left">
                     <p>Pyro</p>
                  </c>
                  <c ca="left">
                     <p>One fusion event, fused gene transfer from bacteria to archaea</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0518</p>
                  </c>
                  <c ca="left">
                     <p>GMP synthase - Glutamine amidotransferase domain</p>
                  </c>
                  <c ca="left">
                     <p>COG0519</p>
                  </c>
                  <c ca="left">
                     <p>GMP synthase-PP-ATPase domain</p>
                  </c>
                  <c ca="left">
                     <p>abe</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aero, Scer, most bacteria</p>
                  </c>
                  <c ca="left">
                     <p>Mthe, Pyro, Paby</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer among bacteria, archaea, and eukaryotes</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>COG0674</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit</p>
                  </c>
                  <c ca="left">
                     <p>COG1013</p>
                  </c>
                  <c ca="left">
                     <p>Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit</p>
                  </c>
                  <c ca="left">
                     <p>abe</p>
                  </c>
                  <c ca="left">
                     <p>HGT</p>
                  </c>
                  <c ca="left">
                     <p>Aful, Mthe, Taci, Pyro, Paby, Scer, Syne, Ecol, Vcho, Cjej, Tpal</p>
                  </c>
                  <c ca="left">
                     <p>Hbsp, Mjan, Aero, Aqua, Tmar, Mtub, Hpyl</p>
                  </c>
                  <c ca="left">
                     <p>Fused gene transfer from archaea to bacteria (&#945;-proteobacteria)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>*</sup>Abbreviations: a, archaea, b, bacteria, e, eukaryotes; a dash indicates that the given kingdom is not represented in at least one of the fusion-linked COGs. <sup>&#8224;</sup>AF, ancestral fusion, HGT, horizontal gene transfer, IFE, independent fusion events. <sup>&#8225;</sup> In several cases, the indicated genes are separated by one to three genes or their order is switched compared to that of the fusion components. <sup>&#167;</sup>Paby, <it>Pyrococcus abyssi</it>, an archaeal genome not included in the master set of genomes analyzed in this study. <sup>&#182;</sup>Llac, <it>Lactococcus lactis</it>, a bacterial genome not included in the master set of genomes analyzed in this study. <sup>||</sup>Ccre, <it>Caulobacter crescentus</it>, a bacterial genome not included in the master set of genomes analyzed in this study. <sup>&#165;</sup>Hbsp, <it>Halobacterium</it> sp., an archaeal genome not included in the master set of genomes analyzed in this study.</p>
            </tblfn>
         </tbl>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Summary of evolutionary scenarios for cross-kingdom gene fusions</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="left">
                     <p>Evolutionary mode<sup>*</sup></p>
                  </c>
                  <c ca="center">
                     <p>Number of fusion-linked COG pairs</p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cross-kingdom horizontal transfer of a fused gene</p>
                  </c>
                  <c ca="center">
                     <p>31</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Independent fusion events</p>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Ancestral fusion</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Uncertain</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Total</p>
                  </c>
                  <c ca="center">
                     <p>51</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>*</sup>As indicated in Table <tblr tid="T2">2</tblr>, the evolutionary scenarios for some of the analyzed COGs included both cross-kingdom horizontal transfer and apparent independent gene fusion within one of the kingdoms.</p>
            </tblfn>
         </tbl>
         <p>Examination of the genomic context of the genes that encode stand-alone counterparts of the fusion components showed that, in 25 of the 51 cases, these genes were juxtaposed in some, and in certain cases, many prokaryotic genomes (Table <tblr tid="T2">2</tblr>). This suggests that evolution of gene fusions often, if not always, passes through an intermediate stage of juxtaposed and co-regulated, but still distinct, genes within known or predicted operons. In addition, some of the juxtaposed gene pairs might have evolved by fission of a fused gene.</p>
         <p>The results of the present analysis point to HGT as a major route of cross-kingdom dissemination of fused genes. Horizontal transfer might be even more prominent in the evolution of fused genes within the bacterial and archaeal kingdoms. This notion is supported by the topologies of some of the phylogenetic trees analyzed, which show unexpected clustering of bacterial species from different lineages (note, for example, the grouping of <it>D. radiodurans</it> with <it>P. aeruginosa</it> in Figure <figr fid="F5">5</figr>). Massive HGT between archaea and bacteria, particularly hyperthermophiles, has been suggested by genome comparisons [<abbr bid="B20">20</abbr>,<abbr bid="B21">21</abbr>,<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>,<abbr bid="B24">24</abbr>]. However, proving HGT in each individual case is difficult, and the significance of cross-kingdom HGT has been disputed [<abbr bid="B25">25</abbr>,<abbr bid="B26">26</abbr>]. With gene fusions, the existence of a derived shared character (fusion) supporting the clades formed by fusion components and the concordance of the independently built trees for each of the fusion components make a solid case for HGT.</p>
         <p>The apparent independent fusion of the same pair of genes (or, more precisely, members of the same two COGs) on multiple occasions during evolution might seem unlikely. However, we found that one-fourth to one-third of the gene fusions shared by at least two kingdoms might have evolved through such independent events, and probable additional independent fusions were noted among bacteria. This could be due to the extensive genome rearrangement characteristic of the evolution of prokaryotes [<abbr bid="B27">27</abbr>,<abbr bid="B28">28</abbr>], and to the selective value of these particular fusions, which tend to get fixed once they emerge.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <p>The version of the COG database used in this study included the following complete prokaryotic genomes. Bacteria: Aae, <it>Aquifex aeolicus;</it> Bap, <it>Buchnera aphidicola;</it> Bbu, <it>Borrelia burgdorferi;</it> Bsu, <it>Bacillus subtilis;</it> Bhal, <it>Bacillus halodurans;</it> Cje, <it>Campylobacter jejuni;</it> Cpn, <it>Chlamydophila pneumoniae;</it> Ctr, <it>Chlamydia trachomatis;</it> Dra, <it>Deinococcus radiodurans;</it> Eco, <it>Escherichia coli;</it> Hin, <it>Haemophilus influenzae;</it> Hpy, <it>Helicobacter pylori;</it> Mge, <it>Mycoplasma genitalium;</it> Mpn, <it>Mycoplasma pneumoniae;</it> Mtu, <it>Mycobacterium tuberculosis;</it> Nme, <it>Neisseria meningitidis;</it> Pae, <it>Pseudomonas aeruginosa;</it> Rpr, <it>Rickettsia prowazekii;</it> Syn, <it>Synechocystis</it> sp.; Tma, <it>Thermotoga maritima;</it> Tpa, <it>Treponema pallidum;</it> Vch, <it>Vibrio cholerae;</it> Xfa, <it>Xylella fastidiosa</it>. Eukaryote: Sce, <it>Saccharomyces cerevisiae</it>. Archaea: Ape, <it>Aeropyrum pernix;</it> Afu, <it>Archaeoglobus fulgidus;</it> Hbs, <it>Halobacterium</it> sp.; Mja, <it>Methanococcus jannaschii;</it> Mth, <it>Methanobacterium thermoautotrophicum;</it> Pho, <it>Pyrococcus horikoshii;</it> Pab, <it>Pyrococcus abyssi;</it> Tac, <it>Thermoplasma acidophilum</it>.</p>
         <p>COGs containing fusion components from at least two of the three primary kingdoms, were selected for phylogenetic analysis. COGs containing 60 or more members were excluded because of potential uncertainty of orthologous relationship between members of such large groups [<abbr bid="B18">18</abbr>]. Multiple alignments were generated for each analyzed COG using the T-Coffee program [<abbr bid="B29">29</abbr>].</p>
         <p>Phylogenetic trees were constructed by first generating a distance matrix using the PROTDIST program and the Dayhoff PAM model for amino-acid substitutions and employing this matrix for minimum evolution (least-square) tree building [<abbr bid="B30">30</abbr>] using the FITCH program. The PROTDIST and FITCH programs are modules of the PHYLIP software package [<abbr bid="B31">31</abbr>]. The tree topology was then optimized by local rearrangements using PROTML, a maximum likelihood tree-building program, included in the MOLPHY package [<abbr bid="B32">32</abbr>]. Local bootstrap probability was estimated for each internal branch by using the resampling of estimated log-likelihoods (RELL) method with 10,000 bootstrap replications [<abbr bid="B33">33</abbr>]. The gene order in prokaryotic genomes was examined using the 'Genomic context' feature of the COG database.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>Phylogenetic trees for 84 individual COGs presented as 52 pairs of <it>trans</it>-kingdom fusion-linked COGs are available. Bootstrap values (percentage of 1,000 replications) are indicated for each fork. Archaeal proteins are designated by black squares, bacterial proteins by gray squares and eukaryotic proteins by empty squares. Fusion components are denoted by _1, _2, _3, etc. Pylogenetic trees are avaliabel as PDF files for the following individual COGs:</p>
         <p>See Table 2 for more details of individual COGs</p>
         <p>
            <supplr sid="S1">COG0025</supplr>
         </p>
         <p>
            <supplr sid="S2">COG0046</supplr>
         </p>
         <p>
            <supplr sid="S3">COG0047</supplr>
         </p>
         <p>
            <supplr sid="S4">COG0062</supplr>
         </p>
         <p>
            <supplr sid="S5">COG0063</supplr>
         </p>
         <p>
            <supplr sid="S6">COG0067</supplr>
         </p>
         <p>
            <supplr sid="S7">COG0069</supplr>
         </p>
         <p>
            <supplr sid="S8">COG0070</supplr>
         </p>
         <p>
            <supplr sid="S9">COG0077</supplr>
         </p>
         <p>
            <supplr sid="S10">COG0108</supplr>
         </p>
         <p>
            <supplr sid="S11">COG0139</supplr>
         </p>
         <p>
            <supplr sid="S12">COG0140</supplr>
         </p>
         <p>
            <supplr sid="S13">COG0145</supplr>
         </p>
         <p>
            <supplr sid="S14">COG0146</supplr>
         </p>
         <p>
            <supplr sid="S15">COG0147</supplr>
         </p>
         <p>
            <supplr sid="S16">COG0169</supplr>
         </p>
         <p>
            <supplr sid="S17">COG0280</supplr>
         </p>
         <p>
            <supplr sid="S18">COG0281</supplr>
         </p>
         <p>
            <supplr sid="S19">COG0287</supplr>
         </p>
         <p>
            <supplr sid="S20">COG0294</supplr>
         </p>
         <p>
            <supplr sid="S21">COG0301</supplr>
         </p>
         <p>
            <supplr sid="S22">COG0304</supplr>
         </p>
         <p>
            <supplr sid="S23">COG0331</supplr>
         </p>
         <p>
            <supplr sid="S24">COG0337</supplr>
         </p>
         <p>
            <supplr sid="S25">COG0340</supplr>
         </p>
         <p>
            <supplr sid="S26">COG0351</supplr>
         </p>
         <p>
            <supplr sid="S27">COG0403</supplr>
         </p>
         <p>
            <supplr sid="S28">COG0439</supplr>
         </p>
         <p>
            <supplr sid="S29">COG0468</supplr>
         </p>
         <p>
            <supplr sid="S30">COG0475</supplr>
         </p>
         <p>
            <supplr sid="S31">COG0476</supplr>
         </p>
         <p>
            <supplr sid="S32">COG0511</supplr>
         </p>
         <p>
            <supplr sid="S33">COG0512</supplr>
         </p>
         <p>
            <supplr sid="S34">COG0518</supplr>
         </p>
         <p>
            <supplr sid="S35">COG0519</supplr>
         </p>
         <p>
            <supplr sid="S36">COG0550</supplr>
         </p>
         <p>
            <supplr sid="S37">COG0551</supplr>
         </p>
         <p>
            <supplr sid="S38">COG0558</supplr>
         </p>
         <p>
            <supplr sid="S39">COG0560</supplr>
         </p>
         <p>
            <supplr sid="S40">COG0569</supplr>
         </p>
         <p>
            <supplr sid="S41">COG0607</supplr>
         </p>
         <p>
            <supplr sid="S42">COG0649</supplr>
         </p>
         <p>
            <supplr sid="S43">COG0662</supplr>
         </p>
         <p>
            <supplr sid="S44">COG0664</supplr>
         </p>
         <p>
            <supplr sid="S45">COG0674</supplr>
         </p>
         <p>
            <supplr sid="S46">COG0703</supplr>
         </p>
         <p>
            <supplr sid="S47">COG0710</supplr>
         </p>
         <p>
            <supplr sid="S48">COG0777</supplr>
         </p>
         <p>
            <supplr sid="S49">COG0801</supplr>
         </p>
         <p>
            <supplr sid="S50">COG0807</supplr>
         </p>
         <p>
            <supplr sid="S51">COG0825</supplr>
         </p>
         <p>
            <supplr sid="S52">COG0836</supplr>
         </p>
         <p>
            <supplr sid="S53">COG0852</supplr>
         </p>
         <p>
            <supplr sid="S54">COG1003</supplr>
         </p>
         <p>
            <supplr sid="S55">COG1013</supplr>
         </p>
         <p>
            <supplr sid="S56">COG1014</supplr>
         </p>
         <p>
            <supplr sid="S57">COG1037</supplr>
         </p>
         <p>
            <supplr sid="S58">COG1038</supplr>
         </p>
         <p>
            <supplr sid="S59">COG1112</supplr>
         </p>
         <p>
            <supplr sid="S60">COG1155</supplr>
         </p>
         <p>
            <supplr sid="S61">COG1213</supplr>
         </p>
         <p>
            <supplr sid="S62">COG1226</supplr>
         </p>
         <p>
            <supplr sid="S63">COG1239</supplr>
         </p>
         <p>
            <supplr sid="S64">COG1240</supplr>
         </p>
         <p>
            <supplr sid="S65">COG1361</supplr>
         </p>
         <p>
            <supplr sid="S66">COG1372</supplr>
         </p>
         <p>
            <supplr sid="S67">COG1387</supplr>
         </p>
         <p>
            <supplr sid="S68">COG1470</supplr>
         </p>
         <p>
            <supplr sid="S69">COG1605</supplr>
         </p>
         <p>
            <supplr sid="S70">COG1654</supplr>
         </p>
         <p>
            <supplr sid="S71">COG1683</supplr>
         </p>
         <p>
            <supplr sid="S72">COG1752</supplr>
         </p>
         <p>
            <supplr sid="S73">COG1788</supplr>
         </p>
         <p>
            <supplr sid="S74">COG1796</supplr>
         </p>
         <p>
            <supplr sid="S75">COG1984</supplr>
         </p>
         <p>
            <supplr sid="S76">COG1992</supplr>
         </p>
         <p>
            <supplr sid="S77">COG2030</supplr>
         </p>
         <p>
            <supplr sid="S78">COG2049</supplr>
         </p>
         <p>
            <supplr sid="S79">COG2057</supplr>
         </p>
         <p>
            <supplr sid="S80">COG2251</supplr>
         </p>
         <p>
            <supplr sid="S81">COG2716</supplr>
         </p>
         <p>
            <supplr sid="S82">COG3261</supplr>
         </p>
         <p>
            <supplr sid="S83">COG3262</supplr>
         </p>
         <p>
            <supplr sid="S84">COG3272</supplr>
         </p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>COG0025</p>
            </caption>
            <text>
               <p>COG0025</p>
            </text>
            <file name="gb-2002-3-5-research0024-S1.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>COG0046</p>
            </caption>
            <text>
               <p>COG0046</p>
            </text>
            <file name="gb-2002-3-5-research0024-S2.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>COG0047</p>
            </caption>
            <text>
               <p>COG0047</p>
            </text>
            <file name="gb-2002-3-5-research0024-S3.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>COG0062</p>
            </caption>
            <text>
               <p>COG0062</p>
            </text>
            <file name="gb-2002-3-5-research0024-S4.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>COG0063</p>
            </caption>
            <text>
               <p>COG0063</p>
            </text>
            <file name="gb-2002-3-5-research0024-S5.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>COG0067</p>
            </caption>
            <text>
               <p>COG0067</p>
            </text>
            <file name="gb-2002-3-5-research0024-S6.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>COG0069</p>
            </caption>
            <text>
               <p>COG0069</p>
            </text>
            <file name="gb-2002-3-5-research0024-S7.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>COG0070</p>
            </caption>
            <text>
               <p>COG0070</p>
            </text>
            <file name="gb-2002-3-5-research0024-S8.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>COG0077</p>
            </caption>
            <text>
               <p>COG0077</p>
            </text>
            <file name="gb-2002-3-5-research0024-S9.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S10">
            <title>
               <p>Additional data file 10</p>
            </title>
            <caption>
               <p>COG0108</p>
            </caption>
            <text>
               <p>COG0108</p>
            </text>
            <file name="gb-2002-3-5-research0024-S10.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S11">
            <title>
               <p>Additional data file 11</p>
            </title>
            <caption>
               <p>COG0139</p>
            </caption>
            <text>
               <p>COG0139</p>
            </text>
            <file name="gb-2002-3-5-research0024-S11.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S12">
            <title>
               <p>Additional data file 13</p>
            </title>
            <caption>
               <p>COG0140</p>
            </caption>
            <text>
               <p>COG0140</p>
            </text>
            <file name="gb-2002-3-5-research0024-S12.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S13">
            <title>
               <p>Additional data file 13</p>
            </title>
            <caption>
               <p>COG0145</p>
            </caption>
            <text>
               <p>COG0145</p>
            </text>
            <file name="gb-2002-3-5-research0024-S13.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S14">
            <title>
               <p>COG0146</p>
            </title>
            <caption>
               <p>COG0146</p>
            </caption>
            <text>
               <p>cdf2psc: converts a .cdf file into a .psc file.</p>
            </text>
            <file name="gb-2002-3-5-research0024-S14.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S15">
            <title>
               <p>Additional data file 15</p>
            </title>
            <caption>
               <p>COG0147</p>
            </caption>
            <text>
               <p>COG0147</p>
            </text>
            <file name="gb-2002-3-5-research0024-S15.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S16">
            <title>
               <p>Additional data file 16</p>
            </title>
            <caption>
               <p>COG0169</p>
            </caption>
            <text>
               <p>COG0169</p>
            </text>
            <file name="gb-2002-3-5-research0024-S16.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S17">
            <title>
               <p>Additional data file 17</p>
            </title>
            <caption>
               <p>COG0280</p>
            </caption>
            <text>
               <p>COG0280</p>
            </text>
            <file name="gb-2002-3-5-research0024-S17.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S18">
            <title>
               <p>Additional data file 18</p>
            </title>
            <caption>
               <p>COG0281</p>
            </caption>
            <text>
               <p>COG0281</p>
            </text>
            <file name="gb-2002-3-5-research0024-S18.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S19">
            <title>
               <p>Additional data file 19</p>
            </title>
            <caption>
               <p>COG0287</p>
            </caption>
            <text>
               <p>COG0287</p>
            </text>
            <file name="gb-2002-3-5-research0024-S19.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S20">
            <title>
               <p>Additional data file 20</p>
            </title>
            <caption>
               <p>COG0294</p>
            </caption>
            <text>
               <p>COG0294</p>
            </text>
            <file name="gb-2002-3-5-research0024-S20.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S21">
            <title>
               <p>Additional data file 21</p>
            </title>
            <caption>
               <p>COG0301</p>
            </caption>
            <text>
               <p>COG0301</p>
            </text>
            <file name="gb-2002-3-5-research0024-S21.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S22">
            <title>
               <p>Additional data file 22</p>
            </title>
            <caption>
               <p>COG0304</p>
            </caption>
            <text>
               <p>COG0304</p>
            </text>
            <file name="gb-2002-3-5-research0024-S22.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S23">
            <title>
               <p>Additional data file 23</p>
            </title>
            <caption>
               <p>COG0331</p>
            </caption>
            <text>
               <p>COG0331</p>
            </text>
            <file name="gb-2002-3-5-research0024-S23.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S24">
            <title>
               <p>Additional data file 24</p>
            </title>
            <caption>
               <p>COG0337</p>
            </caption>
            <text>
               <p>COG0337</p>
            </text>
            <file name="gb-2002-3-5-research0024-S24.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S25">
            <title>
               <p>Additional data file 25</p>
            </title>
            <caption>
               <p>COG0340</p>
            </caption>
            <text>
               <p>COG0340</p>
            </text>
            <file name="gb-2002-3-5-research0024-S25.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S26">
            <title>
               <p>Additional data file 26</p>
            </title>
            <caption>
               <p>COG0351</p>
            </caption>
            <text>
               <p>COG0351</p>
            </text>
            <file name="gb-2002-3-5-research0024-S26.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S27">
            <title>
               <p>Additional data file 27</p>
            </title>
            <caption>
               <p>COG0403</p>
            </caption>
            <text>
               <p>COG0403</p>
            </text>
            <file name="gb-2002-3-5-research0024-S27.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S28">
            <title>
               <p>Additional data file 28</p>
            </title>
            <caption>
               <p>COG0439</p>
            </caption>
            <text>
               <p>COG0439</p>
            </text>
            <file name="gb-2002-3-5-research0024-S28.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S29">
            <title>
               <p>Additional data file 29</p>
            </title>
            <caption>
               <p>COG0468</p>
            </caption>
            <text>
               <p>COG0468</p>
            </text>
            <file name="gb-2002-3-5-research0024-S29.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S30">
            <title>
               <p>Additional data file 30</p>
            </title>
            <caption>
               <p>COG0475</p>
            </caption>
            <text>
               <p>COG0475</p>
            </text>
            <file name="gb-2002-3-5-research0024-S30.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S31">
            <title>
               <p>Additional data file 31</p>
            </title>
            <caption>
               <p>COG0476</p>
            </caption>
            <text>
               <p>COG0476</p>
            </text>
            <file name="gb-2002-3-5-research0024-S31.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S32">
            <title>
               <p>Additional data file 32</p>
            </title>
            <caption>
               <p>COG0511</p>
            </caption>
            <text>
               <p>COG0511</p>
            </text>
            <file name="gb-2002-3-5-research0024-S32.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S33">
            <title>
               <p>Additional data file 33</p>
            </title>
            <caption>
               <p>COG0512</p>
            </caption>
            <text>
               <p>COG0512</p>
            </text>
            <file name="gb-2002-3-5-research0024-S33.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S34">
            <title>
               <p>Additional data file 34</p>
            </title>
            <caption>
               <p>COG0518</p>
            </caption>
            <text>
               <p>COG0518</p>
            </text>
            <file name="gb-2002-3-5-research0024-S34.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S35">
            <title>
               <p>Additional data file 35</p>
            </title>
            <caption>
               <p>COG0519</p>
            </caption>
            <text>
               <p>COG0519</p>
            </text>
            <file name="gb-2002-3-5-research0024-S35.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S36">
            <title>
               <p>Additional data file 36</p>
            </title>
            <caption>
               <p>COG0550</p>
            </caption>
            <text>
               <p>COG0550</p>
            </text>
            <file name="gb-2002-3-5-research0024-S36.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S37">
            <title>
               <p>Additional data file 37</p>
            </title>
            <caption>
               <p>COG0551</p>
            </caption>
            <text>
               <p>COG0551</p>
            </text>
            <file name="gb-2002-3-5-research0024-S37.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S38">
            <title>
               <p>Additional data file 38</p>
            </title>
            <caption>
               <p>COG0558</p>
            </caption>
            <text>
               <p>COG0558</p>
            </text>
            <file name="gb-2002-3-5-research0024-S38.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S39">
            <title>
               <p>Additional data file 39</p>
            </title>
            <caption>
               <p>COG0560</p>
            </caption>
            <text>
               <p>COG0560</p>
            </text>
            <file name="gb-2002-3-5-research0024-S39.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S40">
            <title>
               <p>Additional data file 40</p>
            </title>
            <caption>
               <p>COG0569</p>
            </caption>
            <text>
               <p>COG0569</p>
            </text>
            <file name="gb-2002-3-5-research0024-S40.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S41">
            <title>
               <p>Additional data file 41</p>
            </title>
            <caption>
               <p>COG0607</p>
            </caption>
            <text>
               <p>COG0607</p>
            </text>
            <file name="gb-2002-3-5-research0024-S41.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S42">
            <title>
               <p>Additional data file 42</p>
            </title>
            <caption>
               <p>COG0649</p>
            </caption>
            <text>
               <p>COG0649</p>
            </text>
            <file name="gb-2002-3-5-research0024-S42.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S43">
            <title>
               <p>Additional data file 43</p>
            </title>
            <caption>
               <p>COG0662</p>
            </caption>
            <text>
               <p>COG0662</p>
            </text>
            <file name="gb-2002-3-5-research0024-S43.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S44">
            <title>
               <p>Additional data file 44</p>
            </title>
            <caption>
               <p>COG0664</p>
            </caption>
            <text>
               <p>COG0664</p>
            </text>
            <file name="gb-2002-3-5-research0024-S44.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S45">
            <title>
               <p>Additional data file 45</p>
            </title>
            <caption>
               <p>COG0674</p>
            </caption>
            <text>
               <p>COG0674</p>
            </text>
            <file name="gb-2002-3-5-research0024-S45.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S46">
            <title>
               <p>Additional data file 46</p>
            </title>
            <caption>
               <p>COG0703</p>
            </caption>
            <text>
               <p>COG0703</p>
            </text>
            <file name="gb-2002-3-5-research0024-S46.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S47">
            <title>
               <p>Additional data file 47</p>
            </title>
            <caption>
               <p>COG0710</p>
            </caption>
            <text>
               <p>COG0710.</p>
            </text>
            <file name="gb-2002-3-5-research0024-S47.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S48">
            <title>
               <p>Additional data file 48</p>
            </title>
            <caption>
               <p>COG0777</p>
            </caption>
            <text>
               <p>COG0777</p>
            </text>
            <file name="gb-2002-3-5-research0024-S48.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S49">
            <title>
               <p>Additional data file 49</p>
            </title>
            <caption>
               <p>COG0801</p>
            </caption>
            <text>
               <p>COG0801</p>
            </text>
            <file name="gb-2002-3-5-research0024-S49.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S50">
            <title>
               <p>Additional data file 50</p>
            </title>
            <caption>
               <p>COG0807</p>
            </caption>
            <text>
               <p>COG0807</p>
            </text>
            <file name="gb-2002-3-5-research0024-S50.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S51">
            <title>
               <p>Additional data file 51</p>
            </title>
            <caption>
               <p>COG0825</p>
            </caption>
            <text>
               <p>COG0825</p>
            </text>
            <file name="gb-2002-3-5-research0024-S51.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S52">
            <title>
               <p>Additional data file 52</p>
            </title>
            <caption>
               <p>COG0836</p>
            </caption>
            <text>
               <p>COG0836</p>
            </text>
            <file name="gb-2002-3-5-research0024-S52.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S53">
            <title>
               <p>Additional data file 53</p>
            </title>
            <caption>
               <p>COG0852</p>
            </caption>
            <text>
               <p>COG0852</p>
            </text>
            <file name="gb-2002-3-5-research0024-S53.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S54">
            <title>
               <p>Additional data file 54</p>
            </title>
            <caption>
               <p>COG1003</p>
            </caption>
            <text>
               <p>COG1003</p>
            </text>
            <file name="gb-2002-3-5-research0024-S54.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S55">
            <title>
               <p>Additional data file 55</p>
            </title>
            <caption>
               <p>COG1013</p>
            </caption>
            <text>
               <p>COG1013</p>
            </text>
            <file name="gb-2002-3-5-research0024-S55.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S56">
            <title>
               <p>Additional data file 56</p>
            </title>
            <caption>
               <p>COG1014</p>
            </caption>
            <text>
               <p>COG1014</p>
            </text>
            <file name="gb-2002-3-5-research0024-S56.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S57">
            <title>
               <p>Additional data file 57</p>
            </title>
            <caption>
               <p>COG1037</p>
            </caption>
            <text>
               <p>COG1037</p>
            </text>
            <file name="gb-2002-3-5-research0024-S57.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S58">
            <title>
               <p>Additional data file 58</p>
            </title>
            <caption>
               <p>COG1038</p>
            </caption>
            <text>
               <p>COG1038</p>
            </text>
            <file name="gb-2002-3-5-research0024-S58.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S59">
            <title>
               <p>Additional data file 59</p>
            </title>
            <caption>
               <p>COG1112</p>
            </caption>
            <text>
               <p>COG1112</p>
            </text>
            <file name="gb-2002-3-5-research0024-S59.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S60">
            <title>
               <p>Additional data file 60</p>
            </title>
            <caption>
               <p>COG1155</p>
            </caption>
            <text>
               <p>COG1155</p>
            </text>
            <file name="gb-2002-3-5-research0024-S60.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S61">
            <title>
               <p>Additional data file 61</p>
            </title>
            <caption>
               <p>COG1213</p>
            </caption>
            <text>
               <p>COG1213</p>
            </text>
            <file name="gb-2002-3-5-research0024-S61.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S62">
            <title>
               <p>Additional data file 62</p>
            </title>
            <caption>
               <p>COG1226</p>
            </caption>
            <text>
               <p>COG1226</p>
            </text>
            <file name="gb-2002-3-5-research0024-S62.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S63">
            <title>
               <p>Additional data file 63</p>
            </title>
            <caption>
               <p>COG1239</p>
            </caption>
            <text>
               <p>COG1239</p>
            </text>
            <file name="gb-2002-3-5-research0024-S63.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S64">
            <title>
               <p>Additional data file 64</p>
            </title>
            <caption>
               <p>COG1240</p>
            </caption>
            <text>
               <p>COG1240</p>
            </text>
            <file name="gb-2002-3-5-research0024-S64.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S65">
            <title>
               <p>Additional data file 65</p>
            </title>
            <caption>
               <p>COG1361</p>
            </caption>
            <text>
               <p>COG1361</p>
            </text>
            <file name="gb-2002-3-5-research0024-S65.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S66">
            <title>
               <p>Additional data file 66</p>
            </title>
            <caption>
               <p>COG1372</p>
            </caption>
            <text>
               <p>COG1372</p>
            </text>
            <file name="gb-2002-3-5-research0024-S66.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S67">
            <title>
               <p>Additional data file 67</p>
            </title>
            <caption>
               <p>COG1387</p>
            </caption>
            <text>
               <p>COG1387</p>
            </text>
            <file name="gb-2002-3-5-research0024-S67.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S68">
            <title>
               <p>Additional data file 68</p>
            </title>
            <caption>
               <p>COG1470</p>
            </caption>
            <text>
               <p>COG1470</p>
            </text>
            <file name="gb-2002-3-5-research0024-S68.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S69">
            <title>
               <p>Additional data file 69</p>
            </title>
            <caption>
               <p>COG1605</p>
            </caption>
            <text>
               <p>COG1605</p>
            </text>
            <file name="gb-2002-3-5-research0024-S69.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S70">
            <title>
               <p>Additional data file 70</p>
            </title>
            <caption>
               <p>COG1654</p>
            </caption>
            <text>
               <p>COG1654</p>
            </text>
            <file name="gb-2002-3-5-research0024-S70.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S71">
            <title>
               <p>Additional data file 71</p>
            </title>
            <caption>
               <p>COG1683</p>
            </caption>
            <text>
               <p>COG1683</p>
            </text>
            <file name="gb-2002-3-5-research0024-S71.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S72">
            <title>
               <p>Additional data file 72</p>
            </title>
            <caption>
               <p>COG1752</p>
            </caption>
            <text>
               <p>COG1752</p>
            </text>
            <file name="gb-2002-3-5-research0024-S72.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S73">
            <title>
               <p>Additional data file 73</p>
            </title>
            <caption>
               <p>COG1788</p>
            </caption>
            <text>
               <p>COG1788</p>
            </text>
            <file name="gb-2002-3-5-research0024-S73.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S74">
            <title>
               <p>Additional data file 74</p>
            </title>
            <caption>
               <p>COG1796</p>
            </caption>
            <text>
               <p>COG1796</p>
            </text>
            <file name="gb-2002-3-5-research0024-S74.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S75">
            <title>
               <p>Additional data file 75</p>
            </title>
            <caption>
               <p>COG1984</p>
            </caption>
            <text>
               <p>COG1984</p>
            </text>
            <file name="gb-2002-3-5-research0024-S75.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S76">
            <title>
               <p>Additional data file 76</p>
            </title>
            <caption>
               <p>COG1992</p>
            </caption>
            <text>
               <p>COG1992</p>
            </text>
            <file name="gb-2002-3-5-research0024-S76.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S77">
            <title>
               <p>Additional data file 77</p>
            </title>
            <caption>
               <p>COG2030</p>
            </caption>
            <text>
               <p>COG2030</p>
            </text>
            <file name="gb-2002-3-5-research0024-S77.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S78">
            <title>
               <p>Additional data file 78</p>
            </title>
            <caption>
               <p>COG2049</p>
            </caption>
            <text>
               <p>COG2049</p>
            </text>
            <file name="gb-2002-3-5-research0024-S78.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S79">
            <title>
               <p>Additional data file 79</p>
            </title>
            <caption>
               <p>COG2057</p>
            </caption>
            <text>
               <p>COG2057</p>
            </text>
            <file name="gb-2002-3-5-research0024-S79.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S80">
            <title>
               <p>Additional data file 80</p>
            </title>
            <caption>
               <p>COG2251</p>
            </caption>
            <text>
               <p>COG2251</p>
            </text>
            <file name="gb-2002-3-5-research0024-S80.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S81">
            <title>
               <p>Additional data file 81</p>
            </title>
            <caption>
               <p>COG2716</p>
            </caption>
            <text>
               <p>COG2716</p>
            </text>
            <file name="gb-2002-3-5-research0024-S81.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S82">
            <title>
               <p>Additional data file 82</p>
            </title>
            <caption>
               <p>COG3261</p>
            </caption>
            <text>
               <p>COG3261</p>
            </text>
            <file name="gb-2002-3-5-research0024-S82.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S83">
            <title>
               <p>Additional data file 83</p>
            </title>
            <caption>
               <p>COG3262</p>
            </caption>
            <text>
               <p>COG3262</p>
            </text>
            <file name="gb-2002-3-5-research0024-S83.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="S84">
            <title>
               <p>Additional data file 84</p>
            </title>
            <caption>
               <p>COG3272</p>
            </caption>
            <text>
               <p>COG3272</p>
            </text>
            <file name="gb-2002-3-5-research0024-S84.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Charles DeLisi, Adnan Derti, I. King Jordan, Kira Makarova, Igor Rogozin, and Fyodor Kondrashov for critical reading of the manuscript and helpful discussions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Detecting protein function and protein-protein interactions from genome sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>285</volume>
            <fpage>751</fpage>
            <lpage>753</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.285.5428.751</pubid>
                  <pubid idtype="pmpid" link="fulltext">10427000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Gene and context: integrative approaches to genome analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Adv Protein Chem</source>
            <pubdate>2000</pubdate>
            <volume>54</volume>
            <fpage>345</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10829232</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Yanai</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Derti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>7940</fpage>
            <lpage>7945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">35447</pubid>
                  <pubid idtype="pmpid" link="fulltext">11438739</pubid>
                  <pubid idtype="doi">10.1073/pnas.141236298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Communication modules in bacterial signaling proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Parkinson</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Kofoid</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1992</pubdate>
            <volume>26</volume>
            <fpage>71</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.ge.26.120192.000443</pubid>
                  <pubid idtype="pmpid" link="fulltext">1482126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Modular multidomain phosphoryl transfer proteins of bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Reizer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Saier</snm>
                  <fnm>MH</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>1997</pubdate>
            <volume>7</volume>
            <fpage>407</fpage>
            <lpage>415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(97)80059-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">9204284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Signaling - 2000 and beyond.</p>
            </title>
            <aug>
               <au>
                  <snm>Hunter</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>100</volume>
            <fpage>113</fpage>
            <lpage>127</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10647936</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The impact of comparative genomics on our understanding of evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>101</volume>
            <fpage>573</fpage>
            <lpage>576</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10892642</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Comparative genomics of the eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gabor Miklos</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Hariharan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Fortini</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2204</fpage>
            <lpage>2215</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2204</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731134</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Initial sequencing and analysis of the human genome.</p>
            </title>
            <aug>
               <au>
                  <cnm>International Human Genome Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Protein interaction maps for complete genomes based on gene fusion events.</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Ilipoulos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>402</volume>
            <fpage>86</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10573422</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Who's your neighbor? New computational approaches for functional genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>609</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/76443</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genome evolution: gene fusion versus gene fission.</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>9</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01924-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10637623</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genome trees constructed using five different approaches suggest new major bacterial clades.</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <fpage>8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60490</pubid>
                  <pubid idtype="pmpid" link="fulltext">11734060</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-1-8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A molecular view of microbial diversity and the biosphere.</p>
            </title>
            <aug>
               <au>
                  <snm>Pace</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>276</volume>
            <fpage>734</fpage>
            <lpage>740</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.276.5313.734</pubid>
                  <pubid idtype="pmpid" link="fulltext">9115194</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Is there a phylogenetic signal in prokaryote proteins?</p>
            </title>
            <aug>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Mitchison</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>49</volume>
            <fpage>98</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10368438</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya.</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Kandler</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Wheelis</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1990</pubdate>
            <volume>87</volume>
            <fpage>4576</fpage>
            <lpage>4579</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">54159</pubid>
                  <pubid idtype="pmpid" link="fulltext">2112744</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Archaea and the prokaryote-to-eukaryote transition.</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1997</pubdate>
            <volume>61</volume>
            <fpage>456</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9409149</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A genomic perspective on protein families.</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>278</volume>
            <fpage>631</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.278.5338.631</pubid>
                  <pubid idtype="pmpid" link="fulltext">9381173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The COG database: new developments in phylogenetic classification of proteins from complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Garkavtsev</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Shankavaram</snm>
                  <fnm>UT</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>22</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29819</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125040</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.22</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>619</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1997.4821861.x</pubid>
                  <pubid idtype="pmpid">9379893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>442</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(98)01553-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9825671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of <it>Thermotoga maritima</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>399</volume>
            <fpage>323</fpage>
            <lpage>329</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/20601</pubid>
                  <pubid idtype="pmpid" link="fulltext">10360571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Lateral genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Trends Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>M5</fpage>
            <lpage>M8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0962-8924(99)01664-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">10611671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Horizontal gene transfer in prokaryotes: quantification and classification.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>55</volume>
            <fpage>709</fpage>
            <lpage>742</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.55.1.709</pubid>
                  <pubid idtype="pmpid" link="fulltext">11544372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Archaeal and bacterial hyperthermophiles: horizontal gene exchange or common ancestry?</p>
            </title>
            <aug>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>298</fpage>
            <lpage>299</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01811-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10431189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p><it>Thermotoga</it> heats up lateral gene transfer.</p>
            </title>
            <aug>
               <au>
                  <snm>Logsdon</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Faguy</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>R747</fpage>
            <lpage>R751</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(99)80474-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10531001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Conservation of gene order: a fingerprint of proteins that physically interact.</p>
            </title>
            <aug>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>324</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01274-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9787636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Genome alignment, evolution of prokaryotic genome organization and prediction of gene function using genomic context.</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>356</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1619R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>T-Coffee: A novel method for fast and accurate multiple sequence alignment.</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Construction of phylogenetic trees.</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Margoliash</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1967</pubdate>
            <volume>155</volume>
            <fpage>279</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="pmpid">5334057</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods.</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1996</pubdate>
            <volume>266</volume>
            <fpage>418</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8743697</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <aug>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>MOLPHY: Programs for Molecular Phylogenetics. Tokyo: Institute of Statistical Mathematics;</source>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Maximum likelihood inference of protein phylogeny and the origin of chloroplasts.</p>
            </title>
            <aug>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Miyata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1990</pubdate>
            <volume>31</volume>
            <fpage>151</fpage>
            <lpage>160</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>

