<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-0509-2-90</ui>
   <ji>1752-0509</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>From protein interactions to functional annotation: graph alignment in <it>Herpes</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Kol&#225;&#345;</snm>
               <fnm>Michal</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>kolarmi@thp.uni-koeln.de</email>
            </au>
            <au id="A2">
               <snm>L&#228;ssig</snm>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>lassig@thp.uni-koeln.de</email>
            </au>
            <au ca="yes" id="A3">
               <snm>Berg</snm>
               <fnm>Johannes</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>berg@thp.uni-koeln.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institut f&#252;r Theoretische Physik, Universit&#228;t zu K&#246;ln, Z&#252;lpicher Stra&#223;e 77, 50937 K&#246;ln, Germany</p>
            </ins>
            <ins id="I2">
               <p>Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, V&#237;de&#328;sk&#225; 1083, 14220 Praha, Czech Republic</p>
            </ins>
            <ins id="I3">
               <p>Kavli Institute for Theoretical Physics, University of California, Santa Barbara, CA 93106-4030 Santa Barbara, USA</p>
            </ins>
         </insg>
         <source>BMC Systems Biology</source>
         <issn>1752-0509</issn>
         <pubdate>2008</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>90</fpage>
         <url>http://www.biomedcentral.com/1752-0509/2/90</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18957106</pubid>
               <pubid idtype="doi">10.1186/1752-0509-2-90</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Kol&#225;&#345; et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Sequence alignment is a prolific basis of functional annotation, but remains a challenging problem in the 'twilight zone' of high sequence divergence or short gene length. Here we demonstrate how information on gene interactions can help to resolve ambiguous sequence alignments. We compare two distant <it>Herpes </it>viruses by constructing a <it>graph alignment</it>, which is based jointly on the similarity of their protein interaction networks and on sequence similarity. This hybrid method provides functional associations between proteins of the two organisms that cannot be obtained from sequence or interaction data alone.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We find proteins where interaction similarity and sequence similarity are individually weak, but together provide significant evidence of orthology. There are also proteins with high interaction similarity but without any detectable sequence similarity, providing evidence of functional association beyond sequence homology. The functional predictions derived from our alignment are consistent with genomic position and gene expression data.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our approach shows that evolutionary conservation is a powerful filter to make protein interaction data informative about functional similarities between the interacting proteins, and it establishes graph alignment as a powerful tool for the comparative analysis of data from highly diverged species.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>With the advent of genome-wide functional data, cross-species comparisons are no longer limited to sequence information. A classic extension of sequence alignment is structural alignment, which has been used to compare evolutionary distant RNAs <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and proteins conserved in structure rather than sequence <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Here we use protein interactions as evolutionary information beyond sequence <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>We perform a cross-species analysis of two herpesviruses, the <it>varicella-zoster virus </it>(VZV), causing chicken pox and shingles, and the <it>Kaposi's sarcoma-associated herpesvirus </it>(KSHV), responsible for cancer of the connective tissue. The two viruses have diverged approximately 200 million years ago. Their sequence dynamics is characterised by a high rate of point mutations (at least an order of magnitude faster than their host populations <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>) and a high rate of gain and loss of genes (an order of magnitude higher than the mutation rates of prokaryotes <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>). As a result, homologous proteins have an amino acid sequence identity of only about 20%. Moreover, many open reading frames are only about 60 amino acids long. Thus, the sequence similarity between the two species is in the 'twilight zone' of detection by alignment, <it>i.e</it>., orthologous open reading frames have alignment scores just marginally above the background of unrelated sequences.</p>
         <p>To improve the cross-species comparison, we jointly use the similarity of coding sequences and of protein interactions. Our hybrid comparison method called <it>graph alignment </it>establishes a mapping between genes of two species <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> using a probabilistic scoring system based on evolutionary rates of sequences and interaction networks. Several recent studies have used orthologs identified by sequence similarity to compare networks, for instance to identify ancestral networks <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, network parts enriched in conserved links <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> or to decide between paralogous genes, see <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Here this approach is turned on its head: we use network information to identify evolutionary and functional relationships in cases where there is no detectable sequence similarity. Related approaches appeared in <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, reviewed in <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>However, these approaches use ad-hoc scoring parameters, or parameters derived from a database of known orthologous genes <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> to determine the alignment. Our method uses an evolutionary model to infer all necessary parameters from the data set itself. In ref. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> we have applied this method to co-expression networks, which are fully connected. Here we explore the complementary regime of sparsely connected networks with noisy link and node similarity data, where graph alignment is used to resolve the twilight regime of evolutionary correlations. In this regime, statistically significant alignments have to be distinguished from a <it>low-fidelity regime </it>of spurious graph alignments. Understanding the statistics of graph alignment in both regimes turns out to be important for validation of the results in the twilight regime.</p>
         <p>Our cross-species comparison is grounded on a two-level evolutionary picture for protein coding sequence including (i) the specific sequence parts responsible for protein-protein interactions and (ii) the background coding sequence, most of which is unrelated to these interactions. The relevant processes include divergent sequence evolution, gain and loss of interactions, duplication of genes and the corresponding interactions, and gain and loss of genes. Functional relationships may stem from common ancestry and thus be detectable by sequence <it>homology</it>, but they may also arise by convergent evolution, this <it>analogy </it>displayed by similar interactions without sequence similarity. An example is given in Figure <figr fid="F1">1</figr>, where one gene has functionally replaced another gene by acquiring its interactions, a process called non-orthologous gene displacement <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Similarly, an orthologous gene pair may diverge in sequence beyond detectability, but conserved interaction patterns remain detectable due to functional constraints. Such functional or evolutionary relationships are to be deduced from the network of interactions between genes.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Detecting functional relationships by graph alignment</p>
            </caption>
            <text>
               <p><b>Detecting functional relationships by graph alignment</b>. In this example, the gene labelled <it>C </it>is replaced in one lineage with its functional equivalent <it>E</it>, which has the same interaction partners in the network. While some genes can still be correctly mapped across species using sequence information (green lines), the full evolutionary history and the mapping <it>C' </it>- E* are accessible from cross-species analysis only by taking into account the interaction networks.</p>
            </text>
            <graphic file="1752-0509-2-90-1"/>
         </fig>
         <p>The essence of our graph alignment approach is as follows: Experimental data on interacting proteins defines a protein interaction network consisting of nodes (proteins) and links (protein interactions). At this point, nodes can simply be labelled by a protein name, or ORF identifier, without recourse to sequence information. The local link similarity between pairs of aligned nodes defines the link score of the alignment. Aligned nodes can either both interact (resulting in a positive link score, <it>e.g</it>. nodes <it>A'</it>, <it>D' </it>and <it>A</it>*, <it>D</it>* in Fig. <figr fid="F1">1</figr>), both not interact (resulting in a small positive link score), or interact in one species and not in the other (resulting in a negative link score). The sequence similarity between aligned nodes defines their <it>node score</it>. The total score is the sum of link and node terms, with scoring parameters depending on the evolutionary distance between the species compared. Finding high-scoring graph alignments is an algorithmically hard problem, and we use the algorithm introduced in <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to perform the search. However, since many proteins have no clear sequence ortholog <it>and </it>few interaction partners, it turns out that high-scoring alignments are not guaranteed to be statistically or biologically significant. There exists a regime of spurious alignments consisting of islands of locally matching topology which do not respect sequence similarity: the low-fidelity regime discussed further in the methods section. It turns out that optimal alignments are produced using scoring parameters in the high-fidelity regime close to the transition to the low-fidelity regime.</p>
         <p>For the graph alignment between the VZV and KSHV viruses studied here, both the interaction networks and the gene sequences are crucial to determine functional or evolutionary relationships, while each part of the data by itself is less significant. In particular, we find protein pairs with low sequence similarity for which the interaction similarity strengthens the statistical inference of homology, as well as protein pairs without sequence similarity, which are aligned based on their interactions alone. We use this alignment to make functional predictions, which turn out to be consistent with published gene expression data, as well as gene position and molecular weight. Given a validated alignment, we can quantify the evolution of protein interactions. We find that interactions between functionally related proteins are more conserved than other interactions.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Optimal graph alignment between VZV and KSHV</p>
            </st>
            <p>The protein interaction network of the herpesvirus VZV consists of 76 open reading frames (ORFs) and 173 protein-protein interactions (of these ORFs, 19 have no detected interactions and are disregarded from the subsequent analysis). The protein interaction network of KSHV consists of 84 ORFs and 123 interactions (34 ORFs have no detected interactions), <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, see Figure <figr fid="F2">2a</figr>. Thirty-four ORFs in VZV have reciprocally best matching sequence homologs with reading frames in KSHV. Between pairs of ORFs with such homologous partners, there are 44 interactions in VZV and 25 interactions in KSHV. Of these interactions, 8 occur in both species, that is the overlap between interaction networks is about 13% when the alignment is given by sequence homology. The optimal alignment of the two networks is shown in Figure <figr fid="F2">2b</figr>. The list of aligned ORFs and details on the scoring are given in the supplementary text [see Additional file <supplr sid="S1">1</supplr>]. The alignment consists of 26 pairs of aligned ORFs, spanning one third of the protein interaction networks of VZV and KSHV. The alignment contains 44 interactions, 10 of which are self-interactions. Of the 34 interactions between distinct ORFs, 11 are matching interactions occurring in both protein interaction networks, only one of the 10 self-interactions matches. Of the 26 pairs of aligned ORFs, 24 pairs have detectable sequence similarity. The remaining 2 aligned pairs involve ORFs which have no detectable sequence similarity with each other or any other ORF. The mean connectivity of the aligned part of the protein interaction network is 3.0 interactions per ORF, compared with a mean connectivity of 2.4 of VZV and 1.5 of KSHV.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Supplementary Text</b>. The Supplementary Text [Additional file <supplr sid="S1">1</supplr>] gives full detail of the graph alignment method. The text further describes the methods used for sequence comparison and for calculations of statistical significance of the presented results.</p>
               </text>
               <file name="1752-0509-2-90-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Alignment of the protein interaction networks of herpesviruses VZV and KSHV</p>
               </caption>
               <text>
                  <p><b>Alignment of the protein interaction networks of herpesviruses VZV and KSHV</b>. a) The alignment maps the nodes from the highlighted sub-networks of the PINs. Nodes are colour coded according to sequence similarity, measured by the sequence alignment score <it>&#952; </it>[see Additional file <supplr sid="S1">1</supplr>]. Green nodes have high sequence similarity with <it>&#952; </it>> 0, red nodes have no sequence similarity detected, red/green nodes have low similarity with <it>&#952; </it>&#8804; 0. The ORFs that do not belong to the network alignment are shown in pale colours. Protein interactions are represented by links between nodes, interactions between ORFs in the alignment are shown in blue. Supplementary animation [Additional file <supplr sid="S2">2</supplr>] puts the aligned network further into the context of the PINs. b) The optimal alignment is shown with nodes representing aligned pairs of ORFs. Green links indicate interactions which have been detected in both KSHV and VZV. Interactions which have only been detected in KSHV or VZV are shown in magenta or red, respectively. The cluster of matching interactions linking nodes KSHV ORF23/VZV ORF39, 29b/42, 28/65, and 67.5/25 is highlighted. c) From the alignment to functional annotation: We show the alignment of the VZV ORF65 with KSHV ORF28 (central nodes) and the context in the protein interaction graphs. The aligned partners are connected with dashed lines, the green lines connect ORFs with significant sequence similarity and the red lines connect ORFs that are aligned solely due to similarity of their interactions. An ORF belongs either among structural ORFs (green squares) or information-processing ORFs (red squares), or its function is unknown (white squares). According to the alignment of KSHV ORF28 to VZV ORF65 the KSHV ORF28 is predicted to belong among structural genes. The fact that all but one of its conserved interacting partners have the same functional annotation further supports this prediction (guilt by conserved association).</p>
               </text>
               <graphic file="1752-0509-2-90-2"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Supplementary Animation</b>. The Supplementary Animation [Additional file <supplr sid="S2">2</supplr>] illustrates the network alignment algorithm and shows the intermediate steps between the Figure <figr fid="F2">2a</figr> and Figure <figr fid="F2">2b</figr>. See caption of the Figure <figr fid="F2">2a,b</figr> for the colour coding of the nodes and links.</p>
               </text>
               <file name="1752-0509-2-90-S2.mp4">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The quality of the alignment we have obtained can be tested by comparing the genomic positions of the aligned ORFs. We count the ranks of ORFs from the initial terminal repeats of the two genomes (left TR of KSHV, TRL of VZV). In Figure <figr fid="F3">3a</figr> the ranks of reading frames in VZV are plotted against the ranks of their alignment partners in KSHV. Aligned ORFs without any sequence similarity fit very well into the sequence of ORFs in their respective genomes. The molecular weights of the aligned nodes are highly correlated, see Figure <figr fid="F3">3b</figr>. In addition, we find that interactions among the aligned ORFs are more likely to be conserved across several other herpes species, including <it>herpes simplex virus </it>(HHV-1) and <it>murine cytomegalovirus </it>(mCMV). The mutual information on the interactions in different species within the alignment is 6.6-times higher than for the interactions among ORFs outside of the alignment [see Additional file <supplr sid="S1">1</supplr> for details].</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Corroborating evidence for the network alignment from gene position and molecular weight</p>
               </caption>
               <text>
                  <p><b>Corroborating evidence for the network alignment from gene position and molecular weight</b>. a) The gene rank of reading frames of VZV is plotted against the rank in KSHV of their alignment partner. The points fall into two diagonal bands indicating the conservation of gene order between the two viruses. The ORF pairs aligned solely on the basis of matching interactions fall within those bands. The only significant deviation from those bands, the pair KSHV ORF28/VZV ORF65, has related sequences, see text. b) The molecular weights of aligned pairs of reading frames show a strong correlation (Pearson's correlation coefficient <it>r </it>= 0.94). The two exceptions again are aligned because they have related sequences (top left, indicated in green). The aligned ORFs with little or no sequence similarity (red circles, see text) show highly correlated molecular weights.</p>
               </text>
               <graphic file="1752-0509-2-90-3"/>
            </fig>
            <p>In some cases, sequence similar pairs of ORFs are not aligned because of mismatched interactions. As an extreme case an ORF may have several interactions in one species, but none in the other, indicating most likely an unsuccessful yeast-two-hybrid assay (Y2H) experiment. Examples are KSHV ORF64/VZV ORF22, 22/37, 42/53, 36/47, and 33/44.</p>
         </sec>
         <sec>
            <st>
               <p>Functional relationships detected by interaction similarity</p>
            </st>
            <p>Some ORFs are aligned due to their matching interactions, either with low or with no detectable sequence similarity. We discuss these cases separately.</p>
            <sec>
               <st>
                  <p>KSHV ORF67.5/VZV ORF25</p>
               </st>
               <p>These ORFs have a sequence identity of only 18% over 76 aa (see Methods for details). They are listed as homologs in the VIDA3 database <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and both of them are thought to be homologs of the HHV-1 protein U<sub>L</sub>33 <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The alignment of these ORFs largely results from 4 matching links out of 5 in KSHV and 12 in VZV (p-value of 4 &#215; 10<sup>-3</sup>, [see Additional file <supplr sid="S1">1</supplr>]) with a local link score <it>S</it><sub><it>L </it></sub>= 4.57 versus node score <it>S</it><sub><it>N </it></sub>= 4.20. Our alignment thus confirms the homology.</p>
            </sec>
            <sec>
               <st>
                  <p>KSHV ORF28/VZV ORF65</p>
               </st>
               <p>These ORFs have a sequence identity of only 11% over 102 aa. They are not listed as sequence homologs in databases VOCS <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, VIDA3 <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and NCBI <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. However, the sequence alignment extends over their complete length, with no gaps. Again, the alignment of these nodes results from 4 matching links out of 4 in KSHV and out of 5 in VZV (p-value of 10<sup>-3</sup>) with a local link score <it>S</it><sub><it>L </it></sub>= 6.30 versus node score <it>S</it><sub><it>N </it></sub>= 3.50. Functional annotation is available only for VZV ORF65; it belongs to the membrane/glycoprotein class, most likely it is a type-II membrane protein <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The alignment of KSHV ORF28 with VZV ORF65 leads us to predict that KSHV ORF28 also codes for a membrane glycoprotein, see Figure <figr fid="F2">2c</figr> for illustration.</p>
               <p>Several experimental studies support this prediction. Gene expression studies show that ORF28 is co-expressed with tertiary lytic ORFs and hence probably falls in the classes of structural or host-virus-interaction genes <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. The expression of ORF28 is affected by blocking DNA replication <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> showing ORF28 is a secondary or tertiary gene. Furthermore, ORF28 has been detected in the virion by mass spectroscopy, leading to a tentative functional classification as a glycoprotein-envelope protein <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Finally, ORF28 is a positional homolog of the <it>Epstein-Barr virus </it>ORF BDLF3, which is known to encode glycoprotein gp150.</p>
            </sec>
            <sec>
               <st>
                  <p>KSHV ORF23/VZV ORF39</p>
               </st>
               <p>These ORFs have no significant sequence similarity: although the alignment obtained with <it>clustalW </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp> has a sequence identity of 18% over 240 aa, it is statistically insignificant; a randomised test yields a p-value of 0.43. A systematic analysis involving a wide range of different scoring parameters does not yield a statistically significant sequence alignment either [see Additional file <supplr sid="S1">1</supplr>]. The reading frames KSHV ORF23 and VZV ORF39 are aligned purely due to 3 matching interactions out of 4 of KSHV and 4 of VZV (p-value 2 &#215; 10<sup>-2</sup>). The local link score equals 4.47 versus a node score of <it>-</it>0.49. Functional classification is available only for VZV ORF39 as a membrane/glycoprotein <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The alignment thus leads us to predict that KSHV ORF23 also codes for a membrane glycoprotein.</p>
               <p>This prediction is supported by several experimental studies. Again ORF23 is co-expressed with tertiary lytic ORFs <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and is sensitive to blocked DNA replication <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, so it is a late gene. The expression patterns of ORF23 are similar to those of structural and packaging genes.</p>
            </sec>
            <sec>
               <st>
                  <p>KSHV ORF41/VZV ORF60</p>
               </st>
               <p>These ORFs have 3 matching interactions out of 3 in KSHV and 6 in VZV (<it>p </it>= 2 &#215; 10<sup>-2</sup>), but no significant sequence similarity (The <it>clustalW </it>sequence alignment has identity of 12% over 160 aa with p-value 0.94). They are aligned with a local link score of 4.39 versus a node score of -0.49. Both ORFs are functionally annotated. KSHV ORF41 codes for a helicase/primase associated factor <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and is not affected by blocking DNA replication <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. On the other hand, VZV ORF60 codes for the glycoprotein L <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B31">31</abbr></abbrgrp>. It may be that either of them has a so-far unknown function, leading to the matching protein interactions. This idea finds support in <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, where the expression maximum of ORF41 was found to come after the secondary lytic phase. This is surprising because the transcript is needed already during the secondary lytic phase (DNA replication). No other DNA-replicating gene controlled by a different operon to KSHV ORF41 has an expression dynamics with this property. Such a delay of the maximum of expression may have two reasons: either the transcription of the ORF41 is not controlled after its role is finished, or ORF41 indeed has a hitherto uncharacterised function in the tertiary lytic phase, possibly a structural one.</p>
               <p>We also note that ORF41 is specific to the class of <it>&#947;</it>-herpesviruses, of which KSHV is a member. Analogously, ORF60 is <it>a</it>-herpesvirus specific. It is possible that the homolog of ORF41 in VZV and the homolog of ORF60 in KSHV were lost as a result of either of these proteins acquiring a new function. This would be an example of non-orthologous gene displacement <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Interaction clusters</p>
            </st>
            <p>The alignment shown in the Figure <figr fid="F2">2</figr> contains a cluster of proteins all interacting with each other. This cluster comprises the aligned pairs KSHV ORF23/VZV ORF39, 28/65, 29b/42, and 67.5/25 connected by matching links only. The p-value for such a fully connected cluster (a clique) to emerge at random is approximately 5 &#215; 10<sup>-11</sup>. The pair KSHV ORF41/VZV ORF60 discussed above is connected to this cluster by two matching links, forming an almost fully connected cluster of 5 ORFs pairs with 8 of 10 possible links present and matching. Surprisingly, while all the other ORFs in the cluster code for structural proteins (virion assembly and structure proteins), ORF41 of KSHV is annotated as a helicase/primase associated factor, and hence codes a protein involved in DNA replication. The association with structure-related genes may be interpreted as a further evidence towards another function of ORF41 as a structural protein.</p>
            <p>This cluster of interacting proteins is also found in a third species, the Epstein-Barr virus EBV, which is of the same viral family as KSHV. Three of the four ORFs of the cluster in KSHV have sequence homologs in EBV, namely ORF23, ORF67.5, ORF29b. All of the corresponding ORFs in EBV are found to interact with each other (Peter Uetz, private communication).</p>
            <p>The individual species KSHV and VZV contain further clusters, but these are not conserved across species. For instance, the cluster comprising ORFs 28, 29b, 41 and K10 in KSHV contains genes coding for predicted virion proteins, virion assembly and host-virus interaction proteins. ORFs 25, 19, 27, and 38 forming a fully connected cluster in VZV code for proteins involved in virion assembly, nucleotide repair, metabolism, and host-virus interaction.</p>
         </sec>
         <sec>
            <st>
               <p>Interaction conservation and protein function</p>
            </st>
            <p>Protein interactions which are conserved across species shed further light on the functional relationship of the interaction partners. We compare the functions of interacting proteins (i) when the interaction is conserved between KSHV and VZV, and (ii) regardless of conservation.</p>
            <p>Each annotated protein can be assigned to one of two functional classes: it is either a 'structural protein' (its functional annotation is one of capsid/core protein, membrane/glycoprotein, virion protein, virion assembly), or an 'information-processing' protein (DNA replication, gene expression regulation, nucleotide repair/metabolism, host-virus interaction). We take the functions of two proteins to be similar if both their functional annotations fall into the same class. Based on this classification, we measure the correlation between functional annotations of interacting proteins by mutual information. For conserved interactions, this is nearly 20-times higher than for the set of all interactions (0.107 bits vs. 0.006 bits). Hence, conserved interactions are more likely to connect functionally similar proteins. Conversely, functionally similar proteins have more conserved interactions than functionally unrelated genes. The mutual information between interactions in the two species is nearly ten times higher for pairs of functionally similar proteins than for pairs of functionally different proteins (0.071 bits vs. 0.007 bits).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <sec>
            <st>
               <p>Graph alignment results from sequence and interaction similarity</p>
            </st>
            <p>Protein interactions are encoded in mutually matching binding domains. The evolutionary dynamics of these domains is governed by different evolutionary constraints and hence, by different tempi than the overall coding sequence. Moreover, the sequence of a domain may evolve considerably while its interaction is conserved. Therefore, we treat the experimental interaction data as evolutionary information independent of sequence data. Our alignment of herpesviruses VZV and KSHV yields a cross-species mapping between ORFs based jointly on the correlation between amino acid sequences and on the correlation between their protein interactions. The latter correlation depends both on the evolutionary divergence of the interaction networks, and on experimental noise. This approach is distinct from searching for the overrepresentation of matching interactions among sequence homologs <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. It allows the identification of homology in cases where sequence similarity between two ORFs has decayed to statistically insignificant levels. Resolving the 'twilight zone' of sequence similarity by additional information on protein interactions is particularly relevant for the case of short genes (such as in the present application), or high levels of domain shuffling. Our method also allows to detect functional analogs, <it>i.e</it>., proteins with similar interactions but without common ancestry. The resulting alignment is corroborated by genomic position and by molecular weight of aligned ORFs.</p>
         </sec>
         <sec>
            <st>
               <p>Functional predictions from interaction similarity</p>
            </st>
            <p>We find several cases of ORFs with no detectable sequence similarity which are aligned with each other solely on the basis of matching interactions. There are different possible mechanisms generating this situation; (i) a pair of orthologous genes loses their sequence similarity below the threshold of detectability, (ii) convergent evolution, and (iii) a gene functionally substitutes for another gene. The original gene may then be excised from the genome without phenotypic effect. This process has been termed non-orthologous gene displacement <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. In all three cases, sequence information is insufficient for functional prediction. Based on the alignment due to matching interactions and on the annotation of one of the alignment partners, we predict the function of several ORFs. These predictions are supported by gene expression experiments and by the genomic position of the ORFs.</p>
         </sec>
         <sec>
            <st>
               <p>Functional cluster as conserved subgraph</p>
            </st>
            <p>The optimal alignment (Figure <figr fid="F2">2</figr>) contains a cluster of 4 ORFs whose products all interact with each other in both viruses. All members of this cluster belong to a single functional class; they are involved in virion formation and structure and code for tertiary lytic transcripts.</p>
            <p>There are other fully connected clusters both in VZV and KSHV, but none of them occur in <it>both </it>viruses. These clusters contain proteins in different functional classes; one cluster in VZV contains proteins involved in virion assembly, nucleotide repair, metabolism, and host-virus interaction.</p>
         </sec>
         <sec>
            <st>
               <p>Guilt by conserved association, evolutionary constraints on network links</p>
            </st>
            <p>The guilt-by-association scheme of assigning like functions to interacting proteins <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> would fail in these cases of non-conserved clusters. However, we can refine this principle to guilt by conserved association, assigning similar functions only to proteins with an interaction in <it>both </it>species, which correctly describes the functional correlations in the above clusters. Indeed, while the functional classes of interacting proteins in a single species are only very weakly correlated, pairs of proteins with conserved interactions are more likely to share the same function.</p>
            <p>The guilt-by-conserved-association principle might be more than a statistical filter for false positive interactions by cross-species comparison. Interactions between proteins of the same functional class are more likely to be conserved across species than interactions between proteins of different functions, which may indicate a lower rate of evolution of interactions related to function. This, in turn, is consistent with natural selection imposing a specific constraint acting jointly on protein interactions that contribute to a cellular pathway. With data on further species, phylogenetic analysis will shed light on the evolutionary forces at the level of the protein interaction networks, particularly if adaptive events can be traced in the data <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Scoring sequence alignments</p>
            </st>
            <p>To account the uneven level of sequence divergence along the herpesviral genome, we optimise scoring parameters of the Needleman-Wunsch algorithm individually for each pair of ORFs. We then normalise the scores in the way that allows comparison of scores obtained with various scoring parameters following <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, [see Additional file <supplr sid="S1">1</supplr> for details]. The scores are directly comparable to <it>i.e</it>. ClustalW scores.</p>
         </sec>
         <sec>
            <st>
               <p>Scoring graph alignments</p>
            </st>
            <p>Consider a set of genes (or open reading frames) as nodes of a network, with pair-wise interactions between the corresponding proteins represented as Boolean network links. Given two such networks in related species, we construct a graph alignment, <it>i.e</it>., a mapping <it>p </it>of nodes of one network to nodes of the other network. This alignment is scored by <it>interaction similarity </it>and <it>sequence similarity </it>as follows: (i) Aligned node pairs (<it>i, j </it>= <it>&#960;</it>(<it>i</it>)) and (<it>i', j' </it>= <it>&#960;</it>(<it>i'</it>)) contribute a positive <it>link score </it>if a link is present both between the pair (<it>i, i'</it>) in one network and (<it>j, j'</it>) in the other (matching links, such as <it>D' </it>- <it>C' </it>and <it>D</it>* - <it>E</it>* in the example of Figure <figr fid="F1">1</figr>). A negative contribution results if a link is present in one network, but not in the other (mismatched links, such as <it>D' </it>- <it>B' </it>and <it>D</it>* - <it>B</it>* in Figure <figr fid="F1">1</figr>). The link score accounts for evolutionary divergence of the interaction networks, as well as for experimental errors in the network data. (ii) An aligned node pair (<it>i, j </it>= <it>&#960;</it>(<it>i</it>)) contributes a <it>node score </it>depending on the sequence similarity <it>&#952;</it><sub><it>ij</it></sub>, rewarding similarity between aligned pairs and penalising similarity between pairs not respected by the graph alignment.</p>
            <p>The total graph alignment score is the sum of independent contributions from sequence similarity and from link similarity. Hence, any high-scoring alignment will contain node pairs aligned primarily due to similarity of their interactions or of their sequences, or of both. Of course, the outcome of the alignment depends crucially on the relative weight of node score and link score. We determine optimal scoring functions self-consistently from the data within a Bayesian framework [see Additional file <supplr sid="S1">1</supplr>].</p>
         </sec>
         <sec>
            <st>
               <p>Computation of p-values</p>
            </st>
            <p>We consider pairs of independently generated random networks, and compare them to the alignments found in empirical data. The probability of finding in random networks two nodes with the same or higher interaction overlap as a given alignment is estimated, and serves as a p-value for the corresponding alignment [See Additional file <supplr sid="S1">1</supplr> for details.]</p>
         </sec>
         <sec>
            <st>
               <p>Graph alignment algorithm</p>
            </st>
            <p>We use an iterative algorithm as described in <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to find the high-scoring graph alignments. This algorithm is based on a mapping to the quadratic assignment problem. At each step, the highest scoring alignment is identified individually for each node, while keeping the rest of the alignment fixed. A certain amount of noise is used to help the alignment to escape from local score maxima, a procedure called simulated annealing <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. This noise amplitude is gradually decreased to zero, starting from some initial value <it>T </it>and an initial alignment of reciprocal best sequence matches. An R-package implementing the graph alignment is available from the bioconductor website <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Alignment regimes and parameter selection</p>
            </st>
            <p>We have performed extensive tests, both on artificially generated networks and on the experimental PIN data, to find the optimal scoring parameters. For network pairs with low link similarity, we have found two different alignment regimes depending on the initial noise level <it>T</it>. In the <it>high-fidelity </it>regime for values of <it>T </it>well below a threshold value <it>T</it><sub><it>D</it></sub>, the alignment consists mainly of the nodes with sequence similarity, but does not extend much beyond. In the <it>low-fidelity </it>regime for <it>T </it>above <it>T</it><sub><it>D</it></sub>, high-scoring alignments contain many link matches (even more than in the biologically correct alignment), but different runs have little overlap and most nodes (even with sequence similarity) are misaligned.</p>
            <p>Optimal detection of similarity occurs in the high-fidelity regime for values of <it>T </it>just below <it>T</it><sub><it>D</it></sub>. In this region, the alignment is still guided by sequence similarity, yet extends as much as possible into the set of nodes without sequence similarity.</p>
            <p>The occurrence of high-scoring alignments of low significance can be understood intuitively from the special case of two uncorrelated graphs with a narrow range of connectivities. Aligning a pair of randomly chosen nodes with each other, their neighbours, and their next neighbours, etc., will lead to a high link score (possibly offset to some extent by a low node score). There are many such alignments with a high score, yet low statistical significance. These spurious alignments occur for sparse networks at sufficiently low fractions of link matches and low numbers of nodes with sequence similarity. They are comparable to the score islands known in local sequence alignment <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B34">34</abbr><abbr bid="B39">39</abbr></abbrgrp>. However, unlike sequences with their one-dimensional structure, locally tree-like graphs can generate an exponentially large number of such score islands.</p>
         </sec>
         <sec>
            <st>
               <p>Reproducibility and robustness</p>
            </st>
            <p>To ensure reproducibility of our results the alignment procedure is repeated several times over in order to record how often a given pair of nodes is aligned. The results are shown in supplementary Fig. 2 [see Additional file <supplr sid="S1">1</supplr>]. As a conservative pruning procedure, we only consider aligned node pairs which appear in more than half the runs (for comparison, under random matching a given alignment partner appears with probability 1/<it>N </it>~ 0.03). The optimal scoring parameters turn out not to change between alignment runs.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>All authors contributed equally to the work. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Data deposition</p>
         </st>
         <p>The protein interactions for KSHV strain BC-1 and VZV Oka-parental were taken from the yeast two-hybrid screens (Y2H) of the Peter Uetz lab <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The sequences of the two herpesviruses were downloaded from the VOCs database <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and the NCBI database <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>.</p>
         <p><b>Accession numbers: Genomes</b>: KSHV: <it>Human herpesvirus 8 </it>strain <it>cell line BC-1 </it>(VOCs genome ID 890); VZV: <it>Human herpesvirus 3 </it>strain <it>Oka parental </it>(VOCs genome ID 921). <b>KSHV ORFs</b>: ORF 67.5: provided by Peter Uetz, sequence follows: "MEYASDQLLP RDMQILFPTI YCRLNAINYC QYLKTFLVQR AQPAACDHTL VLESKVDTVR QVLRKIVSTD AVFSEARARP"; ORF 28 [GenBank: <ext-link ext-link-id="NP 572080.1" ext-link-type="gen">NP 572080.1</ext-link>]; ORF 23: [GenBank: <ext-link ext-link-id="NP 572075.1" ext-link-type="gen">NP 572075.1</ext-link>]; ORF 41: [GenBank: <ext-link ext-link-id="NP 572094.1" ext-link-type="gen">NP 572094.1</ext-link>]; ORF 29b: [GenBank: <ext-link ext-link-id="NP 572081.1" ext-link-type="gen">NP 572081.1</ext-link>]. <b>VZV ORFs</b>: ORF25: VOCs ID 59436; ORF65: 59475; ORF39: 59450; ORF60: 59470; ORF42: 59453.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Peter Uetz for several fruitful discussions and making the interaction data available prior to publication, Gordon Brown and Derek Gatherer for discussions on the protein sequence alignment, and Maria Mar Alb&#224; for providing functional information data. Funding from the DFG is acknowledged under Grants SFB 680, SFB-TR12, and BE 2478/2-1. This research was supported in part by the Academy of Sciences of the Czech Republic under Project No. AV0Z50520514 and by the National Science Foundation under Grant No. PHY05-51164.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%</p>
            </title>
            <aug>
               <au>
                  <snm>Havgaard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lyngso</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>1815</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti279</pubid>
                  <pubid idtype="pmpid" link="fulltext">15657094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Assigning new GO annotations to Protein Data Bank sequences by structural homology</p>
            </title>
            <aug>
               <au>
                  <snm>Ponomarenko</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bourne</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shindyalov</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Proteins: Structure, Function and Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>58</volume>
            <fpage>855</fpage>
            <lpage>865</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/prot.20355</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>TM-align: a protein structure alignment algorithm based on the TM-score</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Skolnick</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>7</issue>
            <fpage>2302</fpage>
            <lpage>2309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1084323</pubid>
                  <pubid idtype="pmpid" link="fulltext">15849316</pubid>
                  <pubid idtype="doi">10.1093/nar/gki524</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Herpesviral protein networks and their interaction with the human proteome</p>
            </title>
            <aug>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>YA</fnm>
               </au>
               <au>
                  <snm>Zeretzke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Atzler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Baiker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Berger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rajagopala</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Roupelieva</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fossum</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <fpage>239</fpage>
            <lpage>242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1116804</pubid>
                  <pubid idtype="pmpid" link="fulltext">16339411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Molecular phylogeny of the Alphaherpesvirinae subfamily and a proposed evolutionary timescale</p>
            </title>
            <aug>
               <au>
                  <snm>McGeoch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>1994</pubdate>
            <volume>238</volume>
            <fpage>9</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.1264</pubid>
                  <pubid idtype="pmpid" link="fulltext">8145260</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genomewide function conservation and phylogeny in the Herpesviridae</p>
            </title>
            <aug>
               <au>
                  <snm>Mar Alb&#224;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Das</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Orengo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kellam</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>43</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311046</pubid>
                  <pubid idtype="pmpid" link="fulltext">11156614</pubid>
                  <pubid idtype="doi">10.1101/gr.149801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Cross-species analysis of biological networks by Bayesian alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Berg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>L&#228;ssig</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>29</issue>
            <fpage>10967</fpage>
            <lpage>10972</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1544158</pubid>
                  <pubid idtype="pmpid" link="fulltext">16835301</pubid>
                  <pubid idtype="doi">10.1073/pnas.0602294103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Reconstruction of ancestral protein interaction networks for the bZIP transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Pinney</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Amoutzias</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Rattray</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>51</issue>
            <fpage>20449</fpage>
            <lpage>20453</lpage>
            <url>http://www.pnas.org/content/104/51/20449.abstract</url>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0706339104</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Conserved pathways within Bacteria and Yeast as revealed by global protein network alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Kelley</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sittler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Root</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stockwell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>20</issue>
            <fpage>11394</fpage>
            <lpage>11399</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208768</pubid>
                  <pubid idtype="pmpid" link="fulltext">14504397</pubid>
                  <pubid idtype="doi">10.1073/pnas.1534710100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Conserved patterns of protein interaction in multiple species</p>
            </title>
            <aug>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Suthram</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McCuine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sittler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>6</issue>
            <fpage>1974</fpage>
            <lpage>1979</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548573</pubid>
                  <pubid idtype="pmpid" link="fulltext">15687504</pubid>
                  <pubid idtype="doi">10.1073/pnas.0409522102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The identification of similarities between biological networks: application to the metabolome and interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Cootes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Muggleton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sternberg</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>369</volume>
            <issue>4</issue>
            <fpage>1126</fpage>
            <lpage>1139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2007.03.013</pubid>
                  <pubid idtype="pmpid" link="fulltext">17466331</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Systematic identification of functional orthologs based on protein network comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Bandyopadhyay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>428</fpage>
            <lpage>435</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1415213</pubid>
                  <pubid idtype="pmpid" link="fulltext">16510899</pubid>
                  <pubid idtype="doi">10.1101/gr.4526006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Alignment of metabolic pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Pinter</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rokhlenko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Yeger-Lotem</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ziv-Ukelson</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3401</fpage>
            <lpage>3408</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti554</pubid>
                  <pubid idtype="pmpid" link="fulltext">15985496</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Alignment of molecular networks by integer quadratic programming</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>L</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>1631</fpage>
            <lpage>1639</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm156</pubid>
                  <pubid idtype="pmpid" link="fulltext">17468121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Pairwise global alignment of protein interaction networks by matching neighborhood topology</p>
            </title>
            <aug>
               <au>
                  <snm>Singh</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Berger</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proceedings of the 11th Annual International Conference on Research in Computational Molecular Biology (2007): Lecture Notes in Computer Science</source>
            <pubdate>2007</pubdate>
            <volume>4453</volume>
            <fpage>16</fpage>
            <lpage>31</lpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Gr&#230;mlin: General and robust alignment of multiple large interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Flannick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Novak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>McAdams</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1169</fpage>
            <lpage>1181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1557769</pubid>
                  <pubid idtype="pmpid" link="fulltext">16899655</pubid>
                  <pubid idtype="doi">10.1101/gr.5235706</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Current progress in network research: toward reference networks for key model organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Flannick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abeliuk</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Novak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Briefings in Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>318</fpage>
            <lpage>332</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/bbm038</pubid>
                  <pubid idtype="pmpid" link="fulltext">17728341</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Biomolecular network querying: a promising approach in systems biology</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Systems Biology</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <fpage>5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2245906</pubid>
                  <pubid idtype="pmpid" link="fulltext">18205908</pubid>
                  <pubid idtype="doi">10.1186/1752-0509-2-5</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Non-orthologous gene displacement</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends in Genetics</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <issue>9</issue>
            <fpage>334</fpage>
            <lpage>336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-9525(96)20010-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">8855656</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>VIDA: a virus database system for the organisation of virus genome open reading frames</p>
            </title>
            <aug>
               <au>
                  <snm>Mar Alb&#224;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pearl</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Shepherd</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Orengo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kellam</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nuleic Acids Research</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>1</issue>
            <fpage>133</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/29.1.133</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Characterization of the UL 33 gene product of herpes simplex virus 1</p>
            </title>
            <aug>
               <au>
                  <snm>Reynolds</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Baines</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2000</pubdate>
            <volume>266</volume>
            <issue>2</issue>
            <fpage>310</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/viro.1999.0090</pubid>
                  <pubid idtype="pmpid" link="fulltext">10639317</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Viral Genome Database: A tool for storing and analyzing genes and proteins from complete viral genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Hiscock</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Upton</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>484</fpage>
            <lpage>485</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.5.484</pubid>
                  <pubid idtype="pmpid" link="fulltext">10871272</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>National Center for Biotechnology Information Viral Genomes Project</p>
            </title>
            <aug>
               <au>
                  <snm>Bao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Leipe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pham</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Resenchuk</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rozanov</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2004</pubdate>
            <volume>78</volume>
            <issue>14</issue>
            <fpage>7291</fpage>
            <lpage>7298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">434121</pubid>
                  <pubid idtype="pmpid" link="fulltext">15220402</pubid>
                  <pubid idtype="doi">10.1128/JVI.78.14.7291-7298.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Varicella-Zoster Virus (VZV) ORF65 virion protein is dispensable for replication in cell culture and is phosphorylated by casein kinase II, but not by the VZV protein kinases</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Srinivas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lekstrom</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2001</pubdate>
            <volume>280</volume>
            <fpage>62</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/viro.2000.0741</pubid>
                  <pubid idtype="pmpid" link="fulltext">11162819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Kaposi's sarcoma-associated Herpesvirus latent and lytic gene expression as revealed by DNA arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Jenner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mar Alb&#224;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boshoff</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kellam</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2001</pubdate>
            <volume>75</volume>
            <issue>2</issue>
            <fpage>891</fpage>
            <lpage>902</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">113985</pubid>
                  <pubid idtype="pmpid" link="fulltext">11134302</pubid>
                  <pubid idtype="doi">10.1128/JVI.75.2.891-902.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Transcription program of human Herpesvirus 8 (Kaposi's sarcoma-associated Herpesvirus)</p>
            </title>
            <aug>
               <au>
                  <snm>Paulose-Murphy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ha</snm>
                  <fnm>NK</fnm>
               </au>
               <au>
                  <snm>Xiang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gillim</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yarchoan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meltzer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bittner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Trent</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zeichner</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2001</pubdate>
            <volume>75</volume>
            <issue>10</issue>
            <fpage>4843</fpage>
            <lpage>4853</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">114239</pubid>
                  <pubid idtype="pmpid" link="fulltext">11312356</pubid>
                  <pubid idtype="doi">10.1128/JVI.75.10.4843-4853.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Dissection of the Kaposi's sarcoma-associated Herpesvirus gene expression program by using the viral DNA replication inhibitor Cidofovir</p>
            </title>
            <aug>
               <au>
                  <snm>Lu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Suen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Frias</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pfeiffer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Chuang</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zeichner</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2004</pubdate>
            <volume>78</volume>
            <issue>24</issue>
            <fpage>13637</fpage>
            <lpage>13652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">533899</pubid>
                  <pubid idtype="pmpid" link="fulltext">15564474</pubid>
                  <pubid idtype="doi">10.1128/JVI.78.24.13637-13652.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Virion proteins of Kaposi's sarcoma-associated Herpesvirus</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Chong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2005</pubdate>
            <volume>79</volume>
            <issue>2</issue>
            <fpage>800</fpage>
            <lpage>811</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">538588</pubid>
                  <pubid idtype="pmpid" link="fulltext">15613308</pubid>
                  <pubid idtype="doi">10.1128/JVI.79.2.800-811.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Origin-independent assembly of Kaposi's sarcoma-associated Herpesvirus DNA replication compartments in transient cotransfection assays and association with the ORF-K8 protein and cellular PML</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ahn</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Alcendor</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jang</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hayward</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hayward</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Journal of Virology</source>
            <pubdate>2001</pubdate>
            <volume>75</volume>
            <issue>3</issue>
            <fpage>1487</fpage>
            <lpage>1506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">114054</pubid>
                  <pubid idtype="pmpid" link="fulltext">11152521</pubid>
                  <pubid idtype="doi">10.1128/JVI.75.3.1487-1506.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Characterization of interaction of gH and gL glycoproteins of varicella-zoster virus: their processing and trafficking</p>
            </title>
            <aug>
               <au>
                  <snm>Mare&#353;ov&#225;</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kutinov&#225;</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ludv&#237;kov&#225;</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>&#381;&#225;k</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mare&#353;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>N&#283;me&#269;kov&#225;</snm>
                  <fnm>v</fnm>
               </au>
            </aug>
            <source>Journal of General Virology</source>
            <pubdate>2000</pubdate>
            <volume>81</volume>
            <fpage>1545</fpage>
            <lpage>1552</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10811938</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Guilt-by-association goes global</p>
            </title>
            <aug>
               <au>
                  <snm>Oliver</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>601</fpage>
            <lpage>603</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001165</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The frailty of adaptive hypotheses for the origins of organismal complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>Suppl 1</issue>
            <fpage>8597</fpage>
            <lpage>8604</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0702207104</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Similarity detection and localization</p>
            </title>
            <aug>
               <au>
                  <snm>Hwa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>L&#228;ssig</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Physical Review Letters</source>
            <pubdate>1996</pubdate>
            <volume>76</volume>
            <fpage>2591</fpage>
            <lpage>2594</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1103/PhysRevLett.76.2591</pubid>
                  <pubid idtype="pmpid" link="fulltext">10060738</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Optimization by Simulated Annealing</p>
            </title>
            <aug>
               <au>
                  <snm>Kirkpatrick</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gelatt</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Vecchi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1983</pubdate>
            <volume>220</volume>
            <fpage>671</fpage>
            <lpage>680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.220.4598.671</pubid>
                  <pubid idtype="pmpid" link="fulltext">17813860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>BioConductor: open source software for bioinformatics</p>
            </title>
            <url>http://www.bioconductor.org</url>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Limit distributions of the maximal segmental score among markov-dependent partial sums</p>
            </title>
            <aug>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dembo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Adv Appl Prob</source>
            <pubdate>1992</pubdate>
            <volume>24</volume>
            <fpage>113</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/1427732</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Applications and statistics for multiple high-scoring segments in molecular sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Altschul</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1993</pubdate>
            <volume>90</volume>
            <fpage>5873</fpage>
            <lpage>5877</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">46825</pubid>
                  <pubid idtype="pmpid" link="fulltext">8390686</pubid>
                  <pubid idtype="doi">10.1073/pnas.90.12.5873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Asymmetric exclusion process and extremal statistics of random sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Bundschuh</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Phys Rev E</source>
            <pubdate>2002</pubdate>
            <volume>65</volume>
            <issue>3</issue>
            <fpage>031911</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1103/PhysRevE.65.031911</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The complete DNA sequence of varicella-zoster virus</p>
            </title>
            <aug>
               <au>
                  <snm>Davison</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Journal of General Virology</source>
            <pubdate>1986</pubdate>
            <volume>67</volume>
            <issue>9</issue>
            <fpage>1759</fpage>
            <lpage>1816</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/0022-1317-67-9-1759</pubid>
                  <pubid idtype="pmpid" link="fulltext">3018124</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Nucleotide sequence of the Kaposi sarcoma-associated herpesvirus (HHV8)</p>
            </title>
            <aug>
               <au>
                  <snm>Russo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bohenzky</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chien</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maddalena</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Parry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peruzzi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Edelman</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <issue>25</issue>
            <fpage>14862</fpage>
            <lpage>14867</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">26227</pubid>
                  <pubid idtype="pmpid" link="fulltext">8962146</pubid>
                  <pubid idtype="doi">10.1073/pnas.93.25.14862</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
