<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2180-7-61</ui>
   <ji>1471-2180</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Structure and evolution of a proviral locus of <it>Glyptapanteles indiensis </it>bracovirus</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Desjardins</snm>
               <mi>A</mi>
               <fnm>Christopher</fnm>
               <insr iid="I1"/>
               <insr iid="I5"/>
               <email>cdesjar3@mail.rochester.edu</email>
            </au>
            <au id="A2">
               <snm>Gundersen-Rindal</snm>
               <mi>E</mi>
               <fnm>Dawn</fnm>
               <insr iid="I2"/>
               <email>dawn.gundersen-rindal@ars.usda.gov</email>
            </au>
            <au id="A3">
               <snm>Hostetler</snm>
               <mi>B</mi>
               <fnm>Jessica</fnm>
               <insr iid="I1"/>
               <email>jessicah@jcvi.org</email>
            </au>
            <au id="A4">
               <snm>Tallon</snm>
               <mi>J</mi>
               <fnm>Luke</fnm>
               <insr iid="I1"/>
               <email>ljtallon@jcvi.org</email>
            </au>
            <au id="A5">
               <snm>Fuester</snm>
               <mi>W</mi>
               <fnm>Roger</fnm>
               <insr iid="I3"/>
               <email>roger.fuester@ars.usda.gov</email>
            </au>
            <au id="A6">
               <snm>Schatz</snm>
               <mi>C</mi>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>mschatz@umiacs.umd.edu</email>
            </au>
            <au id="A7">
               <snm>Pedroni</snm>
               <mi>J</mi>
               <fnm>Monica</fnm>
               <insr iid="I2"/>
               <email>pedronim@ba.ars.usda.gov</email>
            </au>
            <au id="A8">
               <snm>Fadrosh</snm>
               <mi>W</mi>
               <fnm>Douglas</fnm>
               <insr iid="I1"/>
               <email>dfadrosh@jcvi.org</email>
            </au>
            <au id="A9">
               <snm>Haas</snm>
               <mi>J</mi>
               <fnm>Brian</fnm>
               <insr iid="I1"/>
               <email>bhaas@jcvi.org</email>
            </au>
            <au id="A10">
               <snm>Toms</snm>
               <mi>S</mi>
               <fnm>Bradley</fnm>
               <insr iid="I1"/>
               <email>btoms@jcvi.org</email>
            </au>
            <au id="A11">
               <snm>Chen</snm>
               <fnm>Dan</fnm>
               <insr iid="I1"/>
               <email>danchen@jcvi.org</email>
            </au>
            <au id="A12" ca="yes">
               <snm>Nene</snm>
               <fnm>Vishvanath</fnm>
               <insr iid="I1"/>
               <email>nene@jcvi.org</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>The Institute for Genomic Research, a division of J. Craig Venter Institute, Rockville, Maryland, USA</p>
            </ins>
            <ins id="I2">
               <p>USDA-ARS Insect Biocontrol Laboratory, Beltsville, Maryland, USA</p>
            </ins>
            <ins id="I3">
               <p>USDA-ARS Beneficial Insect Introductions Research Laboratory, Newark, Delaware, USA</p>
            </ins>
            <ins id="I4">
               <p>Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA</p>
            </ins>
            <ins id="I5">
               <p>Department of Biology, University of Rochester, Rochester, New York, USA</p>
            </ins>
         </insg>
         <source>BMC Microbiology</source>
         <issn>1471-2180</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>61</fpage>
         <url>http://www.biomedcentral.com/1471-2180/7/61</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17594494</pubid>
               <pubid idtype="doi">10.1186/1471-2180-7-61</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>26</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Desjardins et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Bracoviruses (BVs), a group of double-stranded DNA viruses with segmented genomes, are mutualistic endosymbionts of parasitoid wasps. Virus particles are replication deficient and are produced only by female wasps from proviral sequences integrated into the wasp genome. Virus particles are injected along with eggs into caterpillar hosts, where viral gene expression facilitates parasitoid survival and therefore perpetuation of proviral DNA. Here we describe a 223 kbp region of <it>Glyptapanteles indiensis </it>genomic DNA which contains a part of the <it>G. indiensis </it>bracovirus (GiBV) proviral genome.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Eighteen of ~24 GiBV viral segment sequences are encoded by 7 non-overlapping sets of BAC clones, revealing that some proviral segment sequences are separated by long stretches of intervening DNA. Two overlapping BACs, which contain a locus of 8 tandemly arrayed proviral segments flanked on either side by ~35 kbp of non-packaged DNA, were sequenced and annotated. Structural and compositional analyses of this cluster revealed it exhibits a G+C and nucleotide composition distinct from the flanking DNA. By analyzing sequence polymorphisms in the 8 GiBV viral segment sequences, we found evidence for widespread selection acting on both protein-coding and non-coding DNA. Comparative analysis of viral and proviral segment sequences revealed a sequence motif involved in the excision of proviral genome segments which is highly conserved in two other bracoviruses.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Contrary to current concepts of bracovirus proviral genome organization, our results demonstrate that some but not all GiBV proviral segment sequences exist in a tandem array. Unexpectedly, non-coding DNA in the 8 proviral genome segments, which typically occupies ~70% of BV viral genomes, is under selection pressure, suggesting it serves some function(s). We hypothesize that selection acting on GiBV proviral sequences maintains the genetic island-like nature of the cluster of proviral genome segments described herein. In contrast to large differences in the predicted gene composition of BV genomes, sequences that appear to mediate processes of viral segment formation, such as proviral segment excision and circularization, appear to be highly conserved, supporting the hypothesis of a single origin for BVs.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Much recent attention in genomics has focused on bacterial endosymbionts of insects, including the ubiquitous <it>Wolbachia </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>, the sap-feeder symbionts <it>Buchnera</it>, <it>Baumannia</it>, and <it>Sulcia </it><abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, and several others <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Many of these symbionts bring unique metabolic capabilities to their hosts, allowing these insects to flourish on diets which otherwise would be difficult to utilize. Less attention has been given to viral endosymbionts. Bracoviruses (BVs) and ichnoviruses (IVs) form subgroups of polydnaviruses (PDVs) that have evolved as obligate endosymbionts of braconid and ichneumonid endoparasitoid wasps, respectively, and appear to provide their primary hosts with pathogenic abilities <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Endoparasitoid wasps primarily parasitize other insects and usually kill the host organism in which they develop. Most endoparasitoid wasps, including those that house PDVs, utilize a particularly difficult developmental strategy, known as koinobiosis, whereby the host continues to develop after it has been parasitized. Wasp eggs therefore begin development in a hostile environment in which they come under attack from the host's immune system. PDVs disrupt these responses.</p>
         <p>Members of Polydnaviridae represent the only known viruses with segmented double-stranded DNA genomes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. They exist in two forms: as an asymptomatic proviral form integrated into the genome of male and female wasps <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, and as virions. Proviral DNA is amplified from wasp genomic DNA, and viral genome segments are excised, circularized, and packaged into virus particles only within specialized ovarian calyx cells of females <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Virions are released into the reproductive tract and do not appear to cause any ill effects. During oviposition, virions, along with wasp eggs and other factors, are injected into a secondary host, usually a caterpillar, where viral gene expression facilitates endoparasitoid survival by disrupting secondary host immunity, physiology, and development <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Additional wasp factors such as venom, ovarian proteins, and egg-associated teratocytes may contribute to parasitism success. Virus particles do not replicate within the secondary (or primary) host, yet viral-mediated pathology ensures perpetuation of the proviral form of the virus within the parasitoid life cycle.</p>
         <p>PDVs are involved in a highly successful triad of mutualistic-parasitic relationships: it is estimated that there are over 30,000 wasp-PDV associations, with each wasp species exhibiting specific preferences in the host range they parasitize <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Drawing parallels from mitochondrial and bacterial endosymbiont genome evolution, some have hypothesized that PDVs are the product of reductive viral evolution <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B21">21</abbr></abbrgrp>. Viral terminology is used to describe PDVs, although many unusual aspects of their biology have called into question this classification. Eukaryote-like genome properties and functional similarities between some PDV genes and components of wasp ovarian fluid have led to the suggestion that PDVs are not viruses at all, but rather represent genetic delivery vehicles that have acquired a virus-like packaging system and have evolved to transfer wasp parasitism genes to the lepidopteran host <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. The evolutionary history of PDVs is further obscured by the hypothesis that, despite gross similarities in form and function, BVs and IVs have evolved independently <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Bracoviruses, however, are thought to be monophyletic, as all bracovirus-bearing wasps form a clade which originated ~74 million years ago <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         <p>To date six PDV viral genomes have been sequenced: CcBV and MdBV, BVs associated with the braconid wasps <it>Cotesia congregata </it>and <it>Microplitis demolitor</it>, respectively, and CsIV, HfIV, and TrIV, IVs associated with the ichneumonids <it>Campoletis sonorensis, Hyposoter fugitivus</it>, and <it>Tranosema rostrale </it><abbrgrp><abbr bid="B24">24</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. The sixth sequenced PDV, which is associated with the banchine ichneumonid <it>Glypta fumiferanae</it>, is hypothesized to form a third independent lineage of PDVs <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The packaged genomes of these viruses consist of between 15 and 105 circular segments and have aggregate sizes ranging from 189 to 568 kbp. Unlike typical viruses, only 17&#8211;30% of the viral genomes code for proteins, many genes are predicted to contain introns, and no genes code for obvious components of a DNA replication or transcription machinery. Thus, host enzymes may be utilized during construction of virus particles and/or viral genes may constitute part of proviral sequences which are not packaged into virus particles. In CsIV there is evidence for partitioning of genes encoding protein components of the virus particle between packaged and non-packaged genomic DNA <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, although no similar example has been shown for BVs. Compartmentalization of genes that are needed to maintain the PDV life cycle complicates study of virus biology and raises questions on the definition of sequences that constitute a PDV proviral genome.</p>
         <p>While PDV viral genomes are better characterized, information on proviral genomes is limited. Studies on the location of proviral genome segment sequences in CsIV suggest that IV proviral genomes are integrated at multiple loci in the wasp genome <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. By contrast, it is thought that BV proviral genome segments are tandemly arrayed in a single locus and separated by short intervening sequences <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. The latter hypothesis is based on studies of CcBV and CiBV in which proviral genome segments were flanked, at least on one end, by a different proviral genome segment <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr></abbrgrp> and a fluorescent in situ hybridization mapping study in which probes from three different CcBV viral genome segments hybridized to the same region of a single wasp chromosome <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
         <p>The current model for production of BV viral segment sequences is that one or more large precursor molecules encompassing multiple proviral genome segments are excised from genomic DNA and amplified, and this DNA forms the substrate from which viral segments are excised <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. According to studies of CcBV and CiBV (BV associated with <it>Chelonus inanitus</it>), all amplification of BV DNA occurs at the level of the precursor molecule&#8211;no amplification occurs following excision of viral genome segments <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. The DNA sequences at the segmental boundaries of a limited number of proviral genome segments of CsIV, CiBV and CcBV have been studied <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B33">33</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, and, in each, a direct DNA sequence repeat occurs at the boundaries. Proviral genome segment sequences are excised from the precursor molecules at these repeats, possibly via conservative site-specific recombination, and a single copy of the repeat is retained within the circularized viral segment <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Additionally, genome segments are packaged into virus particles in different abundances <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B33">33</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. Recent semi-quantitative studies have shown large differences in copy number in both viral (MdBV and CiBV) and proviral (CiBV only) forms of segments <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B42">42</abbr></abbrgrp>. The details of this phenomenon and its relationship to amplification and excision are unknown.</p>
         <p>Here we describe the analyses of a 223 kbp section of genomic DNA from the braconid <it>Glyptapanteles indiensis</it>, which parasitizes the gypsy moth. This region contains 8 proviral genome segments of <it>G. indiensis </it>bracovirus (GiBV). Our data provide new insight into BV proviral genome structure, as not all GiBV viral genome segment sequences are linked in a single tandem array in the wasp genome. Conserved DNA sequences identified at the junctions of GiBV proviral genome segment sequences and in GiBV, CcBV and MdBV viral segments suggest that sequence motifs governing segment excision are highly conserved across bracoviruses. Analyses of GiBV viral segment sequence polymorphism data indicate that widespread selection acts on non-coding DNA, suggesting additional functional motifs or non-coding RNAs are present in the GiBV viral genome. Finally, there is a marked difference in nucleotide composition between proviral segment sequences and flanking DNA that is not packaged into virus particles.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Partial sequence characterization of GiBV viral DNA</p>
            </st>
            <p>Viral DNA was subjected to whole genome shotgun sequencing using purified virus pooled from the calyx fluid of ~400 female wasps from an outbred population. As judged by sizing on agarose gels, the GiBV viral genome was expected to contain 13 segments with a genome size of ~250 kbp <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. However, assembly of our preliminary sequence data indicates an aggregate genome size of ~490 kbp and ~24 different segments. Many segments are of similar sizes and would have co-migrated on agarose gels. A high frequency of single nucleotide polymorphisms (SNPs) (~1/70 bp) and insertions and deletions (indels) in the DNA of the viral population that was sampled complicated the closure phase of the sequencing project. Nevertheless, 19 of the 24 preliminary viral genome segment sequences were of sufficient quality to allow development of segment-specific PCR primers (data not shown). These primers were used to determine the proviral genome segment composition of BAC clones that hybridized with <sup>32</sup>P-labeled GiBV viral DNA. Priority was given to closing sequence and physical gaps in 8 viral genome segments that were encoded by two overlapping BAC clones (see below). A consensus sequence was generated for each viral genome segment (see Materials and Methods), and the resulting sequences, which varied in length from 10 to 26 kbp, were deposited in GenBank (<ext-link ext-link-type="gen" ext-link-id="EF051505">EF051505</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="EF051512">EF051512</ext-link>). Individual sequence reads were deposited in the NCBI Trace Archive (1472627677-1472629890).</p>
         </sec>
         <sec>
            <st>
               <p>Identification of BAC clones containing GiBV proviral DNA</p>
            </st>
            <p>Radioactive probes derived from total GiBV viral DNA hybridized at varying intensity to 127 clones from a BAC library of 9,216 clones made from the larvae of <it>G. indiensis</it>. Nineteen viral genome segment-specific PCRs were used to genotype 60 BAC clones to determine their proviral genome segment composition. These BAC clones segregated into 7 sets that contained non-overlapping profiles of viral genome segments (Table <tblr tid="T1">1</tblr>). Each set contained 1 to 7 proviral genome segments, and in total 17 of the 19 proviral genome segments were identified. Additionally, a sub-set of 30 BAC clones was fingerprinted using <it>Eco</it>RI and the resulting restriction enzyme patterns were used to place the BAC clones into overlapping contigs. This method of clustering was consistent with the results of the segment-specific PCRs (data not shown).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Proviral genome segment composition of 60 GiBV BAC clones.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Genome Segment Set</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Number of Genome Segments</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Number of positive BACs</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Number of BACs tested</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Non-overlapping sets of proviral genome segments found in BAC clones, arbitrarily designated as sets 1&#8211;7, are shown in column 1. The second column shows the number of proviral genome segments identified in each set. The third column shows the number of BACs which tested positive for that set, and the fourth column shows the number of BACs that were tested for that set. Some segment sets were tested on fewer than 60 BAC clones, as once multiple clones were identified for a set of proviral genome segments, the primer pairs representing those sets of segments were removed from PCR experiments to reduce the number of PCRs needed to identify the entire proviral genome.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Structure and composition of GiBV proviral locus 1</p>
            </st>
            <p>Two overlapping BAC clones that appeared to code for a cluster of 7 proviral genome segments were selected for sequencing. BAC clones 18I8 and 20D14 were 120,708 bp and 116,222 bp in length, respectively, and overlapped by 14,273 bp. The region of sequence overlap contained 53 SNPs, indicating the BAC clones were derived from different individuals from a population of <it>G. indiensis</it>. A contiguous DNA sequence was generated by fusing positions 1&#8211;109,055 of clone 18I8 with positions 2,560&#8211;116,222 of clone 20D14, resulting in a region spanning 222,657 bp. The annotated DNA sequence was deposited in GenBank (<ext-link ext-link-type="gen" ext-link-id="AC191960">AC191960</ext-link>).</p>
            <p>The coordinates of the 7 proviral genome segment sequences in this region were determined by aligning viral genome segment sequences to it. A search of the BAC sequences against the entire assembly of viral genome segment shotgun sequence data led to the identification and closure of an extra viral genome segment sequence. This assembly was not of high enough quality for primer design during the BAC clone screening phase. Thus, a cluster of 8 proviral genome segments, labeled 1p to 8p and separated by 7 inter-segmental regions (isg1 to isg7) that vary in length from 122 bp to 8.4 kbp, occupies ~163 kbp of DNA, which we call GiBV proviral locus 1. Interestingly, the 34 kbp and 25 kbp regions of DNA that flank locus 1 each contain a 6&#8211;7 kbp section of DNA (L1R1 and L1R2) consisting primarily of non-coding tandem DNA sequence repeats (Figure <figr fid="F1">1</figr>, Table <tblr tid="T2">2</tblr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Structural organization of GiBV proviral locus 1</p>
               </caption>
               <text>
                  <p><b>Structural organization of GiBV proviral locus 1</b>. Proviral genome segments are labeled 1p-8p, with the square and pointed ends representing the 5' and 3' ends, respectively, relative to the putative excision motif. Inter-segmental regions are labeled isg1-isg7, and sequence regions outside the proviral genome segment sequences are labeled I-IV. The flanking tandem repeat regions (solid black squares) are labeled L1R1 and L1R2, and their structure is shown in the open boxes as black boxes in parentheses followed by the copy number of the repeat as a subscript. The 2 BAC sequences were joined in isg4 (*), allowing the entirety of each proviral segment sequence to originate from a single BAC clone. Colored boxes represent genes; grey boxes are non-packaged genes, light green boxes are hypothetical proteins without gene family assignment, and the remaining colors represent different gene families.</p>
               </text>
               <graphic file="1471-2180-7-61-1"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Features of the regions of GiBV proviral locus 1</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Region</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Coordinates</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Size (bp)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>% G+C (c/n-c)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>% Coding</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Predicted genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>1 &#8211; 23133</p>
                     </c>
                     <c ca="left">
                        <p>23133</p>
                     </c>
                     <c ca="left">
                        <p>31 (47/27)</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L1R1</p>
                     </c>
                     <c ca="left">
                        <p>23134 &#8211; 29250</p>
                     </c>
                     <c ca="left">
                        <p>6117</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>II</p>
                     </c>
                     <c ca="left">
                        <p>29251 &#8211; 34177</p>
                     </c>
                     <c ca="left">
                        <p>4927</p>
                     </c>
                     <c ca="left">
                        <p>35 (42/32)</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1p</p>
                     </c>
                     <c ca="left">
                        <p>34178 &#8211; 54542</p>
                     </c>
                     <c ca="left">
                        <p>20365</p>
                     </c>
                     <c ca="left">
                        <p>37 (38/36)</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg1</p>
                     </c>
                     <c ca="left">
                        <p>54543 &#8211; 54769</p>
                     </c>
                     <c ca="left">
                        <p>227</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2p</p>
                     </c>
                     <c ca="left">
                        <p>54770 &#8211; 78277</p>
                     </c>
                     <c ca="left">
                        <p>23508</p>
                     </c>
                     <c ca="left">
                        <p>36 (44/34)</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg2</p>
                     </c>
                     <c ca="left">
                        <p>78278 &#8211; 78394</p>
                     </c>
                     <c ca="left">
                        <p>117</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3p</p>
                     </c>
                     <c ca="left">
                        <p>78395 &#8211; 94733</p>
                     </c>
                     <c ca="left">
                        <p>16339</p>
                     </c>
                     <c ca="left">
                        <p>37 (41/35)</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg3</p>
                     </c>
                     <c ca="left">
                        <p>94734 &#8211; 94903</p>
                     </c>
                     <c ca="left">
                        <p>170</p>
                     </c>
                     <c ca="left">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4p</p>
                     </c>
                     <c ca="left">
                        <p>94904 &#8211; 108614</p>
                     </c>
                     <c ca="left">
                        <p>13711</p>
                     </c>
                     <c ca="left">
                        <p>36 (41/31)</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg4</p>
                     </c>
                     <c ca="left">
                        <p>108615 &#8211; 110126</p>
                     </c>
                     <c ca="left">
                        <p>1512</p>
                     </c>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5p</p>
                     </c>
                     <c ca="left">
                        <p>110127 &#8211; 135963</p>
                     </c>
                     <c ca="left">
                        <p>25837</p>
                     </c>
                     <c ca="left">
                        <p>37 (41/34)</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg5</p>
                     </c>
                     <c ca="left">
                        <p>135964 &#8211; 136085</p>
                     </c>
                     <c ca="left">
                        <p>122</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6p</p>
                     </c>
                     <c ca="left">
                        <p>136086 &#8211; 155462</p>
                     </c>
                     <c ca="left">
                        <p>19377</p>
                     </c>
                     <c ca="left">
                        <p>37 (37/37)</p>
                     </c>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg6</p>
                     </c>
                     <c ca="left">
                        <p>155463 &#8211; 156602</p>
                     </c>
                     <c ca="left">
                        <p>1140</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7p</p>
                     </c>
                     <c ca="left">
                        <p>156603 &#8211; 179005</p>
                     </c>
                     <c ca="left">
                        <p>22403</p>
                     </c>
                     <c ca="left">
                        <p>36 (41/32)</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>isg7</p>
                     </c>
                     <c ca="left">
                        <p>179006 &#8211; 187374</p>
                     </c>
                     <c ca="left">
                        <p>8369</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8p</p>
                     </c>
                     <c ca="left">
                        <p>187375 &#8211; 197431</p>
                     </c>
                     <c ca="left">
                        <p>10057</p>
                     </c>
                     <c ca="left">
                        <p>38 (42/34)</p>
                     </c>
                     <c ca="left">
                        <p>47</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>III</p>
                     </c>
                     <c ca="left">
                        <p>197432 &#8211; 204112</p>
                     </c>
                     <c ca="left">
                        <p>6681</p>
                     </c>
                     <c ca="left">
                        <p>33 (43/28)</p>
                     </c>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L1R2</p>
                     </c>
                     <c ca="left">
                        <p>204113 &#8211; 211240</p>
                     </c>
                     <c ca="left">
                        <p>7128</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IV</p>
                     </c>
                     <c ca="left">
                        <p>211241 &#8211; 222657</p>
                     </c>
                     <c ca="left">
                        <p>11417</p>
                     </c>
                     <c ca="left">
                        <p>30 (43/27)</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Coordinates are with respect to the sequence of the entire locus. The % G+C column is divided into coding (c) and non-coding (n-c) for regions predicted to encode genes.</p>
               </tblfn>
            </tbl>
            <p>A variety of nucleotide compositional differences exist between the flanking regions I-IV, the inter-segmental regions, and the proviral genome segments. The proviral genome segments and L1R1/L1R2 have the highest average G+C content (37%), followed by the flanking regions (32%), while the inter-segmental regions have the lowest G+C content (26%). The difference in G+C content between coding and non-coding DNA is greater in flanking regions I-IV (44% vs. 28%) than in proviral genome segment sequences (41% vs. 34%) (Table <tblr tid="T2">2</tblr>). Relative dinucleotide frequencies, which correct for background G+C composition, were calculated for each region > 500 bp in length except L1R1 and L1R2, because tandemly repetitive sequences have highly biased dinucleotide frequencies. Neighbor-joining clustering of the distances derived from these data (Figure <figr fid="F2">2</figr>) revealed that all of the proviral genome segments cluster together and have a highly similar dinucleotide composition, which is distinct from that of flanking DNA. Regions I and IV clustered together and were the most distant from the proviral genome segments, whereas regions II and III and the inter-segmental regions clustered between the proviral genome segments and regions I and IV.</p>
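            <p>As an illustration of the kind of calculation described above, the following minimal sketch computes relative dinucleotide frequencies for a sequence and the distance between two such profiles. It assumes the common odds-ratio definition, in which the observed frequency of each dinucleotide is divided by the product of the frequencies of its two bases, and a plain (unnormalized) Euclidean distance; the exact formula and normalization used in this study may differ. The function names are hypothetical.</p>
            <p>
```python
from itertools import product
from math import sqrt

BASES = "ACGT"
DINUCS = ["".join(p) for p in product(BASES, repeat=2)]


def relative_dinucleotide_frequencies(seq):
    """Return {dinucleotide: f_xy / (f_x * f_y)} for one DNA sequence.

    Assumption: odds-ratio definition on a single strand; not necessarily
    the exact statistic used in the article.
    """
    seq = seq.upper()
    # Count single bases, ignoring anything that is not A, C, G or T.
    mono = {b: seq.count(b) for b in BASES}
    n_mono = sum(mono.values())
    # Count overlapping dinucleotides made only of unambiguous bases.
    di = dict.fromkeys(DINUCS, 0)
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in di:
            di[pair] += 1
    n_di = sum(di.values())
    if n_di == 0:
        raise ValueError("sequence contains no unambiguous dinucleotide")
    rho = {}
    for pair in DINUCS:
        expected = (mono[pair[0]] / n_mono) * (mono[pair[1]] / n_mono)
        observed = di[pair] / n_di
        # A base that never occurs gives expected == 0; report 0.0 there.
        rho[pair] = observed / expected if expected else 0.0
    return rho


def euclidean_distance(rho_a, rho_b):
    """Plain Euclidean distance between two 16-value profiles."""
    return sqrt(sum((rho_a[d] - rho_b[d]) ** 2 for d in DINUCS))
```
            </p>
            <p>A pairwise distance matrix built this way for all regions could then be passed to any neighbor-joining implementation. Short regions give noisy estimates, which is consistent with the exclusion of regions under 500 bp from the analysis.</p>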
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Neighbor-joining clustering of the regions of proviral locus 1 based on relative dinucleotide frequencies</p>
               </caption>
               <text>
                  <p><b>Neighbor-joining clustering of the regions of proviral locus 1 based on relative dinucleotide frequencies</b>. All proviral genome segments (1p-8p) group together, as do the regions outside the flanking repeats (I and IV). The scale represents the normalized Euclidean distance between regions. Regions &lt; 500 bp (isg1&#8211;3, 5) and the flanking repeats (L1R1 and L1R2) were excluded from the analysis, as they have skewed dinucleotide frequencies.</p>
               </text>
               <graphic file="1471-2180-7-61-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>A conserved DNA sequence motif exists at proviral genome segment junctions</p>
            </st>
            <p>Visual examination of the DNA sequence at the junctions between GiBV proviral genome segments and inter-segmental regions led to the identification of a 6 bp direct sequence repeat (AGCTTT), which is perfectly conserved at 14 of the 16 junctions and has one nucleotide substitution at each of the remaining 2 junctions. Because this repeat is encoded on the top DNA strand for 3 proviral genome segments (1p, 3p, and 5p) and on the bottom DNA strand for the remaining 5 proviral genome segments (2p, 4p, 6p, 7p, and 8p), the 5' and 3' boundaries of a proviral genome segment were defined as the first and second copy of the AGCTTT repeat relative to the sequence depicted in Figure <figr fid="F1">1</figr>. The 16 junction sequences were separated into 5' and 3' boundaries and searched using MEME, a motif discovery tool. An extended sequence motif centered on the AGCTTT repeat was identified in each group of sequences. The 5' and 3' motifs differ from each other, and the 5' motif is more conserved than the 3' motif. In both motifs, conservation was stronger and extended further on the segmental side of the excision site than on the inter-segmental side (Figure <figr fid="F3">3</figr>).</p>
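            <p>The junction repeat described above was identified by visual examination and then characterized with MEME. Purely as an illustration, a scan of the following kind could be used to list candidate copies of AGCTTT on either strand while tolerating a single substitution. This is a minimal sketch, not the procedure used in this study; the function names and the 0-based coordinate convention are assumptions.</p>
            <p>
```python
REPEAT = "AGCTTT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_repeat(seq, repeat=REPEAT, max_mismatches=1):
    """List (start, strand, mismatches) for windows resembling the repeat.

    start is a 0-based offset on the given (top) strand. A "-" hit means
    the window matches the reverse complement of the repeat, i.e. the
    repeat is encoded on the bottom strand.
    """
    seq = seq.upper()
    k = len(repeat)
    targets = (("+", repeat), ("-", reverse_complement(repeat)))
    hits = []
    for start in range(len(seq) - k + 1):
        window = seq[start:start + k]
        for strand, target in targets:
            mismatches = hamming(window, target)
            if mismatches > max_mismatches:
                continue
            hits.append((start, strand, mismatches))
    return hits
```
            </p>
            <p>Because a 6 bp pattern with one permitted mismatch occurs frequently by chance, such a scan is only informative when restricted to short windows around junctions that have already been located by other means.</p>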
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Nucleotide conservation extended 30 bp in both directions around the GCT excision site</p>
               </caption>
               <text>
                  <p><b>Nucleotide conservation extended 30 bp in both directions around the GCT excision site</b>. A) 5' motif of proviral genome segments in GiBV proviral locus 1, in which sequence to the left of the motif represents inter-segmental sequences and sequence to the right of the motif represents proviral genome segment sequences. B) 3' motif of proviral genome segments in GiBV proviral locus 1, in which the positions of inter-segmental and proviral genome segment sequences are reversed with respect to A). C) Extended motif from the 8 viral genome segments in proviral locus 1. D) Extended motif from all 30 CcBV viral genome segments. E) Extended motif from 13 of 15 MdBV viral genome segments.</p>
               </text>
               <graphic file="1471-2180-7-61-3"/>
            </fig>
            <p>MEME analysis of the 8 GiBV viral genome segment sequences revealed the presence of a single copy of the AGCTTT repeat surrounded by a motif recombined from the 5' and 3' motifs (Figure <figr fid="F3">3</figr>). Comparison of proviral and viral genome segment sequences showed that the two nucleotide polymorphisms present in the AGCTTT repeats of the proviral genome segment sequences also appear in the single copy of the repeat in the viral genome segment sequences. Specifically, the 5' repeat of segment 5p has a substitution at the fifth position, while the 3' repeat of segment 3p has a substitution at the first position, and both changes occur in the corresponding viral segment.</p>
            <p>MEME was also used to search the complete CcBV and MdBV viral genomes, and the 5 available viral genome segments of CiBV. A sequence motif highly similar to the recombined GiBV segment motif was found in all 30 CcBV viral genome segments and 13 out of 15 MdBV viral genome segments (Figure <figr fid="F3">3</figr>). No similar motif was found in CiBV, although described CiBV excision sites show varying degrees of conservation with respect to the AGCTTT repeat <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B40">40</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Annotation of proviral locus 1 and flanking DNA</p>
            </st>
            <p>Two previously described GiBV cDNAs (p325 and p494) expressed in infected gypsy moth larvae <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, which encode hypothetical proteins, map to multiple genes in proviral locus 1. These cDNAs provide direct evidence for the presence of 1 and 2 introns in the p325 and p494 gene families, respectively; p494 maps to 2 genes in proviral genome segment 2p, while p325 maps to 1 gene in each of proviral genome segments 3p, 4p, and 5p. The shortest and longest introns were 83 bp and 591 bp in length, respectively. Four variations of <it>ab initio </it>gene modeling programs were tested for their ability to recover the correct intron-exon structure of these 5 genes. A combination of Softberry's FGENESH trained on the honey bee (<it>Apis mellifera</it>) and the Beijing Genome Institute's BGF trained on the silkmoth (<it>Bombyx mori</it>) was most accurate, and these programs, in addition to protein alignments generated with the AAT package <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, were used to predict protein-coding gene models (see Materials and Methods).</p>
            <p>A total of 62 protein-coding genes were predicted to be encoded within the 8 proviral genome segments (Tables <tblr tid="T2">2</tblr> and <tblr tid="T3">3</tblr>). As judged by sequence similarity using BLASTP, 47 genes have homologs in CcBV, but only 3 genes, all of which are members of hypothetical family 4, have homologs in MdBV. A TBLASTN analysis of the predicted GiBV proteins against the MdBV and CcBV genomes showed no additional similarity to MdBV. However, of the 15 proteins that did not show BLASTP similarity to CcBV, 5 showed similarity to translated CcBV sequences, suggesting that homologs of these genes may exist in CcBV but were not previously predicted. The 10 remaining GiBV genes, which do not have homologs in CcBV, encode novel hypothetical proteins.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Annotation of proviral locus 1</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Gene identifier</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Region</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Size</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Introns</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Sigs</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Product</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Family</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>dN/dS</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00010</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>500</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>FL(2)D protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00020</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>369</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Trans-2-enoyl-CoA reductase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00030</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>240</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>oxidored-nitro domain-like protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00040</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>562</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00050</p>
                     </c>
                     <c ca="center">
                        <p>II</p>
                     </c>
                     <c ca="center">
                        <p>599</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>5' nucleotidase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00060</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>165</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00070</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>lectin-like protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00080</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>210</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00090</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>266</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00100</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>304</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>CrV1-like protein</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00110</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>161</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>Lectin C-type domain</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00120</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>138</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.81</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00130</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>133</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>Cystatin domain</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00140</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>341</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>CrV1-like protein</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00150</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>195</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>1.04</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00160</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>104</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00170</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>219</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00180</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00190</p>
                     </c>
                     <c ca="center">
                        <p>1p</p>
                     </c>
                     <c ca="center">
                        <p>198</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00200</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>143</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>u</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00210</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>494</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>P494 protein</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00220</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00230</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>147</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00240</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>582</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>P494 protein</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00250</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00260</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>147</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00270</p>
                     </c>
                     <c ca="center">
                        <p>2p</p>
                     </c>
                     <c ca="center">
                        <p>253</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00280</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>320</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00290</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>354</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>0.09</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00300</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>340</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>P325 protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00310</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>226</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00320</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>241</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00330</p>
                     </c>
                     <c ca="center">
                        <p>3p</p>
                     </c>
                     <c ca="center">
                        <p>444</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>2.12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00340</p>
                     </c>
                     <c ca="center">
                        <p>4p</p>
                     </c>
                     <c ca="center">
                        <p>337</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>P325 protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00350</p>
                     </c>
                     <c ca="center">
                        <p>4p</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00360</p>
                     </c>
                     <c ca="center">
                        <p>4p</p>
                     </c>
                     <c ca="center">
                        <p>597</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>Ribonuclease T2 domain</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>1.96</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00370</p>
                     </c>
                     <c ca="center">
                        <p>4p</p>
                     </c>
                     <c ca="center">
                        <p>898</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00380</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>166</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00390</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>171</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00400</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>430</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00410</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>247</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.06</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00420</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>215</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00430</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>108</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00440</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>767</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>lipoprotein-like protein</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00450</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>581</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>0.53</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00460</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>348</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>0.55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00470</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>304</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>P325 protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2.21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00480</p>
                     </c>
                     <c ca="center">
                        <p>5p</p>
                     </c>
                     <c ca="center">
                        <p>170</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00490</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>279</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>g</p>
                     </c>
                     <c ca="left">
                        <p>P325-like protein</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.35</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00500</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>109</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.18</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00510</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>140</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00520</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00530</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00540</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00550</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>293</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Ribonuclease T2 domain</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>0.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00560</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>118</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00570</p>
                     </c>
                     <c ca="center">
                        <p>6p</p>
                     </c>
                     <c ca="center">
                        <p>896</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00580</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>1066</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00590</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>478</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00600</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>119</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00610</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>109</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.59</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00620</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>218</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.74</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00630</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>496</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>0.58</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00640</p>
                     </c>
                     <c ca="center">
                        <p>7p</p>
                     </c>
                     <c ca="center">
                        <p>127</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00650</p>
                     </c>
                     <c ca="center">
                        <p>8p</p>
                     </c>
                     <c ca="center">
                        <p>253</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, t</p>
                     </c>
                     <c ca="left">
                        <p>EP1-like protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>6.01</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00660</p>
                     </c>
                     <c ca="center">
                        <p>8p</p>
                     </c>
                     <c ca="center">
                        <p>177</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s, g</p>
                     </c>
                     <c ca="left">
                        <p>conserved hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.92</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00670</p>
                     </c>
                     <c ca="center">
                        <p>8p</p>
                     </c>
                     <c ca="center">
                        <p>1132</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>dentin-like protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00680</p>
                     </c>
                     <c ca="center">
                        <p>III</p>
                     </c>
                     <c ca="center">
                        <p>599</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>s</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00690</p>
                     </c>
                     <c ca="center">
                        <p>III</p>
                     </c>
                     <c ca="center">
                        <p>130</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>hypothetical protein</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00700</p>
                     </c>
                     <c ca="center">
                        <p>IV</p>
                     </c>
                     <c ca="center">
                        <p>480</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>N-myristoyltransferase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GIP_L1_00710</p>
                     </c>
                     <c ca="center">
                        <p>IV</p>
                     </c>
                     <c ca="center">
                        <p>326</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Hyaluronidase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Gene identifier indicates the GenBank locus tag for each predicted gene. Region is the location of genes according to the delineations in Table 2. Sizes of the genes are given in amino acids. Signatures (Sigs) include "s" signal peptide, "t" trans-membrane domain, and "g" potential glycosylphosphatidylinositol anchor. Family indicates the gene family to which the predicted gene belongs, if any. dN/dS ratios are given when applicable; an "*" represents insufficient data to calculate a ratio, while a "u" represents a mathematically undefined ratio.</p>
               </tblfn>
            </tbl>
            <p>Only 10 of the 62 predicted GiBV genes in the locus were assigned a potential function, namely C-type lectins and proteins containing a cystatin or ribonuclease T2 domain (Table <tblr tid="T3">3</tblr>). Surprisingly, 50 genes were predicted to access the secretory pathway because they contained a signal peptide at the N-terminus. Of these, 6 genes were predicted to have trans-membrane domains, and 6 genes were predicted to have potential glycosylphosphatidylinositol anchors. Only 3 proviral genome segment genes were not predicted to contain introns; the remaining genes contained either 1 or 2 introns. A protein domain-based clustering pipeline placed 43 of the 62 proteins into 14 gene families (see Methods and Table <tblr tid="T3">3</tblr>). The distribution of members of these gene families was generally not restricted to specific proviral genome segments: 8 gene families, including all families with 4 members or more, were located on at least 2 non-adjoining proviral genome segments.</p>
            <p>Regions L1R1 and L1R2 and the inter-segmental regions were not predicted to contain protein-coding genes, nor did these sequences produce any significant matches when tested against the GenBank non-redundant protein database using BLASTX (E = e-10). On the other hand, regions I to IV were predicted to encode 9 genes, and a potential function was assigned to 6 of them (Table <tblr tid="T3">3</tblr>). These genes had a top BLAST hit to genes from <it>Apis mellifera </it>(BLASTP, E &lt; e-45), including the 5'-nucleotidase, trans-2-enoyl-CoA reductase, hyaluronidase, N-myristoyltransferase, and 1 hypothetical protein. By contrast, none of the genes encoded by the proviral genome segments had any sequence similarity to <it>A. mellifera </it>(BLASTP, E = e-10), other than proteins with conserved domains encoded in a large number of genomes (e.g., the C-type lectin domain). Four of the 6 genes are encoded on <it>A. mellifera </it>chromosome 14, although only the honey bee hyaluronidase and N-myristoyltransferase genes were located in close proximity to each other.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of sequence polymorphisms in GiBV viral genome segment sequences</p>
            </st>
            <p>Proviral genome segments in locus 1 share 99.5&#8211;99.9% sequence identity with their homologous viral genome segment sequences. The distribution of 2,159 SNPs in the 8 GiBV viral genome segment sequences relative to the corresponding proviral genome segment sequences is shown in Table <tblr tid="T4">4</tblr>. Viral genome segment 2 showed a low frequency of polymorphisms, averaging ~5 SNPs/kbp, while the remaining segments had an average SNP density of ~16 SNPs/kbp. The majority of genome segments showed no significant correlation between sequence coverage and SNP density (Table <tblr tid="T4">4</tblr>), with the exception of segment 1, which showed a slight correlation (R<sup>2 </sup>= 0.25, p &lt; 0.05). All SNPs were placed in one of three classes: non-coding, synonymous, and non-synonymous. As expected, there was a significantly higher SNP density in synonymous sites than in non-synonymous sites (&#967;<sup>2</sup><sub>1 df </sub>= 37.3, p &lt; 0.01). However, there was also a significantly higher SNP density in synonymous sites than in non-coding sites (&#967;<sup>2</sup><sub>1 df </sub>= 38.2, p &lt; 0.01), and no significant difference in SNP density between non-coding and non-synonymous sites (&#967;<sup>2</sup><sub>1 df </sub>= 1.8, p &gt; 0.05).</p>
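            <p>As an illustration of the kind of comparison reported above, the following minimal sketch computes a Pearson chi-square statistic (1 df) for SNP density in two site classes. It is not the analysis pipeline used in this study. The function name and the site totals are hypothetical placeholders introduced only for the example; only the SNP counts (250 synonymous, 558 non-synonymous) are taken from Table 4, so the resulting statistic is not expected to reproduce the values given in the text.</p>

```python
def chi_square_2x2(snps_a, sites_a, snps_b, sites_b):
    """Pearson chi-square statistic (1 df, no continuity correction)
    comparing SNP density between two site classes."""
    # Observed 2x2 table: one row per site class,
    # columns are polymorphic sites and non-polymorphic sites.
    observed = [
        [snps_a, sites_a - snps_a],
        [snps_b, sites_b - snps_b],
    ]
    row_totals = [sum(row) for row in observed]
    col_totals = [observed[0][j] + observed[1][j] for j in range(2)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under equal SNP density in both classes.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed[i][j] - expected) ** 2 / expected
    return statistic


# SNP counts: totals from Table 4.
SYNONYMOUS_SNPS = 250
NON_SYNONYMOUS_SNPS = 558

# HYPOTHETICAL numbers of sites per class (not from the article);
# chosen only so that the example runs.
HYPOTHETICAL_SYNONYMOUS_SITES = 12000
HYPOTHETICAL_NON_SYNONYMOUS_SITES = 36000

# Chi-square critical value for 1 df at p = 0.01.
CRITICAL_VALUE_1DF_P01 = 6.635

statistic_value = chi_square_2x2(
    SYNONYMOUS_SNPS,
    HYPOTHETICAL_SYNONYMOUS_SITES,
    NON_SYNONYMOUS_SNPS,
    HYPOTHETICAL_NON_SYNONYMOUS_SITES,
)
print("chi-square (1 df):", round(statistic_value, 2))
print("exceeds p = 0.01 critical value:", statistic_value > CRITICAL_VALUE_1DF_P01)
```

            <p>With real data, the placeholder site totals would be replaced by the number of synonymous and non-synonymous sites counted in the coding sequences, and the same function could be applied to the non-coding class.</p>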
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Single Nucleotide Polymorphisms (SNPs) in the viral genome segment sequences</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>
                           <b>GiBV genome segment</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>1</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>4</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>6</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>7</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>8</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Total</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SNPs</p>
                     </c>
                     <c ca="left">
                        <p>351</p>
                     </c>
                     <c ca="left">
                        <p>107</p>
                     </c>
                     <c ca="left">
                        <p>270</p>
                     </c>
                     <c ca="left">
                        <p>166</p>
                     </c>
                     <c ca="left">
                        <p>316</p>
                     </c>
                     <c ca="left">
                        <p>354</p>
                     </c>
                     <c ca="left">
                        <p>421</p>
                     </c>
                     <c ca="left">
                        <p>174</p>
                     </c>
                     <c ca="left">
                        <p>2159</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SNPs per kbp</p>
                     </c>
                     <c ca="left">
                        <p>17.53</p>
                     </c>
                     <c ca="left">
                        <p>4.55</p>
                     </c>
                     <c ca="left">
                        <p>16.52</p>
                     </c>
                     <c ca="left">
                        <p>12.18</p>
                     </c>
                     <c ca="left">
                        <p>12.27</p>
                     </c>
                     <c ca="left">
                        <p>18.32</p>
                     </c>
                     <c ca="left">
                        <p>18.79</p>
                     </c>
                     <c ca="left">
                        <p>17.40</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Non-Coding</p>
                     </c>
                     <c ca="left">
                        <p>232</p>
                     </c>
                     <c ca="left">
                        <p>91</p>
                     </c>
                     <c ca="left">
                        <p>149</p>
                     </c>
                     <c ca="left">
                        <p>74</p>
                     </c>
                     <c ca="left">
                        <p>195</p>
                     </c>
                     <c ca="left">
                        <p>239</p>
                     </c>
                     <c ca="left">
                        <p>269</p>
                     </c>
                     <c ca="left">
                        <p>102</p>
                     </c>
                     <c ca="left">
                        <p>1351</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Coding</p>
                     </c>
                     <c ca="left">
                        <p>119</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>121</p>
                     </c>
                     <c ca="left">
                        <p>92</p>
                     </c>
                     <c ca="left">
                        <p>121</p>
                     </c>
                     <c ca="left">
                        <p>115</p>
                     </c>
                     <c ca="left">
                        <p>152</p>
                     </c>
                     <c ca="left">
                        <p>72</p>
                     </c>
                     <c ca="left">
                        <p>808</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Synonymous</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>46</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>250</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Non-synonymous</p>
                     </c>
                     <c ca="left">
                        <p>83</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>84</p>
                     </c>
                     <c ca="left">
                        <p>65</p>
                     </c>
                     <c ca="left">
                        <p>77</p>
                     </c>
                     <c ca="left">
                        <p>73</p>
                     </c>
                     <c ca="left">
                        <p>106</p>
                     </c>
                     <c ca="left">
                        <p>57</p>
                     </c>
                     <c ca="left">
                        <p>558</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Coverage</p>
                     </c>
                     <c ca="left">
                        <p>10.1</p>
                     </c>
                     <c ca="left">
                        <p>11.5</p>
                     </c>
                     <c ca="left">
                        <p>9.8</p>
                     </c>
                     <c ca="left">
                        <p>9.8</p>
                     </c>
                     <c ca="left">
                        <p>16.3</p>
                     </c>
                     <c ca="left">
                        <p>10.9</p>
                     </c>
                     <c ca="left">
                        <p>10.3</p>
                     </c>
                     <c ca="left">
                        <p>5.2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R<sup>2</sup></p>
                     </c>
                     <c ca="left">
                        <p>0.25*</p>
                     </c>
                     <c ca="left">
                        <p>&lt; 0.01</p>
                     </c>
                     <c ca="left">
                        <p>0.14</p>
                     </c>
                     <c ca="left">
                        <p>&lt; 0.01</p>
                     </c>
                     <c ca="left">
                        <p>&lt; 0.01</p>
                     </c>
                     <c ca="left">
                        <p>0.05</p>
                     </c>
                     <c ca="left">
                        <p>0.02</p>
                     </c>
                     <c ca="left">
                        <p>0.08</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*p &lt; 0.05</p>
                  <p>Coverage indicates average sequence coverage across the viral genome segment in the whole genome shotgun, and R<sup>2 </sup>represents the correlation between the number of SNPs and sequence coverage. Only viral genome segment 1 showed a significant (p &lt; 0.05) correlation between SNP density and coverage.</p>
               </tblfn>
            </tbl>
            <p>The number of SNPs per gene ranged from 0 to 68, and dN/dS ratios were calculated for the 39 of 62 genes that contained 5 or more SNPs (Table <tblr tid="T3">3</tblr>). Most of these genes appear to be under purifying selection: 32 of 39 genes had a dN/dS ratio &lt; 0.8, with a majority of the ratios falling in the range of 0.40&#8211;0.59 (Figure <figr fid="F4">4</figr>). Three genes appear to be evolving neutrally (dN/dS = 0.8&#8211;1.2) and code for 2 hypothetical proteins and 1 member of gene family 3. Four genes had a dN/dS > 1.9, including 1 member each of gene families 1, 10, and 11 (the ribonuclease T2 domain) and an EP1-like protein. No correlation was found between dN/dS ratios and specific genome segments or gene families; most segments and gene families contained genes under different degrees of selection.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Histogram of dN/dS ratios of 39 genes in the viral genome segments</p>
               </caption>
               <text>
                  <p>
                     <b>Histogram of dN/dS ratios of 39 genes in the viral genome segments.</b>
                  </p>
               </text>
               <graphic file="1471-2180-7-61-4"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Not all GiBV proviral genome segments occur in a tandem array</p>
            </st>
            <p>Prior to this study, it was believed that bracovirus proviral genome segments were closely linked in a tandem array in the wasp genome with short stretches of intervening DNA separating them <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. Our study indicates that this is not the case for GiBV. While some GiBV proviral genome segment sequences are clustered in tandem arrays, others occur in isolation as singletons. This conclusion is supported by the segregation of BAC clones coding for 18 of ~24 proviral genome segments into 7 non-overlapping sets of clones via viral genome segment-specific PCRs (Table <tblr tid="T1">1</tblr>), and preliminary BAC shotgun sequence data support the typing data (not shown). Furthermore, although we describe a tandem array of proviral genome segments in this paper at GiBV proviral locus 1, the array codes for only 8 proviral genome segment sequences, and this cluster is flanked by at least 34 kbp and 25 kbp of DNA (Figure <figr fid="F1">1</figr>) that is not packaged into GiBV virions. It remains to be determined whether the 7 loci encoding GiBV proviral segment sequences are linked on the same chromosome as a macrolocus but with longer stretches of intervening DNA between them, or whether they are dispersed across more than one chromosome. Although the former scenario remains compatible with a study of <it>C. congregata </it>where probes from 3 different viral genome segments bound to the same location on <it>C. congregata </it>chromosome 5 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, the structural organization of BV proviral genome segment sequences appears to be more complex than previously hypothesized.</p>
            <p>It is reasonable to propose that the inter-segmental regions in GiBV proviral locus 1 should be classified as part of the GiBV proviral genome. However, to what extent the proviral genome extends into flanking DNA is less easily determined. BV viral genome segments are thought to be excised from the amplified products of one or more large precursor molecules, and there is no evidence for post-excision amplification of segments <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. Thus copy number studies of regions immediately flanking GiBV proviral locus 1 and other loci containing proviral genome segment sequences at the time of viral genome segment formation could be used as a surrogate marker for identifying potential components of the GiBV proviral genome.</p>
         </sec>
         <sec>
            <st>
               <p>Gene content of proviral locus 1 and flanking regions</p>
            </st>
            <p>Due to the limited transcriptional data available for BVs, there is substantial disagreement on the structural complexity of BV genes, particularly with regard to the percentage of PDV genes that contain introns. While Espagne <it>et al </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp> predicted that 69% of CcBV proteins contain introns, Webb <it>et al </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp> re-annotated the CcBV genome and predicted only 6.8% of CcBV genes contain introns&#8211;a ten-fold difference in intron content. In GiBV proviral locus 1, using a combination of Hymenoptera- and Lepidoptera-trained gene prediction programs (see Methods), we predicted that 81% of the 63 genes contain introns. Sequence data from 2 cDNAs derived from genes in proviral locus 1 suggest that the 7 introns predicted for 5 members of the 2 gene families are real and not artifacts of improper gene modeling. However, this number is probably not reflective of the entire GiBV genome, as PTP and ankyrin genes usually do not contain introns <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp> and generally comprise a large percentage of BV genes (21% and 41% of predicted CcBV and MdBV genes, respectively), but are not present in GiBV proviral locus 1. Regardless, the accuracy of most predicted gene models awaits experimental verification. While the presence of introns may be unusual for virus genes, some DNA viruses which replicate in the host cell nucleus encode genes with introns (e.g., adenoviruses <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>).</p>
            <p>GiBV genes in proviral locus 1 predicted to contain introns have an extremely simple intron-exon structure compared to often complex higher eukaryotic genes, and generally contain a single short exon followed by a long exon encoding the remainder of the protein. Remarkably, 80% of the genes at this locus, including the p494 and p325 gene families which are transcribed in infected gypsy moth larvae, are predicted to encode a secretion signal peptide within the first exon. Secretion of some proteins may compensate for differences in the abundance of segment sequences in virions. Since it is unclear whether the entirety of the GiBV genome is packaged into a single virion <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, secretion of a large number of proteins may be necessary for proper delivery of these proteins. Attempts to functionally annotate the 62 predicted genes in the 8 GiBV proviral genome segment sequences identified the presence of a C-type lectin <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, CrV1-like proteins <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, and a number of conserved hypothetical proteins encoded by other PDV genomes <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B50">50</abbr></abbrgrp>. Most of the genes in locus 1 were predicted to have homologs in CcBV, while only gene family 4 showed homology to a gene on MdBV segment B. Although the function of this gene family is unknown, it is the only gene family in GiBV proviral locus 1 for which none of the members are predicted to contain signal peptides.</p>
            <p>The placement of 43 GiBV genes into 14 gene families suggests that extensive duplication of genes has occurred within proviral locus 1. Typically, gene duplications are thought to result in relaxation of the selection on the duplicated gene, allowing it to acquire a new function. However, the majority of genes in proviral locus 1, even multiple members of the same gene family, appear to be under purifying selection (Figure <figr fid="F4">4</figr>). This implies that members of gene families are, for the most part, not free to acquire entirely new functions but may play different roles within the constraints of their gene family, such as differential targeting as seen in some inhibitors of NF-&#954;Bs <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> or differential expression as seen in some PTPs <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Alternatively, conserved function across duplicated genes may be important for increasing the level of expression of functional classes of genes <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B51">51</abbr></abbrgrp>. Despite the large proportion of genes under purifying selection, 7 genes appear to be evolving neutrally or under diversifying selection, potentially allowing a limited set of genes to acquire new functions or adapt to changes in host defenses.</p>
            <p>The inter-segmental regions which separate the proviral genome segments are not predicted to contain protein coding genes. However, regions that map outside of proviral locus 1 are predicted to contain 9 genes, and potential function has been assigned to some of them, e.g., N-myristoyltransferase, ecto-5'-nucleotidase, and hyaluronidase (Figure <figr fid="F1">1</figr>). It is interesting to note that viral proteins are often modified with a lipid tail, hyaluronidase is a component of venom <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> that hydrolyzes complex carbohydrate structures allowing tissue diffusion, and ecto-5'-nucleotidase is involved in the extracellular formation of adenosine, a regulator of innate immune responses <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. It is unclear whether these regions constitute part of the GiBV proviral genome but there is a striking difference in the structural complexity of these predicted gene models and those present in proviral locus 1. Nevertheless, it is tempting to speculate that proteins encoded in the flanking regions, perhaps as components of ovarian fluids, and genes that are located close to other proviral segment loci, may play a role in GiBV biology. Also notable is a sex-linked wasp gene coding for a homolog of female-lethal(2) [<it>fl(2)d</it>] that is present in region I. In <it>Drosophila fl(2)d </it>plays a critical role in alternative splicing regulation of genes involved in sex determination (including <it>Sex-lethal </it>and <it>transformer</it>), dosage compensation, oogenesis, and differentiation, as well as non sex-specific functions, and is expressed throughout larval and adult life <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. 
Since excision of proviral genome segments from the wasp chromosome and encapsidation into virion particles occurs only in females, it is possible that regulation of this sex-linked process is related, at least in part, to expression of <it>fl(2)d</it>.</p>
         </sec>
         <sec>
            <st>
               <p>A proviral genome segment excision motif is highly conserved across bracoviruses</p>
            </st>
            <p>A near-perfect AGCTTT direct DNA sequence repeat was discovered at the boundaries of proviral genome segment sequences and flanking sequences (Figure <figr fid="F3">3</figr>). As the viral genome segment sequences contain a single copy of this repeat, it appears to define the site of proviral genome segment excision. This suggests an excision mechanism via conservative site-specific recombination as described for formation of other PDV genome segments <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B33">33</abbr><abbr bid="B39">39</abbr></abbrgrp>. The presence of two SNPs within this repeat at the junction of proviral segment sequences, and the ability to follow these nucleotide differences from the proviral to the viral genome segments, suggest that the site of proviral genome segment excision and circularization must be located between the first and fifth position within the AGCTTT repeat (Figure <figr fid="F3">3</figr>). A study of excision sites in CiBV similarly concluded that GCT was the preferred site of excision <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
            <p>An extended but different sequence motif around the excision site was identified at the 5' and 3' proviral genome segment junction sequences using MEME, and the recombined sequence motif is found on viral genome segment sequences (Figure <figr fid="F3">3</figr>). While sequence conservation exists on both sides of excision sites, a higher level of conservation is seen in the side of the motif which is retained in the circularized segment, and in particular at the 5' junction. The asymmetry of the 5' and 3' sequence motifs suggests that there is directionality to the recognition of excision sites. Since recombined sites have a different motif, we predict they are no longer substrates for the excision enzymes. Excision and circularization of segments from a large precursor molecule could occur via release of single segments or a smaller molecule containing multiple segments. In the latter case, the segments flanking the site of circularization would no longer be available for excision. For example, if a molecule encompassing 1p through 3p in proviral locus 1 were excised, only 2p would remain a substrate for subsequent excision and circularization (Figure <figr fid="F1">1</figr>). Such a pathway could contribute to differences in the abundance of packaged viral genome segments, but it portrays a complex scenario. Assuming that sequence coverage of a viral genome segment in our shotgun sequencing approach correlates with the abundance of the segment, it is interesting to note that the GiBV viral genome segments encoded in proviral locus 1 appear to be present in about the same levels (Table <tblr tid="T4">4</tblr>), suggesting that generation of intermediate excision products is not a common occurrence. The sequencing data also suggest that intermediates or by-products of excision, if they occur, are excluded from the packaging process, perhaps by the presence of inter-segmental DNA.</p>
            <p>We found that the predicted site of excision/circularization and the recombined extended motif present in GiBV viral genome segments are also present in CcBV and MdBV viral genome segments (Figure <figr fid="F3">3</figr>). Conservation of the GCT portion of the excision repeat sequence exists in the CiBV viral genome segment sequences that are available <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, although more CiBV sequences will be required to determine how closely the CiBV extended motif mirrors that of GiBV, CcBV, and MdBV. As <it>C. congregata</it>, <it>G. indiensis</it>, and <it>M. demolitor </it>are all members of Microgasterinae, the most derived clade of bracovirus-bearing braconids, and <it>C. inanitus </it>is a member of Cheloninae, the most basal clade of the bracovirus-bearing wasps <abbrgrp><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>, it is possible that the predicted excision motif is one of the very few sequence features that is highly conserved across bracoviruses, and provides additional support for the hypothesis that bracoviruses have a single evolutionary origin <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B60">60</abbr></abbrgrp>. This observation also predicts conservation of the enzyme(s) involved in BV proviral genome segment excision and circularization.</p>
         </sec>
         <sec>
            <st>
               <p>Selective pressure on non-coding DNA in proviral segment sequences in locus 1</p>
            </st>
            <p>Analysis of SNP data derived from sequencing the GiBV viral genome from an outbred population of female wasps revealed that non-coding sites in the 8 viral genome segments derived from locus 1 had a significantly lower SNP density than synonymous sites within coding DNA. As we presume synonymous sites to be evolving neutrally, this result suggests that there is likely to be selective pressure on non-coding DNA. The lack of difference between rates of change at non-coding and non-synonymous sites suggests that in these segment sequences, non-coding DNA may be as highly conserved as coding DNA. Such areas could encode non-coding RNAs or contain sequence motifs vital to DNA replication, gene expression or segment packaging. Limited experimental evidence supports the idea that PDV non-coding DNA is functional: studies of CsIV segment B found 2 sequences of 0.6 and 1.2 kbp which are transcribed but do not encode proteins <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Proviral locus 1&#8211;a genetic island?</p>
            </st>
            <p>Several differences between the cluster of 8 GiBV proviral segment sequences, which are excised and packaged into virus particles, and flanking DNA suggest that proviral segment sequences are not simply host genetic elements evolved for the export of wasp parasitism genes. For example, the proviral segments exhibit similar nucleotide compositions to each other but their G+C composition and dinucleotide frequencies differ from those of inter-segmental regions and flanking regions I-IV (Table <tblr tid="T2">2</tblr> and Figure <figr fid="F2">2</figr>). Given the estimated age of the integration of bracoviruses into the wasp genome, ~74 million years, and using substitution rates estimated from <it>Drosophila </it><abbrgrp><abbr bid="B63">63</abbr></abbrgrp>, one would predict that a sufficient period of time has passed for the process of amelioration, i.e., the adjustment over time of the nucleotide composition of the integrated DNA to that of the resident genome <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>, to have occurred. The different nucleotide composition of the proviral segment sequences may be maintained, or its amelioration may be slowed, by the purifying selection found to be acting on both non-coding and coding DNA. However, as differences in nucleotide composition can be caused by different origins of DNA <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> or by the widespread purifying selection itself <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>, the origins of the compositional differences between proviral and flanking DNA remain to be determined. Additionally, it is possible that inter-segmental and flanking regions, rather than the proviral segment sequences, differ from the remainder of the wasp genome.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Here we provide the first report of a 223 kbp region of genomic DNA from the braconid <it>Glyptapanteles indiensis</it>, and the characterization of a cluster of 8 proviral genome segments encoded within it. Our data show that, contrary to current concepts of bracovirus proviral genome organization, the proviral segments are not entirely contained within a single tandem array in the wasp genome. However, it remains unclear whether the multiple GiBV proviral loci are linked on a single wasp chromosome as a macrolocus, and how representative this pattern is of BVs as a whole. The dispersed nature of GiBV proviral genome segments raises the question as to how to define proviral DNA within the wasp genome. It is reasonable to propose that sequences which can be shown to be physically linked to proviral genome segment sequences within amplified precursor molecules should be classified as part of the proviral genome. Whether such studies will reveal the entire composition of a proviral genome remains to be determined, as it is not known whether all genes involved in virion formation are components of precursor molecules.</p>
         <p>Our study provides, for the first time, evidence for widespread purifying selection acting on BV non-coding DNA, suggesting that a large amount of the non-coding DNA in bracoviral genomes may be functional. Our analysis also reveals a variety of notable differences between flanking and proviral genome segment sequences. We hypothesize that selection acting on proviral DNA is maintaining the distinctive nucleotide composition of the proviral genome. However, the origins of these differences remain unknown. Neither proviral locus 1 nor any of the BV viral genomes sequenced to date encode homologs of known viral coat proteins or components of a transcription or DNA replication machinery, which are often the only genes conserved enough for viral phylogenetic studies. Identification of genes that perform these functions in <it>Glyptapanteles indiensis </it>will be essential for determining whether GiBV has a viral or cellular origin. As multiple lines of evidence, including the conserved excision motif described herein, support the hypothesis of a single evolutionary origin of BVs, an understanding of the evolutionary history of GiBV will reveal much about the evolution of BVs as a whole.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Rearing of parasitoid wasps</p>
            </st>
            <p>Outbred populations of <it>Glyptapanteles indiensis</it>, solitary endoparasitoids of gypsy moths (<it>Lymantria dispar</it>), were maintained at the USDA-ARS-Beneficial Insects Introduction Research Unit, Newark, Delaware, as part of a biocontrol program. The colony was founded in May 1998 from a shipment of 168 moths collected from 4 localities in India. In May 2002, the colony was outcrossed with 242 moths collected from the same localities. The mean colony size was 400 with an average sex ratio of 7 females:13 males. Host larvae were fed on a high wheat-germ diet. Both wasp and host larvae were maintained at 26&#176;C, 58% relative humidity, and a light-dark (L:D) cycle of 16L:8D hr according to established protocol <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. <it>G. indiensis </it>parasitize late first instar gypsy moth larvae. Cocoons formed from parasitized hosts were stored at 24&#176;C until adult parasitoid emergence and then separated by sex. <it>G. indiensis </it>larvae were dissected from parasitized hosts 10 days post parasitization, briefly rinsed in phosphate buffered saline (PBS), flash frozen in liquid nitrogen and stored frozen at -80&#176;C.</p>
         </sec>
         <sec>
            <st>
               <p>Virion purification and DNA extraction</p>
            </st>
            <p>Virions were purified from <it>G. indiensis </it>females using established protocols <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Briefly, female wasps were anaesthetized in 75% ethanol and rinsed in PBS. Ovaries were dissected from the females in a drop of PBS and ruptured, draining the calyx fluid. Pooled calyx fluid was subsequently filtered through a 0.45 &#956;m filter to remove eggs and cellular debris <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. Viral DNA was extracted according to established protocol <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Briefly, viral DNA was isolated from the calyx fluid using a proteinase K/SDS buffer, DNA was extracted with phenol, precipitated with ethanol, and recovered by centrifugation.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of BAC clones containing proviral DNA</p>
            </st>
            <p>A BAC library of <it>G. indiensis </it>with a 120 kb average insert size was constructed by Amplicon Express <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>, using a partial <it>Bam</it>HI digest inserted into an <it>Mbo</it>I site of a pECBAC1 vector. A nylon filter arrayed with 9,216 BAC clones was created from the library. To identify BAC clones containing proviral DNA, GiBV viral DNA was radioactively labeled with <sup>32</sup>P-labeled &#945;-dCTP (NEN/Perkin-Elmer) using the Redi-prime II DNA labeling kit (Amersham Biosciences). Labeled DNA was then purified using a QIAquick PCR purification kit (Qiagen). The filter was pre-hybridized at 65&#176;C for at least 3 hours with Rapid-hyb Buffer (Amersham Biosciences) and 500 &#956;g of salmon testes DNA (denatured at 100&#176;C, Sigma-Aldrich). The probe was added and allowed to hybridize overnight at 65&#176;C. The filter was then washed 2 times for 60 minutes each at 65&#176;C with a 0.1 &#215; SSC/0.1% SDS solution, wrapped in plastic wrap, and autoradiographed using Kodak BioMax MS film.</p>
         </sec>
         <sec>
            <st>
               <p>BAC DNA preparation and fingerprinting</p>
            </st>
            <p>BAC clones were grown in 5 ml LB with 12.5 &#956;g/ml chloramphenicol overnight at 37&#176;C with shaking at 200 rpm. BAC DNA was extracted using the Sigma Phaseprep BAC DNA Kit (Sigma-Aldrich) without the endotoxin removal step. BAC DNA was digested with <it>EcoR</it>I (Invitrogen) in a 1:150 dilution of RNase cocktail (Sigma Phaseprep Kit) at 37&#176;C for 2 hours. Digested DNA was run overnight on a 1.2% agarose gel, stained with Vistra Green and imaged using a FluorImager SI (Amersham Biosciences). Gel images were processed using Image <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, and contigs were assembled using FingerPrintContig <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> using the default e-value of e-10.</p>
         </sec>
         <sec>
            <st>
               <p>GiBV and BAC clone sequencing</p>
            </st>
            <p>Approximately 7.5 &#956;g of GiBV DNA was sheared, and DNA fragments in the size range of 3.5&#8211;4.5 kbp were purified after separation by agarose gel electrophoresis. The fragments were blunt ended and, after addition of <it>Bst</it>XI adaptors, cloned into the <it>Bst</it>XI site of pHOS2. Shotgun libraries were made from the 2 BAC clones as described for GiBV DNA. Celera Assembler <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> and TIGR Assembler <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> were used to assemble random sequence data from the viral whole genome shotgun and BAC clones, respectively. Gap closure was assisted by a closure editor tool called Cloe that also permits the manual inspection and editing of sequence data. A variety of methods were used to close gaps including re-sequencing the ends of random clones, transposon assisted sequencing (GPS, New England Biolabs&#8482;) or "micro-library" construction of single or pooled templates, and conversion of physical gaps to sequence gaps using "POMP" (pipette optimal multiplex PCR) <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> and/or a "Genome Walker" kit (Invitrogen&#8482;).</p>
         </sec>
         <sec>
            <st>
               <p>GiBV segment-specific PCRs</p>
            </st>
            <p>Primers were developed to be specific to 19 GiBV viral genome segment sequences. Primers were designed to be 22&#8211;26 nt in length, have a Tm of 62&#8211;65&#176;C, a GC clamp, and a maximum identity to the remainder of the unclosed GiBV genome of 70%. Designed primers were tested for potential secondary structure using NetPrimer <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. PCR was performed in a 10 &#956;l solution which included 0.1 &#956;l template DNA, 0.3 &#956;l 50 mM MgCl<sub>2</sub>, 1 &#956;l 10 &#215; PCR buffer, 0.2 &#956;l 10 mM dNTPs, 7.9 &#956;l H<sub>2</sub>O, 0.1 &#956;l Platinum Taq (Invitrogen), 0.2 &#956;l F primer (20 pmol/&#956;l), and 0.2 &#956;l RC primer (20 pmol/&#956;l). The PCR protocol was 94&#176;C for 2 min; 35 cycles of 94&#176;C for 30 sec, 58&#176;C for 30 sec, 72&#176;C for 45 sec; followed by 72&#176;C for 7 min.</p>
         </sec>
         <sec>
            <st>
               <p>Derivation of consensus GiBV segment sequences</p>
            </st>
            <p>As shotgun sequencing of the GiBV DNA was carried out using a sample pooled from a population of ~400 wasps, a large number of SNPs and indels were present in the sequence assembly. Because individual sequence reads could not be associated with individual wasps, a conical consensus sequence was generated for each viral genome segment using the SliceTools package <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. At a given position in a conical consensus, all bases with a cumulative quality value within 50% of the highest cumulative quality value are assigned to that position.</p>
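            <p>As an illustration only, the rule described above can be sketched in Python for a single alignment column. This is not the SliceTools implementation: the function name, the input layout of (base, quality) pairs, and the reading of "within 50%" as "at least half of the highest cumulative quality value" are assumptions made for this sketch.</p>
            <p>
```python
from collections import defaultdict


def conical_consensus_column(calls, fraction=0.5):
    """Return the bases assigned to one column (hypothetical helper).

    calls: iterable of (base, quality) pairs, one per read covering the column.
    fraction: assumed reading of "within 50%"; a base is kept when its
    cumulative quality is at least fraction * the highest cumulative quality.
    """
    # Sum the quality values of all reads supporting each base.
    totals = defaultdict(int)
    for base, quality in calls:
        totals[base.upper()] += quality

    # An uncovered column has no bases to assign.
    if not totals:
        return ""

    # Keep every base whose cumulative quality reaches the threshold.
    threshold = fraction * max(totals.values())
    kept = [base for base, total in totals.items() if total >= threshold]
    return "".join(sorted(kept))


# Example column: two reads call A, one calls G, one calls T.
column = [("A", 30), ("A", 25), ("G", 40), ("T", 10)]
print(conical_consensus_column(column))
```
            </p>
            <p>For this example column, the cumulative quality values are 55 for A, 40 for G, and 10 for T, so under the assumed reading both A and G are retained at that position while T is discarded.</p>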
         </sec>
         <sec>
            <st>
               <p>Annotation</p>
            </st>
            <p>Gene models were generated with a variety of software: Softberry's FGENESH <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> using both the honey bee (<it>Apis mellifera</it>) and fruit fly (<it>Drosophila melanogaster</it>) training sets, the Beijing Genome Institute's BGF <abbrgrp><abbr bid="B78">78</abbr></abbrgrp> trained on the silkmoth (<it>Bombyx mori</it>), and GENSCAN <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> using the vertebrate training set. Predicted gene models were compared to gene models generated using cDNA from 2 gene families for their ability to predict correct intron-exon structure. Most of the gene finders accurately predicted the 2 intron structure of the p494 genes, with the exception of GENSCAN, which predicted an extra exon. The single intron in p325 genes was significantly more difficult to predict &#8211; only FGENESH (<it>A. mellifera</it>) and BGF properly predicted these genes. FGENESH (<it>D. melanogaster</it>) and GENSCAN both mis-predicted the majority of intron-exon boundaries and showed a tendency to combine multiple genes into single genes with a large number of introns. Based on these results, a combination of FGENESH (<it>A. mellifera</it>) and BGF was used for gene prediction, in addition to the AAT package <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, which allows spliced alignment of proteins to genomic DNA, thereby revealing potential exon-intron boundaries. Gene models from FGENESH were generally accepted except when multiple other sources of information contradicted those models. SignalP <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr></abbrgrp>, TM-HMM <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>, and GPI-SOM <abbrgrp><abbr bid="B83">83</abbr></abbrgrp> were used to predict signal peptides, transmembrane domains, and glycosylphosphatidylinositol anchors, respectively. 
Predicted genes were clustered into gene families using previously described methods <abbrgrp><abbr bid="B84">84</abbr></abbrgrp>, which utilize Pfam <abbrgrp><abbr bid="B85">85</abbr></abbrgrp> and TIGRFAM <abbrgrp><abbr bid="B86">86</abbr></abbrgrp> domains and calculate novel shared domains within the genome. Predicted GiBV proviral segment genes were analyzed for potential homology to genes in CcBV, MdBV, and CsIV using BLASTP (CcBV only) and TBLASTN (CcBV and MdBV), with a cutoff of E = e-10.</p>
         </sec>
         <sec>
            <st>
               <p>Nucleotide composition analysis</p>
            </st>
            <p>Relative dinucleotide frequencies <abbrgrp><abbr bid="B87">87</abbr></abbrgrp> were calculated for each region > 500 bp in length except the flanking repeats, as they are expected to have highly biased dinucleotide frequencies. A Euclidean distance matrix between the regions was constructed from these frequencies. Regions were then clustered using the neighbor-joining algorithm in PAUP* <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>, and the resulting tree was visualized using PHY&#183;FI <abbrgrp><abbr bid="B89">89</abbr></abbrgrp>.</p>
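            <p>The distance calculation described above can be sketched as follows. This is an illustration only: it assumes the relative frequency of a dinucleotide XY is the odds ratio f(XY) / (f(X) f(Y)) computed on a single strand, which may differ in detail from the cited method, and the function names are hypothetical. Tree building in PAUP* is not reproduced.</p>

```python
from itertools import product
from math import sqrt

BASES = "ACGT"
DINUCS = ["".join(p) for p in product(BASES, repeat=2)]  # AA, AC, ..., TT


def relative_dinuc_freqs(seq):
    """Return 16 observed/expected dinucleotide ratios for one region.

    Assumed definition: rho(XY) = f(XY) / (f(X) * f(Y)), single strand.
    Dinucleotides whose expected frequency is zero are reported as 0.0.
    """
    seq = seq.upper()
    if len(seq) in (0, 1):
        raise ValueError("need at least 2 bases")
    base_f = {b: seq.count(b) / len(seq) for b in BASES}
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]  # overlapping pairs
    ratios = []
    for d in DINUCS:
        observed = pairs.count(d) / len(pairs)
        expected = base_f[d[0]] * base_f[d[1]]
        ratios.append(observed / expected if expected else 0.0)
    return ratios


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_matrix(regions):
    """Pairwise Euclidean distances between the regions' ratio vectors."""
    vectors = [relative_dinuc_freqs(s) for s in regions]
    return [[euclidean(u, v) for v in vectors] for u in vectors]


# Toy input; real regions in the analysis are longer than 500 bp.
for row in distance_matrix(["ACGTACGTAC", "AAAACCCCGG", "ATATATATAT"]):
    print([round(x, 3) for x in row])
```

            <p>The resulting symmetric matrix, with zeros on the diagonal, is the kind of input a neighbor-joining implementation expects.</p>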
         </sec>
         <sec>
            <st>
               <p>Motif analysis</p>
            </st>
            <p>Boundaries between the proviral segments and inter-segmental regions, and the inter-segmental regions themselves, were analyzed for motifs using MEME <abbrgrp><abbr bid="B90">90</abbr></abbrgrp>. In the first analysis a 103 bp DNA sequence (50 bp upstream to 50 bp downstream of the GCT excision motif) was extracted from each segmental boundary. The boundaries of proviral segments 1p, 3p, and 5p were reverse complemented so that orientation of the excision motif was the same for all sequences. All 16 sequences were analyzed together, and then split into 8 5' (upstream) and 8 3' (downstream) motifs relative to the directionality of the excision motif. Next, an analysis was conducted using the entire length of the 7 inter-segmental regions. Analyses used a minimal and maximal motif length of 5 and 100 bp, respectively. MEME was also used to search 30 CcBV [Genbank:<ext-link ext-link-type="gen" ext-link-id="AJ632304">AJ632304</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="AJ632333">AJ632333</ext-link>], 15 MdBV [Genbank:<ext-link ext-link-type="gen" ext-link-id="AY887894">AY887894</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY875680">AY875680</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="AY875690">AY875690</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY848690">AY848690</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY842013">AY842013</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ000240">DQ000240</ext-link>], and 5 CiBV viral genome segments [Genbank:<ext-link ext-link-type="gen" ext-link-id="AJ627175">AJ627175</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AJ278677">AJ278677</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AJ319654">AJ319654</ext-link>, <ext-link ext-link-type="gen" ext-link-id="Z58828">Z58828</ext-link>, <ext-link ext-link-type="gen" ext-link-id="Z31378">Z31378</ext-link>] for common motifs. 
All motifs were visualized using WebLogo <abbrgrp><abbr bid="B91">91</abbr><abbr bid="B92">92</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>SNP analysis</p>
            </st>
            <p>Ambiguous consensus sequences were generated from the viral genome sequence by recalling contigs so that all high quality (quality value &#8805; 30) base calls in the reads were represented in the new consensus as ambiguity codes. This ensured all variants of a given circle were encoded within a single consensus sequence, while preventing low quality sequencing error from introducing artificial polymorphisms. Then, the ambiguous viral genome segment consensus sequences were globally aligned to their corresponding proviral genome segment sequences using nucmer from the MUMmer package <abbrgrp><abbr bid="B93">93</abbr></abbrgrp>. This alignment was parsed to determine the positions of all polymorphisms relative to the reference proviral sequence, including both substitutions and indels. Substitutions were identified as mismatches in the alignment between the viral consensus sequence and proviral reference sequence. The distribution of polymorphisms was analyzed using the gene-snps tool from the AMOS package <abbrgrp><abbr bid="B94">94</abbr></abbrgrp>. The tool examines each polymorphism to determine if it occurs within an exon, and if so, whether the change is synonymous or non-synonymous. Additionally, the tool estimates dN/dS for each gene using the unweighted pathway method <abbrgrp><abbr bid="B95">95</abbr></abbrgrp>. The final analysis the tool performs is a test of independence between SNP density and sequence coverage (i.e., whether more sequences covering a given position make that position more likely to contain a polymorphism). To do so, it computes Pearson's correlation of the polymorphism rate and depth of coverage using a sliding window of size 500 bp offset by 250 bp across each circle. Statistical significance of the correlation coefficients was evaluated using a 2-tailed t test, where degrees of freedom equals the number of SNPs minus two. 
Differences between the relative number of substitutions of non-coding, synonymous, and non-synonymous sites were evaluated using Pearson's &#967;<sup>2</sup> test.</p>
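            <p>The coverage test can be sketched as below. This is not the gene-snps code from AMOS; the function names, the 0-based coordinates, and the per-base coverage list are assumptions made for illustration. The degrees of freedom are passed in by the caller, and the p-value lookup from the t distribution is omitted.</p>

```python
from math import sqrt


def windows(length, size=500, step=250):
    """Yield (start, end) for full windows of `size` bp offset by `step` bp."""
    for start in range(0, length - size + 1, step):
        yield start, start + size


def window_stats(snp_positions, coverage, size=500, step=250):
    """Return per-window polymorphism rate and mean depth of coverage.

    `snp_positions` holds 0-based polymorphic positions; `coverage` is a
    list with one read-depth value per base of the segment.
    """
    snps = set(snp_positions)
    rates, depths = [], []
    for start, end in windows(len(coverage), size, step):
        rates.append(sum(1 for p in range(start, end) if p in snps) / size)
        depths.append(sum(coverage[start:end]) / size)
    return rates, depths


def pearson_r(xs, ys):
    """Pearson correlation; undefined (division by zero) if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def t_statistic(r, df):
    """t statistic for a correlation coefficient r; undefined when r is exactly 1 or -1."""
    return r * sqrt(df / (1 - r * r))


# Toy segment of 1,000 bp with deeper coverage over the second half.
cov = [10] * 500 + [20] * 500
rates, depths = window_stats([10, 600, 610, 900], cov)
r = pearson_r(rates, depths)
print(rates, depths, round(r, 3))
```

            <p>A correlation near zero would indicate that polymorphism density is not simply a by-product of sequencing depth.</p>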
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>PDV, polydnavirus; BV, bracovirus; IV, ichnovirus; GiBV, <it>Glyptapanteles indiensis </it>bracovirus; CcBV, <it>Cotesia congregata </it>bracovirus; MdBV, <it>Microplitis demolitor </it>bracovirus; CiBV, <it>Chelonus inanitus </it>bracovirus; CsIV, <it>Campoletis sonorensis </it>ichnovirus; SNP, single nucleotide polymorphism; dN/dS, ratio of non-synonymous to synonymous substitutions; PTP, protein tyrosine phosphatase</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>VN and DEGR conceived the project. VN, CAD, and DEGR coordinated the project. CAD, DEGR, VN, and MJP designed and performed laboratory procedures and experiments. CAD, MCS, and VN designed and performed computational analyses. CAD, VN, and DEGR wrote the manuscript. CAD and BJH participated in annotation. JBH and LJT participated in genome closure. RWF reared parasitoids. DWF, BST, and DC participated in library construction. All authors read and approved this manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Teresa Utterback, Tamara Feldblyum, and the staff at J. Craig Venter Institute's Joint Technology Center for sequencing and viral library construction, and the JCVI IT department for general support. We would also like to thank Dongying Wu for providing bioinformatics tools, Jessica Vamathevan and Mihai Pop for initial work on viral genome closure and analysis, Hean Koo for handling sequence submissions, Linda Hannick for help with gene family computation, Joana Silva for advice on SNP analysis, and Jonathan Badger and 3 anonymous reviewers for comments on the manuscript. Funding for this study was provided by the National Science Foundation (0413618) and United States Department of Agriculture (2004-35600-15032).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Vamathevan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Riegler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Deboy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Brownlie</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>McGraw</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Esser</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ahmadinejad</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wiegand</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Beanan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Mohamoud</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weidman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>O'Neill</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <issue>3</issue>
            <fpage>E69</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">368164</pubid>
                  <pubid idtype="pmpid" link="fulltext">15024419</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020069</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Serendipitous discovery of Wolbachia genomes in multiple Drosophila species</p>
            </title>
            <aug>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hotopp</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Pop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>3</issue>
            <fpage>R23</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088942</pubid>
                  <pubid idtype="pmpid" link="fulltext">15774024</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-3-r23</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Metabolic Complementarity and Genomics of the Dual Bacterial Symbiosis of Sharpshooters</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Pai</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Watkins</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tallon</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Zaborsky</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Dunbar</snm>
                  <fnm>HE</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <issue>6</issue>
            <fpage>e188</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1472245</pubid>
                  <pubid idtype="pmpid" link="fulltext">16729848</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040188</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genome sequence of the endocellular bacterial symbiont of aphids Buchnera sp. APS</p>
            </title>
            <aug>
               <au>
                  <snm>Shigenobu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <issue>6800</issue>
            <fpage>81</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35024074</pubid>
                  <pubid idtype="pmpid" link="fulltext">10993077</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>50 million years of genomic stasis in endosymbiotic bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Tamas</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Klasson</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Canback</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Naslund</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Eriksson</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Sandstrom</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <issue>5577</issue>
            <fpage>2376</fpage>
            <lpage>2379</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1071278</pubid>
                  <pubid idtype="pmpid" link="fulltext">12089438</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidia</p>
            </title>
            <aug>
               <au>
                  <snm>Akman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aksoy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <issue>3</issue>
            <fpage>402</fpage>
            <lpage>407</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng986</pubid>
                  <pubid idtype="pmpid" link="fulltext">12219091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Genome sequence of Blochmannia pennsylvanicus indicates parallel evolutionary trends among bacterial mutualists of insects</p>
            </title>
            <aug>
               <au>
                  <snm>Degnan</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Lazarus</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>8</issue>
            <fpage>1023</fpage>
            <lpage>1033</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182215</pubid>
                  <pubid idtype="pmpid" link="fulltext">16077009</pubid>
                  <pubid idtype="doi">10.1101/gr.3771305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The genome sequence of Blochmannia floridanus: comparative analysis of reduced genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gil</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Zientz</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Delmotte</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Candelas</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Latorre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rausell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kamerbeek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gadau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holldobler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van Ham</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Moya</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>16</issue>
            <fpage>9388</fpage>
            <lpage>9393</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170928</pubid>
                  <pubid idtype="pmpid" link="fulltext">12886019</pubid>
                  <pubid idtype="doi">10.1073/pnas.1533499100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Polydnaviridae</p>
            </title>
            <aug>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Beckage</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Blissard</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Fleming</snm>
                  <fnm>JGW</fnm>
               </au>
               <au>
                  <snm>Krell</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Theilmann</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Virus Taxonomy Sixth report of the international committee on taxonomy of viruses</source>
            <publisher>Vienna, Springer Verlag</publisher>
            <editor>Murphy FA, Fauquet CM, Bishop DHL, Ghabrial SA, Jarvis AW, Martelli GP, Mayo MA, Summers MD</editor>
            <pubdate>1995</pubdate>
            <fpage>143</fpage>
            <lpage>147</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The integration of polydnavirus genomes in parasitoid genomes: implications for biocontrol and genetic analyses of parasitoid wasps</p>
            </title>
            <aug>
               <au>
                  <snm>Fleming</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Biological Control</source>
            <pubdate>1991</pubdate>
            <volume>1</volume>
            <fpage>127</fpage>
            <lpage>135</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/1049-9644(91)90111-C</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Polydnavirus DNA of the braconid wasp Chelonus inanitus is integrated in the wasp's genome and excised only in later pupal and adult stages of the female</p>
            </title>
            <aug>
               <au>
                  <snm>Gruber</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stettler</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Heiniger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schumperli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>1996</pubdate>
            <volume>77</volume>
            <issue>11</issue>
            <fpage>2873</fpage>
            <lpage>2879</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8922483</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Excision of the polydnavirus chromosomal integrated EP1 sequence of the parasitoid wasp Cotesia congregata (Braconidae, Microgastinae) at potential recombinase binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Savary</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beckage</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Periquet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>1997</pubdate>
            <volume>78</volume>
            <issue>12</issue>
            <fpage>3125</fpage>
            <lpage>3134</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9400960</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The polydnavirus life cycle</p>
            </title>
            <aug>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Parasites and Pathogens of Insects</source>
            <publisher>San Diego, Academic Press</publisher>
            <editor>Beckage NE, Thompson SN, Federici BA</editor>
            <pubdate>1993</pubdate>
            <volume>1: Parasites</volume>
            <fpage>167</fpage>
            <lpage>187</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Correlating the initiation of virus replication with a specific phase of pupal development in an ichneumonid parasitoid</p>
            </title>
            <aug>
               <au>
                  <snm>Norton</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Vinson</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Cell Tissue Research</source>
            <pubdate>1983</pubdate>
            <volume>231</volume>
            <issue>2</issue>
            <fpage>387</fpage>
            <lpage>398</lpage>
            <xrefbib>
               <pubid idtype="pmpid">6850807</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Molecular analysis of Campoletis sonorensis virus DNA in the lepidopteran host Heliothis virescens</p>
            </title>
            <aug>
               <au>
                  <snm>Theilmann</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>1986</pubdate>
            <volume>67</volume>
            <issue>9</issue>
            <fpage>1961</fpage>
            <lpage>1969</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3746255</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Fate of polydnavirus DNA of the egg-larval parasitoid Chelonus inanitus in the host Spodoptera littoralis</p>
            </title>
            <aug>
               <au>
                  <snm>Wyder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blank</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>491</fpage>
            <lpage>500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(03)00056-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">12770628</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Polydnaviruses: potent mediators of host insect immune dysfunction</p>
            </title>
            <aug>
               <au>
                  <snm>Lavine</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Beckage</snm>
                  <fnm>NE</fnm>
               </au>
            </aug>
            <source>Parasitol Today</source>
            <pubdate>1995</pubdate>
            <volume>11</volume>
            <issue>10</issue>
            <fpage>368</fpage>
            <lpage>378</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0169-4758(95)80005-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15275399</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Hormonal interactions between insect endoparasites and their host insects</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Parasites and Pathogens of Insects</source>
            <publisher>New York, Academic Press</publisher>
            <editor>Beckage NE, Thompson SN, Federici BA</editor>
            <pubdate>1993</pubdate>
            <volume>1: Parasites</volume>
            <fpage>59</fpage>
            <lpage>86</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The biology and genomics of polydnaviruses</p>
            </title>
            <aug>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Strand</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Comprehensive Molecular Insect Science</source>
            <publisher>San Diego, Elsevier</publisher>
            <editor>Gilbert LI, Iatrou K, Gill SS</editor>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>323</fpage>
            <lpage>360</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Phylogeny of microgastroid braconid wasps, and what it tells us about polydnavirus evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Hymenoptera: Evolution, Biodiversity and Biological Control</source>
            <publisher>Melbourne, Australia, CSIRO</publisher>
            <editor>Austin AD, Dowton M</editor>
            <pubdate>2000</pubdate>
            <fpage>97</fpage>
            <lpage>105</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Origin and evolution of polydnaviruses by symbiogenesis of insect DNA viruses in endoparasitic wasps</p>
            </title>
            <aug>
               <au>
                  <snm>Federici</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Bigot</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>419</fpage>
            <lpage>432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(03)00059-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">12770621</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Viruses and virus-like entities in the parasitic Hymenoptera</p>
            </title>
            <aug>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Journal of Hymenoptera Research</source>
            <pubdate>1992</pubdate>
            <volume>1</volume>
            <fpage>125</fpage>
            <lpage>139</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Virus or not? Phylogenetics of polydnaviruses and their wasp carriers</p>
            </title>
            <aug>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Asgari</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>397</fpage>
            <lpage>405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(03)00057-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12770619</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Genome sequence of a polydnavirus: insights into symbiotic virus evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Espagne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Huguet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Provost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Martins</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Poirie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Periquet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <issue>5694</issue>
            <fpage>286</fpage>
            <lpage>289</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1103066</pubid>
                  <pubid idtype="pmpid" link="fulltext">15472078</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Estimating the age of the polydnavirus/braconid wasp symbiosis</p>
            </title>
            <aug>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>11</issue>
            <fpage>7508</fpage>
            <lpage>7513</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">124262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12032313</pubid>
                  <pubid idtype="doi">10.1073/pnas.112067199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Perspectives on polydnavirus origins and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Turnbull</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Adv Virus Res</source>
            <pubdate>2002</pubdate>
            <volume>58</volume>
            <fpage>203</fpage>
            <lpage>254</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12205780</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Shared and species-specific features among ichnovirus genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lapointe</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barney</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Makkay</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cusson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2007</pubdate>
            <volume>363</volume>
            <issue>1</issue>
            <fpage>26</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.virol.2006.11.034</pubid>
                  <pubid idtype="pmpid" link="fulltext">17306851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Polydnavirus genomes reflect their dual roles as mutualists and pathogens</p>
            </title>
            <aug>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Strand</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Dickey</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Hilgarth</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Barney</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Kadash</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kroemer</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Lindstrom</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Rattanadechakul</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Shelby</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Thoetkiattikul</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Turnbull</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Witherell</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2006</pubdate>
            <volume>347</volume>
            <issue>1</issue>
            <fpage>160</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.virol.2005.11.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">16380146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Genomic and morphological features of a banchine polydnavirus: a comparison with bracoviruses and ichnoviruses</p>
            </title>
            <aug>
               <au>
                  <snm>Lapointe</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Barney</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Banks</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Beliveau</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Cusson</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>2007</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A gene encoding a polydnavirus structural polypeptide is not encapsidated</p>
            </title>
            <aug>
               <au>
                  <snm>Deng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stoltz</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2000</pubdate>
            <volume>269</volume>
            <issue>2</issue>
            <fpage>440</fpage>
            <lpage>450</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/viro.2000.0248</pubid>
                  <pubid idtype="pmpid" link="fulltext">10753722</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Cloning and expression of a gene encoding a Campoletis sonorensis polydnavirus structural protein</p>
            </title>
            <aug>
               <au>
                  <snm>Deng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Arch Insect Biochem Physiol</source>
            <pubdate>1999</pubdate>
            <volume>40</volume>
            <issue>1</issue>
            <fpage>30</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1520-6327(1999)40:1&lt;30::AID-ARCH4&gt;3.0.CO;2-Y</pubid>
                  <pubid idtype="pmpid" link="fulltext">9987819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Polydnavirus biology, genome structure, and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>The Insect Viruses</source>
            <publisher>New York, Plenum Press</publisher>
            <editor>Miller LK, Ball LA</editor>
            <pubdate>1998</pubdate>
            <fpage>105</fpage>
            <lpage>139</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Characterization of Chelonus inanitus polydnavirus segments: sequences and analysis, excision site and demonstration of clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Wyder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tschannen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hochuli</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gruber</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Saladin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Zumbach</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2002</pubdate>
            <volume>83</volume>
            <issue>1</issue>
            <fpage>247</fpage>
            <lpage>256</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11752722</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Polydnavirus genome: integrated vs. free virus</p>
            </title>
            <aug>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Provost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Espagne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Poirie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Periquet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Huguet</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>407</fpage>
            <lpage>417</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(03)00058-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12770620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Visualization of polydnavirus sequences in a parasitoid wasp chromosome</p>
            </title>
            <aug>
               <au>
                  <snm>Belle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Beckage</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Rousselet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Poirie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lemeunier</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>2002</pubdate>
            <volume>76</volume>
            <issue>11</issue>
            <fpage>5793</fpage>
            <lpage>5796</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137038</pubid>
                  <pubid idtype="pmpid" link="fulltext">11992007</pubid>
                  <pubid idtype="doi">10.1128/JVI.76.11.5793-5796.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Ovary development and polydnavirus morphogenesis in the parasitic wasp Chelonus inanitus. I. Ovary morphogenesis, amplification of viral DNA and ecdysteroid titres</p>
            </title>
            <aug>
               <au>
                  <snm>Marti</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grossniklaus-Burgin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wyder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wyler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2003</pubdate>
            <volume>84</volume>
            <issue>Pt 5</issue>
            <fpage>1141</fpage>
            <lpage>1150</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/vir.0.18832-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12692279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Polydnavirus replication: the EP1 segment of the parasitoid wasp Cotesia congregata is amplified within a larger precursor molecule</p>
            </title>
            <aug>
               <au>
                  <snm>Pasquier-Barre</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Huguet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Monteiro</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Poirie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2002</pubdate>
            <volume>83</volume>
            <issue>Pt 8</issue>
            <fpage>2035</fpage>
            <lpage>2045</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12124468</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Polydnavirus DNA is integrated in the DNA of its parasitoid wasp host</p>
            </title>
            <aug>
               <au>
                  <snm>Fleming</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <issue>21</issue>
            <fpage>9770</fpage>
            <lpage>9774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">52802</pubid>
                  <pubid idtype="pmpid" link="fulltext">1946402</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.21.9770</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Homologous sequences in the Campoletis sonorensis polydnavirus genome are implicated in replication and nesting of the W segment family</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>1997</pubdate>
            <volume>71</volume>
            <issue>11</issue>
            <fpage>8504</fpage>
            <lpage>8513</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">192314</pubid>
                  <pubid idtype="pmpid" link="fulltext">9343208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Genome organization of the Chelonus inanitus polydnavirus: excision sites, spacers and abundance of proviral and excised segments</p>
            </title>
            <aug>
               <au>
                  <snm>Annaheim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lanzrein</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2007</pubdate>
            <volume>88</volume>
            <issue>Pt 2</issue>
            <fpage>450</fpage>
            <lpage>457</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/vir.0.82396-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">17251562</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Morphological and genomic characterization of the polydnavirus associated with the parasitoid wasp Glyptapanteles indiensis (Hymenoptera: Braconidae)</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Gundersen-Rindal</snm>
                  <fnm>DE</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2003</pubdate>
            <volume>84</volume>
            <issue>Pt 8</issue>
            <fpage>2051</fpage>
            <lpage>2060</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/vir.0.19234-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12867635</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Microplitis demolitor bracovirus genome segments vary in abundance and are individually packaged in virions</p>
            </title>
            <aug>
               <au>
                  <snm>Beck</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Inman</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Strand</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2006</pubdate>
         </bibl>
         <bibl id="B43">
            <title>
               <p>A tool for analyzing and annotating genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1997</pubdate>
            <volume>46</volume>
            <issue>1</issue>
            <fpage>37</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1997.4984</pubid>
                  <pubid idtype="pmpid" link="fulltext">9403056</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Characterization and transcriptional analysis of protein tyrosine phosphatase genes and an ankyrin repeat gene of the parasitoid Glyptapanteles indiensis polydnavirus in the parasitized host</p>
            </title>
            <aug>
               <au>
                  <snm>Gundersen-Rindal</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Pedroni</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2006</pubdate>
            <volume>87</volume>
            <issue>Pt 2</issue>
            <fpage>311</fpage>
            <lpage>322</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/vir.0.81326-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">16432017</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Inhibitor kappaB-like proteins from a polydnavirus inhibit NF-kappaB activation and suppress the insect immune response</p>
            </title>
            <aug>
               <au>
                  <snm>Thoetkiattikul</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Strand</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>32</issue>
            <fpage>11426</fpage>
            <lpage>11431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1183600</pubid>
                  <pubid idtype="pmpid" link="fulltext">16061795</pubid>
                  <pubid idtype="doi">10.1073/pnas.0505240102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Bracoviruses contain a large multigene family coding for protein tyrosine phosphatases</p>
            </title>
            <aug>
               <au>
                  <snm>Provost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Varricchio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Arana</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Espagne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Falabella</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huguet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>La Scaleia</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Poirie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Malva</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Olszewski</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Drezen</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>2004</pubdate>
            <volume>78</volume>
            <issue>23</issue>
            <fpage>13090</fpage>
            <lpage>13103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524979</pubid>
                  <pubid idtype="pmpid" link="fulltext">15542661</pubid>
                  <pubid idtype="doi">10.1128/JVI.78.23.13090-13103.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Genetic content and evolution of adenoviruses</p>
            </title>
            <aug>
               <au>
                  <snm>Davison</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Benko</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harrach</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Gen Virol</source>
            <pubdate>2003</pubdate>
            <volume>84</volume>
            <issue>Pt 11</issue>
            <fpage>2895</fpage>
            <lpage>2908</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/vir.0.19497-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">14573794</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Characterization of a novel protein with homology to C-type lectins expressed by the Cotesia rubecula bracovirus in larvae of the lepidopteran host, Pieris rapae</p>
            </title>
            <aug>
               <au>
                  <snm>Glatz</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Asgari</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>22</issue>
            <fpage>19743</fpage>
            <lpage>19750</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M301396200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12644452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>A coiled-coil region of an insect immune suppressor protein is involved in binding and uptake by hemocytes</p>
            </title>
            <aug>
               <au>
                  <snm>Asgari</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Insect Biochem Mol Biol</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <issue>5</issue>
            <fpage>497</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0965-1748(01)00127-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11891126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Polydnavirus genes and genomes: emerging gene families and new insights into polydnavirus replication</p>
            </title>
            <aug>
               <au>
                  <snm>Kroemer</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Annu Rev Entomol</source>
            <pubdate>2004</pubdate>
            <volume>49</volume>
            <fpage>431</fpage>
            <lpage>456</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.ento.49.072103.120132</pubid>
                  <pubid idtype="pmpid" link="fulltext">14651471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Relationships between polydnavirus gene expression and host range of the parasitoid wasp Campoletis sonorensis</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Soldevila</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2000</pubdate>
            <volume>46</volume>
            <issue>10</issue>
            <fpage>1397</fpage>
            <lpage>1407</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(00)00059-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10878266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Bee venom hyaluronidase is homologous to a membrane protein of mammalian sperm</p>
            </title>
            <aug>
               <au>
                  <snm>Gmachl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kreil</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1993</pubdate>
            <volume>90</volume>
            <issue>8</issue>
            <fpage>3569</fpage>
            <lpage>3573</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">46342</pubid>
                  <pubid idtype="pmpid" link="fulltext">7682712</pubid>
                  <pubid idtype="doi">10.1073/pnas.90.8.3569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Adenosine: an endogenous regulator of innate immunity</p>
            </title>
            <aug>
               <au>
                  <snm>Hasko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cronstein</snm>
                  <fnm>BN</fnm>
               </au>
            </aug>
            <source>Trends Immunol</source>
            <pubdate>2004</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>33</fpage>
            <lpage>39</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.it.2003.11.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">14698282</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>A role for adenosine deaminase in Drosophila larval development</p>
            </title>
            <aug>
               <au>
                  <snm>Dolezal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dolezelova</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zurovec</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>7</issue>
            <fpage>e201</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1135298</pubid>
                  <pubid idtype="pmpid" link="fulltext">15907156</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>RNA binding protein sex-lethal (Sxl) and control of Drosophila sex determination and dosage compensation</p>
            </title>
            <aug>
               <au>
                  <snm>Penalva</snm>
                  <fnm>LO</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2003</pubdate>
            <volume>67</volume>
            <issue>3</issue>
            <fpage>343</fpage>
            <lpage>359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193869</pubid>
                  <pubid idtype="pmpid" link="fulltext">12966139</pubid>
                  <pubid idtype="doi">10.1128/MMBR.67.3.343-359.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>The gene fl(2)d is needed for the sex-specific splicing of transformer pre-mRNA but not for double-sex pre-mRNA in Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Granadino</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Penalva</snm>
                  <fnm>LO</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1996</pubdate>
            <volume>253</volume>
            <issue>1-2</issue>
            <fpage>26</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380050292</pubid>
                  <pubid idtype="pmpid">9003283</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Evidence of a dual function in fl(2)d, a gene needed for Sex-lethal expression in Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Granadino</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>San Juan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Santamaria</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1992</pubdate>
            <volume>130</volume>
            <issue>3</issue>
            <fpage>597</fpage>
            <lpage>612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1204876</pubid>
                  <pubid idtype="pmpid" link="fulltext">1551580</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The Drosophila melanogaster fl(2)d gene is needed for the female-specific splicing of Sex-lethal RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Granadino</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Campuzano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1990</pubdate>
            <volume>9</volume>
            <issue>8</issue>
            <fpage>2597</fpage>
            <lpage>2602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">552292</pubid>
                  <pubid idtype="pmpid">1695150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Evolutionary relationships among the Braconidae (Hymenoptera: Ichneumonoidea) inferred from partial 16S rDNA gene sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Dowton</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Austin</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Antolin</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>129</fpage>
            <lpage>150</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2583.1998.72058.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9535159</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Molecular and morphological data suggest a single origin of the polydnaviruses among braconid wasps</p>
            </title>
            <aug>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Naturwissenschaften</source>
            <pubdate>1997</pubdate>
            <volume>84</volume>
            <fpage>502</fpage>
            <lpage>507</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s001140050434</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Identification and comparison of Campoletis sonorensis virus transcripts expressed from four genomic segments in the insect hosts Campoletis sonorensis and Heliothis virescens</p>
            </title>
            <aug>
               <au>
                  <snm>Theilmann</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>1988</pubdate>
            <volume>167</volume>
            <issue>2</issue>
            <fpage>329</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3201745</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Characterization of Campoletis sonorensis ichnovirus unique segment B and excision locus structure</p>
            </title>
            <aug>
               <au>
                  <snm>Rattanadechakul</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>J Insect Physiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>523</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-1910(03)00053-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12770631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Molecular Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Massachusetts, Sinauer Associates</publisher>
            <pubdate>1997</pubdate>
            <fpage>487</fpage>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Amelioration of bacterial genomes: rates of change and exchange</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <issue>4</issue>
            <fpage>383</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006158</pubid>
                  <pubid idtype="pmpid" link="fulltext">9089078</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Limitations of compositional approach to identifying horizontally transferred genes</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2001</pubdate>
            <volume>53</volume>
            <issue>3</issue>
            <fpage>244</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002390010214</pubid>
                  <pubid idtype="pmpid" link="fulltext">11523011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Development of mass-rearing technology</p>
            </title>
            <aug>
               <au>
                  <snm>Bell</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Owens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shapiro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tardif</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>The gypsy moth: research towards integrated pest management</source>
            <editor>Doane CC, McManus AML</editor>
            <pubdate>1981</pubdate>
            <fpage>599</fpage>
            <lpage>633</lpage>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Virus with a Multipartite Superhelical DNA Genome from the Ichneumonid Parasitoid Campoletis sonorensis</p>
            </title>
            <aug>
               <au>
                  <snm>Krell</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Vinson</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>1982</pubdate>
            <volume>43</volume>
            <issue>3</issue>
            <fpage>859</fpage>
            <lpage>870</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">256196</pubid>
                  <pubid idtype="pmpid" link="fulltext">16789230</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Characterization and biological effects of Cotesia congregata polydnavirus on host larvae of the tobacco hornworm, Manduca sexta</p>
            </title>
            <aug>
               <au>
                  <snm>Beckage</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>FF</fnm>
               </au>
               <au>
                  <snm>Schleifer</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Cherubin</snm>
                  <fnm>LL</fnm>
               </au>
            </aug>
            <source>Arch Insect Biochem Physiol</source>
            <pubdate>1994</pubdate>
            <volume>26</volume>
            <fpage>165</fpage>
            <lpage>195</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/arch.940260209</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Amplicon Express</p>
            </title>
            <url>http://www.genomex.com</url>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Image - the fingerprint analysis software system</p>
            </title>
            <url>http://www.sanger.ac.uk/Software/Image/</url>
         </bibl>
         <bibl id="B71">
            <title>
               <p>FPC: a system for building contigs from restriction fingerprinted clones</p>
            </title>
            <aug>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mott</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <issue>5</issue>
            <fpage>523</fpage>
            <lpage>535</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>A whole-genome assembly of Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Dew</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Fasulo</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Flanigan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kravitz</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Mobarry</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Reinert</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Remington</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Anson</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Bolanos</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Chou</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Lonardi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beasley</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Brandon</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nusskern</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Zhan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <issue>5461</issue>
            <fpage>2196</fpage>
            <lpage>2204</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2196</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>TIGR Assembler: A new tool for assembling large shotgun sequencing projects</p>
            </title>
            <aug>
               <au>
                  <snm>Sutton</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Science &amp; Technology</source>
            <pubdate>1995</pubdate>
            <volume>1</volume>
            <fpage>9</fpage>
            <lpage>19</lpage>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Optimized multiplex PCR: efficiently closing a whole-genome shotgun sequencing project</p>
            </title>
            <aug>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Radune</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1999</pubdate>
            <volume>62</volume>
            <issue>3</issue>
            <fpage>500</fpage>
            <lpage>507</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1999.6048</pubid>
                  <pubid idtype="pmpid" link="fulltext">10644449</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>NetPrimer</p>
            </title>
            <url>http://www.premierbiosoft.com/netprimer/index.html</url>
         </bibl>
         <bibl id="B76">
            <title>
               <p>Slice Tools</p>
            </title>
            <url>http://slicetools.sourceforge.net</url>
         </bibl>
         <bibl id="B77">
            <title>
               <p>SoftBerry - FGENESH</p>
            </title>
            <url>http://www.softberry.com</url>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Beijing Gene Finder</p>
            </title>
            <url>http://bgf.genomics.org.cn</url>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Prediction of complete gene structures in human genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Burge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>268</volume>
            <issue>1</issue>
            <fpage>78</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1997.0951</pubid>
                  <pubid idtype="pmpid" link="fulltext">9149143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Improved prediction of signal peptides: SignalP 3.0</p>
            </title>
            <aug>
               <au>
                  <snm>Bendtsen</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <issue>4</issue>
            <fpage>783</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.05.028</pubid>
                  <pubid idtype="pmpid" link="fulltext">15223320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Engelbrecht</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>1997</pubdate>
            <volume>10</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/10.1.1</pubid>
                  <pubid idtype="pmpid" link="fulltext">9051728</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>305</volume>
            <issue>3</issue>
            <fpage>567</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4315</pubid>
                  <pubid idtype="pmpid" link="fulltext">11152613</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Identification of GPI anchor attachment signals by a Kohonen self-organizing map</p>
            </title>
            <aug>
               <au>
                  <snm>Fankhauser</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Maser</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>1846</fpage>
            <lpage>1852</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti299</pubid>
                  <pubid idtype="pmpid" link="fulltext">15691858</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release</p>
            </title>
            <aug>
               <au>
                  <snm>Haas</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ronning</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Hannick</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RK</fnm>
                  <suf>Jr.</suf>
               </au>
               <au>
                  <snm>Maiti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Farzad</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>BMC Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1082884</pubid>
                  <pubid idtype="pmpid" link="fulltext">15784138</pubid>
                  <pubid idtype="doi">10.1186/1741-7007-3-7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>The Pfam protein families database</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Yeats</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D138</fpage>
            <lpage>D141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308855</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681378</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>The TIGRFAMs database of protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>371</fpage>
            <lpage>373</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165575</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520025</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg128</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Comparisons of eukaryotic genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ladunga</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <issue>26</issue>
            <fpage>12832</fpage>
            <lpage>12836</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">45534</pubid>
                  <pubid idtype="pmpid" link="fulltext">7809130</pubid>
                  <pubid idtype="doi">10.1073/pnas.91.26.12832</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Phylogenetic Analysis Using Parsimony (*and Other Methods)</p>
            </title>
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Massachusetts, Sinauer Associates</publisher>
            <edition>Version 4</edition>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B89">
            <title>
               <p>PHY.FI: fast and easy online creation and manipulation of phylogeny color figures</p>
            </title>
            <aug>
               <au>
                  <snm>Fredslund</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>315</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1513607</pubid>
                  <pubid idtype="pmpid" link="fulltext">16792795</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-315</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Fitting a mixture model by expectation maximization to discover motifs in biopolymers</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Elkan</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology</source>
            <publisher>Menlo Park, California, AAAI Press</publisher>
            <pubdate>1994</pubdate>
            <fpage>28</fpage>
            <lpage>36</lpage>
         </bibl>
         <bibl id="B91">
            <title>
               <p>WebLogo: a sequence logo generator</p>
            </title>
            <aug>
               <au>
                  <snm>Crooks</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Hon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chandonia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>1188</fpage>
            <lpage>1190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419797</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173120</pubid>
                  <pubid idtype="doi">10.1101/gr.849004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>Sequence logos: a new way to display consensus sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <issue>20</issue>
            <fpage>6097</fpage>
            <lpage>6100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">332411</pubid>
                  <pubid idtype="pmpid" link="fulltext">2172928</pubid>
                  <pubid idtype="doi">10.1093/nar/18.20.6097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>Versatile and open software for comparing large genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Phillippy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Smoot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shumway</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Antonescu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>R12</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395750</pubid>
                  <pubid idtype="pmpid" link="fulltext">14759262</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-2-r12</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>AMOS: A Modular Open-Source Assembler</p>
            </title>
            <url>http://amos.sourceforge.net/</url>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1986</pubdate>
            <volume>3</volume>
            <issue>5</issue>
            <fpage>418</fpage>
            <lpage>426</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3444411</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
