<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-10-293</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A BAC-based physical map of the Hessian fly genome anchored to polytene chromosomes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Aggarwal</snm>
               <fnm>Rajat</fnm>
               <insr iid="I1"/>
               <email>raggarw@purdue.edu</email>
            </au>
            <au id="A2">
               <snm>Benatti</snm>
               <mi>R</mi>
               <fnm>Thiago</fnm>
               <insr iid="I1"/>
               <email>tbenatti@purdue.edu</email>
            </au>
            <au id="A3">
               <snm>Gill</snm>
               <fnm>Navdeep</fnm>
               <insr iid="I2"/>
               <email>gilln@purdue.edu</email>
            </au>
            <au id="A4">
               <snm>Zhao</snm>
               <fnm>Chaoyang</fnm>
               <insr iid="I1"/>
               <email>czhao@purdue.edu</email>
            </au>
            <au id="A5">
               <snm>Chen</snm>
               <fnm>Ming-Shun</fnm>
               <insr iid="I3"/>
               <email>ming-shun.chen@ars.usda.gov</email>
            </au>
            <au id="A6">
               <snm>Fellers</snm>
               <mi>P</mi>
               <fnm>John</fnm>
               <insr iid="I3"/>
               <email>john.fellers@ars.usda.gov</email>
            </au>
            <au id="A7">
               <snm>Schemerhorn</snm>
               <mi>J</mi>
               <fnm>Brandon</fnm>
               <insr iid="I4"/>
               <email>bschemer@purdue.edu</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Stuart</snm>
               <mi>J</mi>
               <fnm>Jeff</fnm>
               <insr iid="I1"/>
               <email>stuartjj@purdue.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Entomology, Purdue University, West Lafayette, Indiana 47907, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Agronomy, Purdue University, West Lafayette, Indiana 47907, USA</p>
            </ins>
            <ins id="I3">
               <p>USDA-ARS, Plant Science and Entomology Research Unit, 4008 Throckmorton, Kansas State University, Manhattan, Kansas 66502, USA</p>
            </ins>
            <ins id="I4">
               <p>USDA-ARS, Department of Entomology, Purdue University, West Lafayette, Indiana 47907, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>293</fpage>
         <url>http://www.biomedcentral.com/1471-2164/10/293</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19573234</pubid>
               <pubid idtype="doi">10.1186/1471-2164-10-293</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>02</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>02</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Aggarwal et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The Hessian fly (<it>Mayetiola destructor</it>) is an important insect pest of wheat. It has tractable genetics, polytene chromosomes, and a small genome (158 Mb). Investigation of the Hessian fly presents excellent opportunities to study plant-insect interactions and the molecular mechanisms underlying genome imprinting and chromosome elimination. A physical map is needed to improve the ability to perform both positional cloning and comparative genomic analyses with the fully sequenced genomes of other dipteran species.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>An FPC-based genome wide physical map of the Hessian fly was constructed and anchored to the insect's polytene chromosomes. Bacterial artificial chromosome (BAC) clones corresponding to 12-fold coverage of the Hessian fly genome were fingerprinted, using high information content fingerprinting (HIFC) methodology, and end-sequenced. Fluorescence <it>in situ </it>hybridization (FISH) co-localized two BAC clones from each of the 196 longest contigs on the polytene chromosomes. An additional 70 contigs were positioned using a single FISH probe. The 266 FISH mapped contigs were evenly distributed and covered 60% of the genome (95,668 kb). The ends of the fingerprinted BACs were then sequenced to develop the capacity to create sequenced tagged site (STS) markers on the BACs in the map. Only 3.64% of the BAC-end sequence was composed of transposable elements, helicases, ribosomal repeats, simple sequence repeats, and sequences of low complexity. A relatively large fraction (14.27%) of the BES was comprised of multi-copy gene sequences. Nearly 1% of the end sequence was composed of simple sequence repeats (SSRs).</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This physical map provides the foundation for high-resolution genetic mapping, map-based cloning, and assembly of complete genome sequencing data. The results indicate that restriction fragment length heterogeneity in BAC libraries used to construct physical maps lower the length and the depth of the contigs, but is not an absolute barrier to the successful application of the technology. This map will serve as a genomic resource for accelerating gene discovery, genome sequencing, and the assembly of BAC sequences. The Hessian fly BAC-clone assembly, and the names and positions of the BAC clones used in the FISH experiments are publically available at <url>http://genome.purdue.edu/WebAGCoL/Hfly/WebFPC/</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The Hessian fly (<it>Mayetiola destructor</it>) is an important pest of wheat (<it>Triticum </it>spp.) and a member of one of the largest and most economically important families of insects, the gall midges (Cecidomyiidae). Its genetic tractability and short generation time (~28 days) make it especially attractive as an experimental model for investigating insect-plant interactions <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. This capacity was first demonstrated when it was the first insect shown to have a gene-for-gene interaction with its host plant <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Previously, it was used to study chromosome elimination and the function of the germ-line-limited "E" chromosomes that characterize all gall midge species <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Later investigations demonstrated that it has other biological attributes that are compatible with genomic analyses, including polytene chromosomes in the larval salivary glands <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and a small genome (158 Mb) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The development of a physically anchored genetic map <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, a syntenic analysis of a BAC-based contig <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and transcriptomic analyses of the first instar salivary glands <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> also demonstrated the potential of this insect for comparative genomics with other dipteran species.</p>
         <p>Improved genomic maps are essential to further develop this insect as an experimental organism. However, certain biological characteristics make fine-scale genetic mapping in the Hessian fly problematic. The first problem is the relatively low number of offspring produced by single females, typically ranging from 50 to 200, which limits the resolution of a genetic map. The second problem is the small size (2 to 3 mm in length) of the insect. This severely limits the yield of the genomic DNA extracted from individuals, and the number of molecular markers that can be genetically mapped. A third problem, the insect's unusual chromosome cycle and mechanism of sex determination, makes the construction of inbred strains difficult. Because this phenomenon is both central to the genetics of the insect and worthy of genomic investigation, it is briefly described below.</p>
         <p>The germ line of the Hessian fly contains both maternally and paternally derived copies of each of two autosomes (A1 and A2) and two X chromosomes (X1 and X2) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. It also contains a variable number (~32) of maternally inherited germ-line-limited "E" chromosomes (A1A2X1X2/A1A2X1X2; E). The E chromosomes are eliminated from all future somatic cells during the fifth cleavage division of embryogenesis <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. This post-zygotic division also determines sex. Embryos that eliminate the paternally inherited X chromosomes, together with the E chromosomes, possess a male determining somatic karyotype (A1A2X1X2/A1A2OO) and develop as males. Those that retain both sets of X chromosomes possess a female determining somatic karyotype (A1A2X1X2/A1A2X1X2) and develop as females. Maternal genotype determines whether the paternally derived X chromosomes are eliminated or retained in the soma <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. As a consequence, an unusual mating structure is established that is resistant to inbreeding; females produce either all-female or all-male families. Chromosome elimination also occurs during spermatogenesis. The result is that all sperm carry only the maternally derived autosomes and X chromosomes (A1A2X1X2). Genetic recombination between the homologous autosomes and X chromosomes occurs only during meiosis of oogenesis. All ova carry a single copy of each autosome and each X chromosome and a full complement (~32) of E chromosomes (A1A2X1X2; E). There is no heterogametic sex since all sperm and all ova are genetically equivalent and sex is determined after fertilization.</p>
         <p>In contrast to the problems that complicate genetic analyses, the Hessian fly polytene chromosomes make physical mapping relatively simple. Although they lack the banding resolution of <it>Drosophila </it>polytene chromosomes, they still provide a relatively high mapping resolution when bacterial artificial chromosomes (BACs) are used as probes for fluorescence <it>in situ </it>hybridization (FISH) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. This facility suggested that the development of a dense physical map of the Hessian fly's polytene chromosomes should precede high-resolution genetic mapping. We therefore applied the high-information content fingerprinting (HICF) method developed by Luo et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> to construct a BAC contig map using three Hessian fly BAC libraries (Table <tblr tid="T1">1</tblr>). The result was a first generation FPC-based physical map of the Hessian fly genome that is anchored to the polytene chromosomes. It is based on restriction fingerprints of 13,614 BAC clones. The largest 266 contigs were positioned on the chromosomes by fluorescence <it>in situ </it>hybridization (FISH). These contigs contain 4,563 BAC clones, and cover ~60% of the genome. The BAC end sequences (BES) of the clones used in this analysis were determined to provide the capacity to develop sequenced tagged site (STS) markers evenly distributed over the genome. BAC-end sequence data were also analyzed to improve our understanding of Hessian fly genome organization.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>BAC libraries fingerprinted for the Hessian fly physical map</p>
            </caption>
            <tblbdy cols="4">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>BAC Library</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Mde</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Hf</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>CL (MD_Bb)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>DNA source</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Population vH13</p>
                  </c>
                  <c ca="left">
                     <p>Population GP</p>
                  </c>
                  <c ca="left">
                     <p>Population L</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Vector</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>pIndigoBAC</p>
                  </c>
                  <c ca="left">
                     <p>pECBA1</p>
                  </c>
                  <c ca="left">
                     <p>pIndigoBAC536</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Cloning site</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><it>Hin</it>dIII</p>
                  </c>
                  <c ca="left">
                     <p><it>Hin</it>dIII</p>
                  </c>
                  <c ca="left">
                     <p><it>Bst</it>yI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Mean Insert Size (kb)</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>55</p>
                  </c>
                  <c ca="left">
                     <p>150</p>
                  </c>
                  <c ca="left">
                     <p>130</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>No. FP BAC clones</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>78</p>
                  </c>
                  <c ca="left">
                     <p>5,766</p>
                  </c>
                  <c ca="left">
                     <p>6,108</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Mean no. bands/clone</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>NA</p>
                  </c>
                  <c ca="left">
                     <p>90.6</p>
                  </c>
                  <c ca="left">
                     <p>87.6</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Genome coverage</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>0.006</p>
                  </c>
                  <c ca="left">
                     <p>5.5</p>
                  </c>
                  <c ca="left">
                     <p>5.0</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>BAC fingerprinting</p>
            </st>
            <p>We subjected 13,614 BACs to high-information content fingerprinting (Table <tblr tid="T2">2</tblr>). The DNA fingerprint profile of each BAC clone was then converted into an FPC compatible format using the FP Miner 1.1 software (BioinforSoft LLC, OR). The standard deviation associated with the size of two vector bands present in the fingerprints of each BAC (157.4 &#177; 0.01 bp and 369.5 &#177; 0.02 bp) indicated that fragment size reproducibility was high <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The FP Miner software was used to remove background signals and vector bands, and to assign a quality score to each BAC DNA fingerprint. We examined all of the fingerprints and removed 1662 that failed to meet each of the following quality standards: 1) 30 to 250 fragments, 2) = 10 fragments of each of the four colors, 3) a size standard match quality score that was >0.8, and 4) a fingerprint editing quality score that was >10. The fingerprints of 11,952 BACs (88% of the total) remained available for FPC assembly. The percentage of successfully fingerprinted clones compared favorably with the HICF maps developed for maize (<it>Zea mays</it>; 87%) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, Nile tilapia (<it>Oreochromis niloticus</it>; 87%) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, channel catfish (<it>Ictalurus punctatus</it>; 91.5%) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, and <it>Brassica rapa </it>(94.2%) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. DNA fragment sizes ranging from 50 to 500 bp were imported into the FPC v8.5.1 <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp> to build contigs. Among the successfully fingerprinted clones, 5,766 were derived from the Hf library, 6,108 were derived from the CL library, and 78 were derived from the Mde library. BAC clones from the Hf library had an average insert size of 150 kb and the CL library had an average insert size of 130 kb. The average number of scored bands per clone in the Hf library (90.6) and the CL library (87.6) was slightly less than the average observed during the construction of the maize map (107) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, but slightly greater than the averages of the two BAC libraries used to construct the Nile tilapia map (54 and 70) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Combined, the BACs in the Hf and CL libraries provided an estimated 10.5-fold coverage of the Hessian fly genome (Table <tblr tid="T1">1</tblr>). This was greater coverage than that used to construct either the Nile tilapia (5.6-fold) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> or the catfish (6.8-fold) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> maps, but considerably less than the coverage used to construct the maps of <it>B. rapa </it>(15.2-fold) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and maize (22-fold) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Summary of the Hessian fly physical map</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Number of processed clones</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>13,614</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Hf library</p>
                     </c>
                     <c ca="right">
                        <p>6,144</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>CL library</p>
                     </c>
                     <c ca="right">
                        <p>7,392</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Mde library</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Number of clones used in contig assembly</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>11,952</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Hf library</p>
                     </c>
                     <c ca="right">
                        <p>5,766</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>CL library</p>
                     </c>
                     <c ca="right">
                        <p>6,108</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Mde library</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Number of singletons</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>4,236</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Number of contigs</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>1,377</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>2&#8211;4 clones</p>
                     </c>
                     <c ca="right">
                        <p>913</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>5&#8211;9 clones</p>
                     </c>
                     <c ca="right">
                        <p>275</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>10&#8211;25 clones</p>
                     </c>
                     <c ca="right">
                        <p>159</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>26&#8211;50 clones</p>
                     </c>
                     <c ca="right">
                        <p>28</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>51&#8211;73 clones</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Physical length of the contigs (kb)</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>283,555</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Contig assembly</p>
            </st>
            <p>Contigs were assembled from the fingerprint data using the computer program FPC version 8.5.1 <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. FPC parameters were adjusted for the HICF technique as described by Lou et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and Nelson <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In each assembly, we used a tolerance (5) that restricted two bands with lengths greater than 0.5 bp from being considered the same band. In the initial assembly, we used a cutoff (the threshold for the probability score that matching bands are a coincidence) of 1<it>e</it>-35. Using these settings, FPC built 1258 contigs containing from 2 to 126 BAC clones per contig. The DQer function was then used to identify contigs with &#8805; 10% questionable clones (Qs). These contigs were then gradually split using three more stringent cutoffs (1<it>e</it>-38, 1<it>e</it>-41, and 1<it>e</it>-44). This produced 1477 contigs containing from 2 to 67 BAC clones per contig. The assembly was then end merged at a cutoff of 1<it>e</it>-29. This produced the first generation HICF Hessian fly map, which consisted of 1377 contigs, containing 7,716 BACs, and 4,236 (35%) singletons. The map had an average of 5.6 BACs per contig and the largest contig contained 73 BACs (Table <tblr tid="T2">2</tblr>). FPC analysis estimated that the clones used to produce the map had 1,061,478 unique bands, averaging 88.8 bands per clone. We used the known lengths of 23 BAC clones and the total number of labeled fragments for each of these clones to estimate average band size (1182 bp). Using this value, the assembled contigs had an estimated length of 283,555 kb, providing approximately 1.8-fold coverage of total genome length. The average contig length was 206 kb and the longest contig covered 1166 kb of the genome. FPC identified a total of 441 questionable clones (Qs) in the assembly. There were 88 contigs with >10% Qs. However, the average number of Qs per contig was only 0.32. Furthermore, 1212 contigs (88%) had no Qs, no contig had more than 14 Qs, and the majority of the Qs were in a few large contigs.</p>
            <p>Compared to the contigs of other HICF maps, the Hessian fly contigs were relatively small. For example, the <it>B. rapa</it>, catfish, and Nile tilapia maps each had greater average numbers of BACs per contig (37.7, 23.4, and 9.0 respectively), greater average contig lengths (512 kb, 521 kb, 390 kb) and lower percentages of singletons (20.7, 2.0, and 7.5) <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Since both BAC coverage and fragment size reproducibility were good, other factors were probably responsible for the shorter and shallower contigs in the Hessian fly map. The organization of the much smaller Hessian fly genome might have been one factor. However, we suspect that a greater frequency of restriction fragment length polymorphisms (RFLPs) in the Hessian fly BAC libraries was the major problem. This had been anticipated because the Hessian fly BAC libraries were each constructed with DNA derived from thousands of individuals in heterogeneous strains that are poorly characterized. Thus, indels, single nucleotide polymorphisms, gene duplications, and other rearrangements may all increase the number of mismatched bands. Regardless of the cause and in spite of the relatively low number of BACs per contig, the coverage of total genome length (1.8) appeared to exceed that of the maps of <it>B. rapa </it>(1.3), catfish (0.93), and Nile tilapia (1.65). In addition, the percentage of Qs in the Hessian fly assembly (0.32) was lower than that observed in maize (11.0) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, <it>B. rapa </it>(15.0), catfish (7.3), and Nile tilapia (9.6). Thus, it appeared that the Hessian fly contigs provided reasonable coverage with few questionable clones. This suggested that an abundance of RFLPs is not an absolute barrier to the construction of a HICF-based physical map.</p>
            <p>Because the BAC clones were largely derived from two different libraries (Table <tblr tid="T1">1</tblr>), we performed an FPC assembly of each library separately using the same parameters that were used to assemble both libraries combined. The BACs in the CL library assembled into a greater number of contigs (888 vs. 795) with a greater average number of BACs per contig (4.74 vs. 3.78) than the BACs in the Hf library. In addition, the percentage of singletons in the CL assembly (31%) was fewer than the singletons in the Hf assembly (48%), but the number of contigs with >10% Qs was nearly the same (CL = 33 and Hf = 30). Thus, it appeared that if RFLPs were interfering with contig assembly, they were more abundant in the Hf library. Interestingly, the sum of the total number of contigs assembled using the CL (888) and Hf BACs (793) separately was only 22% greater than the total number of BACs in the combined assembly (1377), and the sum of the coverage provided by the CL (159,451 kb) and Hf BACs (149,099 kb) was only 9% greater than that of the combined assembly (283,555 kb). Thus, it appeared as if the contigs of one library only occasionally overlapped with the contigs of the other library and this lowered contig size more than total coverage when the libraries were combined. This possibility was also evident in the contigs of the FPC map where CL BACs tended to stack with other CL BACs and Hf BACs tended to stack with other Hf BACs <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. It is also consistent with the suggestion that using libraries prepared with different restriction enzymes reduces the number of gaps in the assembly <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Contig validation</p>
            </st>
            <p>Two different approaches were used to validate contigs in the first generation Hessian fly HICF map. The first examined the FPC assembly of two contigs that had been constructed by chromosome walking (Figure <figr fid="F1">1</figr>). This was the only region of the Hessian fly genome in which chromosome walking had been previously performed. The construction and composition of one walk (walk-124) was described previously <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. It consisted of 64 BAC clones covering a 7.5-cM genetic distance that was marked with four STS markers and a physical distance of nearly 1 Mb. The other chromosome walk began from STS marker 134 (walk-134) and was composed of 60 BAC clones. Together, the contigs constructed by walk-124 and walk-134 contained 73 Mde BAC clones that were analyzed by FPC. The 134-walk was deeper and contained a greater proportion of Hf clones. Neither chromosome walk contained clones from the CL library because that library had not yet been prepared when the walking experiments were performed. Only 5 clones in the 124-walk were discovered in the contigs of the FPC assembly (Figure <figr fid="F1">1</figr>). Those clones were present in 3 small contigs (contigs #837, #938, and #1176) containing a combined total of 7 clones. FPC assembly of the 134-walk was more successful. FPC assembled 52 (87%) of the clones in the 134-walk into six contigs (Figure <figr fid="F1">1</figr>). The majority of those clones were assembled into two contigs: one (contig #12) containing 24 134-walk clones and the other (contig #56) containing 19 134-walk clones. An additional 28 CL clones were identified in these contigs. Reducing the stringency of FPC assembly (cutoff of 1<it>e</it>-15) merged all six contigs associated with walk-134 into a single contig. However, the assembly of the 124-walk was not improved. This analysis suggested that, as expected, regions of the genome with relatively deep coverage (walk-134) were well represented in the map whereas regions with less depth (walk-124) were poorly represented.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>BAC contigs established by chromosome walking</p>
               </caption>
               <text>
                  <p><b>BAC contigs established by chromosome walking</b>. BAC clones are represented as black and grey horizontal lines and BES used as probes are represented as black (SP6-end) and grey (T7-end) circles. The physical positions of BAC clones represented by grey lines were determined in FISH experiments. The number and library origin of each BAC clone is shown where M indicates the Mde library, and H indicates the Hf library. The FPC-based contig of each BAC is shown in parenthesis (NF indicates a clone that was not fingerprinted). Black vertical arrows intersect BAC clones that were positive in library screens that used STS markers as probe. Grey vertical lines intersect BAC clones that were positive in library screens that used BES as probe. (a) Chromosome walk-124. (b) Chromosome walk-134.</p>
               </text>
               <graphic file="1471-2164-10-293-1"/>
            </fig>
            <p>The second approach of assessing contig validity was to test the continuity of the 196 longest FPC assembled contigs using FISH. In these experiments, a BAC clone at one extreme of each FPC assembled contig was labeled with biotin and a BAC clone at the opposite extreme was labeled with digoxigenin. Separate FISH experiments were performed for each contig, in which both BAC clones were hybridized to the same polytene chromosome preparations (Figure <figr fid="F2">2</figr>). The FPC assembly was judged "valid" when both probes from the same contig co-localized on the Hessian fly polytene chromosomes (Table <tblr tid="T3">3</tblr>). The contigs examined contained from 9 to 73 BAC clones per contig and ranged in length from 172 to 1157 kb. Using the banding patterns of the four polytene Hessian fly chromosomes, we divided the Hessian fly genome into 26 chromosomal segments (Figure <figr fid="F2">2</figr>), and determined which segments contained each valid contig (Table <tblr tid="T4">4</tblr>). Among the 196 contigs tested, 169 contigs (86%) were scored as valid (Table <tblr tid="T3">3</tblr>). In 63 of these valid contigs (32% of the total) it was possible to discern which of the two BAC clones was the most proximal (Figure <figr fid="F2">2</figr>). It was, therefore, possible to suggest the orientation of those 63 contigs on the chromosomes. There were 27 contigs (14%) that were determined to be "invalid." These had a greater number of Qs than the valid contigs (Table <tblr tid="T3">3</tblr>). Moreover, they were easily separated into two groups: A "repetitive" group of 11 contigs, and a "non-repetitive" group of 16 contigs. The contigs in the repetitive group had at least one probe that hybridized to multiple locations (Figure <figr fid="F3">3</figr>). Seven of these had a probe that hybridized to pericentromeric heterochromatin. The non-repetitive group had probes that hybridized to different, but unique, genomic locations. This group of contigs had a greater number of Qs per contig than both the valid group of contigs and the invalid-repetitive group of contigs (Table <tblr tid="T3">3</tblr>). We reexamined all invalid contigs by performing FISH with additional BAC clones selected from the same contigs. In each experiment, these clones hybridized in the same position as one of the previous BAC clones (Figure <figr fid="F3">3</figr>). These data were then used to assign the unreliable contigs to a chromosomal segment (Table <tblr tid="T4">4</tblr>).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Assessment of 196 contigs using two BAC clones per contig as FISH probes.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Valid</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Invalid</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Invalid-R</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Invalid-NR</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total no. contigs</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>169</p>
                     </c>
                     <c ca="right">
                        <p>27</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total no. BACs</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>3208</p>
                     </c>
                     <c ca="right">
                        <p>575</p>
                     </c>
                     <c ca="right">
                        <p>186</p>
                     </c>
                     <c ca="right">
                        <p>389</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Mean no. BACs/contig</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>19 &#177; 10</p>
                     </c>
                     <c ca="right">
                        <p>21 &#177; 11</p>
                     </c>
                     <c ca="right">
                        <p>17 &#177; 11</p>
                     </c>
                     <c ca="right">
                        <p>24 &#177; 10<sup>*&#8224;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total no. Qs</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>641</p>
                     </c>
                     <c ca="right">
                        <p>161</p>
                     </c>
                     <c ca="right">
                        <p>41</p>
                     </c>
                     <c ca="right">
                        <p>120</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total Qs/BAC</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>0.20</p>
                     </c>
                     <c ca="right">
                        <p>0.28</p>
                     </c>
                     <c ca="right">
                        <p>0.22</p>
                     </c>
                     <c ca="right">
                        <p>0.31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Mean no. Qs/contig</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>3.8 &#177; 4.7</p>
                     </c>
                     <c ca="right">
                        <p>6.0 &#177; 5.3*</p>
                     </c>
                     <c ca="right">
                        <p>3.7 &#177; 5.7</p>
                     </c>
                     <c ca="right">
                        <p>7.5 &#177; 4.6<sup>**&#8224;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Mean no. Qs/BAC/contig</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>0.16 &#177; 0.13</p>
                     </c>
                     <c ca="right">
                        <p>0.25 &#177; 0.17**</p>
                     </c>
                     <c ca="right">
                        <p>0.15 &#177; 0.16</p>
                     </c>
                     <c ca="right">
                        <p>0.31 &#177; 0.15<sup>**&#8224;&#8224;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total CB units</b>
                        </p>
                        <p>
                           <b>(estimated kb)</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>58,255</p>
                        <p>(68,857)</p>
                     </c>
                     <c ca="right">
                        <p>10,214</p>
                        <p>(12,072)</p>
                     </c>
                     <c ca="right">
                        <p>3,563</p>
                        <p>(4,211)</p>
                     </c>
                     <c ca="right">
                        <p>6,651</p>
                        <p>(7,861)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Mean CB units/contig</b>
                        </p>
                        <p>
                           <b>(estimated kb/contig)</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>344 &#177; 129</p>
                        <p>(407 &#177; 152)</p>
                     </c>
                     <c ca="right">
                        <p>378 &#177; 148</p>
                        <p>(447 &#177; 175)</p>
                     </c>
                     <c ca="right">
                        <p>324 &#177; 126</p>
                        <p>(383 &#177; 150)</p>
                     </c>
                     <c ca="right">
                        <p>415 &#177; 154*</p>
                        <p>(491 &#177; 182)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Contigs in which the probes co-localized were assessed as valid. Contigs in which the probes failed to co-localize were assessed as invalid. Invalid contigs were further divided into contigs that either had one or more repetitive BAC probes (Invalid-R) or no repetitive BAC probes (Invalid-NR). *Means significantly different from the Valid mean (Student's <it>t</it>; p &lt; 0.10). **Means significantly different from the Valid mean (Student's <it>t</it>; p &lt; 0.05). <sup>&#8224;</sup>Means significantly different from the Invalid-R mean (Student's <it>t</it>; p &lt; 0.10). <sup>&#8224;&#8224;</sup>Means significantly different from the Invalid-R mean (Student's <it>t</it>; p &lt; 0.05).</p>
               </tblfn>
            </tbl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Distribution of 266 FISH-positioned contigs in 26 Hessian fly chromosomal segments (A-Z)</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Chromosome segment</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>% Relative length</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>No. (%) of contigs</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>No. (%) of BAC clones</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>No. (%) of Qs</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>contig length (% of total)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>A*</p>
                     </c>
                     <c ca="right">
                        <p>5.3</p>
                     </c>
                     <c ca="right">
                        <p>23 (8.6)</p>
                     </c>
                     <c ca="right">
                        <p>374 (8.2)</p>
                     </c>
                     <c ca="right">
                        <p>64 (7.3)</p>
                     </c>
                     <c ca="right">
                        <p>8,642 (8.8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B</p>
                     </c>
                     <c ca="right">
                        <p>4.1</p>
                     </c>
                     <c ca="right">
                        <p>15 (5.6)</p>
                     </c>
                     <c ca="right">
                        <p>249 (5.5)</p>
                     </c>
                     <c ca="right">
                        <p>53 (6.0)</p>
                     </c>
                     <c ca="right">
                        <p>5,929 (6.0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c ca="right">
                        <p>8.1</p>
                     </c>
                     <c ca="right">
                        <p>19 (7.1)</p>
                     </c>
                     <c ca="right">
                        <p>282 (6.2)</p>
                     </c>
                     <c ca="right">
                        <p>49 (5.6)</p>
                     </c>
                     <c ca="right">
                        <p>6,058 (6.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>D</p>
                     </c>
                     <c ca="right">
                        <p>3.3</p>
                     </c>
                     <c ca="right">
                        <p>5 (1.9)</p>
                     </c>
                     <c ca="right">
                        <p>51 (1.2)</p>
                     </c>
                     <c ca="right">
                        <p>8 (0.9)</p>
                     </c>
                     <c ca="right">
                        <p>1,176 (1.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E</p>
                     </c>
                     <c ca="right">
                        <p>0.9</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>F*</p>
                     </c>
                     <c ca="right">
                        <p>5.3</p>
                     </c>
                     <c ca="right">
                        <p>23 (8.6)</p>
                     </c>
                     <c ca="right">
                        <p>460 (10.1)</p>
                     </c>
                     <c ca="right">
                        <p>118 (13.4)</p>
                     </c>
                     <c ca="right">
                        <p>9,145 (9.3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G*</p>
                     </c>
                     <c ca="right">
                        <p>2.6</p>
                     </c>
                     <c ca="right">
                        <p>7 (2.6)</p>
                     </c>
                     <c ca="right">
                        <p>88 (1.9)</p>
                     </c>
                     <c ca="right">
                        <p>17 (1.9)</p>
                     </c>
                     <c ca="right">
                        <p>2,412 (2.5)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>H</p>
                     </c>
                     <c ca="right">
                        <p>1.9</p>
                     </c>
                     <c ca="right">
                        <p>10 (3.8)</p>
                     </c>
                     <c ca="right">
                        <p>183 (4.0)</p>
                     </c>
                     <c ca="right">
                        <p>31 (3.5)</p>
                     </c>
                     <c ca="right">
                        <p>4,285 (4.4)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="right">
                        <p>1.7</p>
                     </c>
                     <c ca="right">
                        <p>4 (1.5)</p>
                     </c>
                     <c ca="right">
                        <p>57 (1.2)</p>
                     </c>
                     <c ca="right">
                        <p>9 (1.0)</p>
                     </c>
                     <c ca="right">
                        <p>1,209 (1.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>J*</p>
                     </c>
                     <c ca="right">
                        <p>2.9</p>
                     </c>
                     <c ca="right">
                        <p>6 (2.3)</p>
                     </c>
                     <c ca="right">
                        <p>185 (4.0)</p>
                     </c>
                     <c ca="right">
                        <p>45 (5.1)</p>
                     </c>
                     <c ca="right">
                        <p>3,059 (3.1)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>K*</p>
                     </c>
                     <c ca="right">
                        <p>4.3</p>
                     </c>
                     <c ca="right">
                        <p>13 (4.9)</p>
                     </c>
                     <c ca="right">
                        <p>281 (6.2)</p>
                     </c>
                     <c ca="right">
                        <p>76 (8.6)</p>
                     </c>
                     <c ca="right">
                        <p>6,142 (6.3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>L*</p>
                     </c>
                     <c ca="right">
                        <p>5.3</p>
                     </c>
                     <c ca="right">
                        <p>13 (4.9)</p>
                     </c>
                     <c ca="right">
                        <p>196 (4.3)</p>
                     </c>
                     <c ca="right">
                        <p>41 (4.6)</p>
                     </c>
                     <c ca="right">
                        <p>4,871 (5.0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>M*</p>
                     </c>
                     <c ca="right">
                        <p>2.6</p>
                     </c>
                     <c ca="right">
                        <p>1 (0.4)</p>
                     </c>
                     <c ca="right">
                        <p>10 (0.2)</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>314 (0.3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>N*</p>
                     </c>
                     <c ca="right">
                        <p>6.7</p>
                     </c>
                     <c ca="right">
                        <p>9 (3.4)</p>
                     </c>
                     <c ca="right">
                        <p>148 (3.2)</p>
                     </c>
                     <c ca="right">
                        <p>32 (3.6)</p>
                     </c>
                     <c ca="right">
                        <p>3,386 (3.4)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>O</p>
                     </c>
                     <c ca="right">
                        <p>4.3</p>
                     </c>
                     <c ca="right">
                        <p>9 (3.4)</p>
                     </c>
                     <c ca="right">
                        <p>192 (4.2)</p>
                     </c>
                     <c ca="right">
                        <p>58 (6.6)</p>
                     </c>
                     <c ca="right">
                        <p>4,142 (4.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>P*</p>
                     </c>
                     <c ca="right">
                        <p>2.9</p>
                     </c>
                     <c ca="right">
                        <p>16 (6.0)</p>
                     </c>
                     <c ca="right">
                        <p>345 (7.6)</p>
                     </c>
                     <c ca="right">
                        <p>67 (7.6)</p>
                     </c>
                     <c ca="right">
                        <p>6,679 (6.8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Q*</p>
                     </c>
                     <c ca="right">
                        <p>5.0</p>
                     </c>
                     <c ca="right">
                        <p>20 (7.5)</p>
                     </c>
                     <c ca="right">
                        <p>295 (6.5)</p>
                     </c>
                     <c ca="right">
                        <p>37 (4.2)</p>
                     </c>
                     <c ca="right">
                        <p>6,908 (7.0)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R</p>
                     </c>
                     <c ca="right">
                        <p>4.3</p>
                     </c>
                     <c ca="right">
                        <p>1 (0.4)</p>
                     </c>
                     <c ca="right">
                        <p>9 (0.2)</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>201 (0.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>S</p>
                     </c>
                     <c ca="right">
                        <p>1.1</p>
                     </c>
                     <c ca="right">
                        <p>5 (1.9)</p>
                     </c>
                     <c ca="right">
                        <p>93 (2.0)</p>
                     </c>
                     <c ca="right">
                        <p>13 (1.5)</p>
                     </c>
                     <c ca="right">
                        <p>2,129 (2.2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T*</p>
                     </c>
                     <c ca="right">
                        <p>3.1</p>
                     </c>
                     <c ca="right">
                        <p>12 (4.5)</p>
                     </c>
                     <c ca="right">
                        <p>188 (4.1)</p>
                     </c>
                     <c ca="right">
                        <p>33 (3.7)</p>
                     </c>
                     <c ca="right">
                        <p>3,795 (3.9)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>U*</p>
                     </c>
                     <c ca="right">
                        <p>2.1</p>
                     </c>
                     <c ca="right">
                        <p>5 (1.9)</p>
                     </c>
                     <c ca="right">
                        <p>94 (2.1)</p>
                     </c>
                     <c ca="right">
                        <p>19 (2.2)</p>
                     </c>
                     <c ca="right">
                        <p>1,899 (1.9)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>V</p>
                     </c>
                     <c ca="right">
                        <p>2.9</p>
                     </c>
                     <c ca="right">
                        <p>1 (0.4)</p>
                     </c>
                     <c ca="right">
                        <p>18 (0.4)</p>
                     </c>
                     <c ca="right">
                        <p>4 (0.5)</p>
                     </c>
                     <c ca="right">
                        <p>275 (0.3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>W</p>
                     </c>
                     <c ca="right">
                        <p>2.9</p>
                     </c>
                     <c ca="right">
                        <p>4 (1.5)</p>
                     </c>
                     <c ca="right">
                        <p>80 (1.8)</p>
                     </c>
                     <c ca="right">
                        <p>10 (1.1)</p>
                     </c>
                     <c ca="right">
                        <p>1,279 (1.3)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>X*</p>
                     </c>
                     <c ca="right">
                        <p>5.7</p>
                     </c>
                     <c ca="right">
                        <p>11 (4.1)</p>
                     </c>
                     <c ca="right">
                        <p>169 (3.7)</p>
                     </c>
                     <c ca="right">
                        <p>24 (2.7)</p>
                     </c>
                     <c ca="right">
                        <p>3,304 (3.4)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Y</p>
                     </c>
                     <c ca="right">
                        <p>2.4</p>
                     </c>
                     <c ca="right">
                        <p>2 (0.8)</p>
                     </c>
                     <c ca="right">
                        <p>20 (0.4)</p>
                     </c>
                     <c ca="right">
                        <p>1 (0.1)</p>
                     </c>
                     <c ca="right">
                        <p>449 (0.4)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Z*</p>
                     </c>
                     <c ca="right">
                        <p>8.3</p>
                     </c>
                     <c ca="right">
                        <p>23 (8.6)</p>
                     </c>
                     <c ca="right">
                        <p>398 (8.7)</p>
                     </c>
                     <c ca="right">
                        <p>67 (7.6)</p>
                     </c>
                     <c ca="right">
                        <p>7,980 (8.1)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Centromeric*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>9 (3.4)</p>
                     </c>
                     <c ca="right">
                        <p>98 (2.1)</p>
                     </c>
                     <c ca="right">
                        <p>6 (0.7)</p>
                     </c>
                     <c ca="right">
                        <p>2,563 (2.6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="right">
                        <p>100.0</p>
                     </c>
                     <c ca="right">
                        <p>266 (100)</p>
                     </c>
                     <c ca="right">
                        <p>4,563 (100)</p>
                     </c>
                     <c ca="right">
                        <p>882 (100)</p>
                     </c>
                     <c ca="right">
                        <p>98,231 (100)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Chromosome segments known to contain at least one contig with a clone that hybridized to a different segment.</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Hessian fly polytene chromosomes and BAC clone co-localizations</p>
               </caption>
               <text>
                  <p><b>Hessian fly polytene chromosomes and BAC clone co-localizations</b>. Chromosome segments (A-Z) are indicated on geimsa stained chromosomes (a, f, k, and p). White bars (10 &#956;m) indicate the relative lengths of corresponding DAPI stained chromosomes (b, g, l, and q). Four examples of co-localizing BAC clones are shown in color overlay images (e, j, o, and t). Contig #320 BAC clones CL4h4 (c) and CL31e21 (d) hybridized to chromosome A1 segment G (e). Contig #199 BAC clones Hf30g15 (h) and Hf4e2 (i) hybridized to chromosome A2 segment L (j). Note that Hf30g15 hybridization (green) was more distal than Hf4e2 hybridization (red), allowing contig orientation. Contig #320 BAC clones CL25e22 (m) and Hf10j14 (n) hybridized to chromosome X1 segment U (o). Contig #320 BAC clones CL29m24 (r) and Hf10j14 (s) hybridized to chromosome X2 segment X (t).</p>
               </text>
               <graphic file="1471-2164-10-293-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Resolving "invalid" contigs using FISH</p>
               </caption>
               <text>
                  <p><b>Resolving "invalid" contigs using FISH</b>. (a) The hybridization of BACs CL24B4 (green arrow, A2 segment L) and Hf15M12 (red arrows, pericentromeric heterochromatin) defined contig #36 as invalid and repetitive. (b) The hybridization of a third contig #36 BAC (CL24N4, yellow arrow) resolved the position of contig #36 to A2 segment L. (c) The hybridization of BACs CL17J4 (green arrow, A2 segment L) and Hf7B18 (red arrows, A1 segment J) defined contig #234 as invalid but non-repetitive. (d) Hybridization of a third contig #234 BAC clone (Hf13K2, yellow arrow) with BAC clone CL17J4 resolved the position of contig #234 to A2 segment L.</p>
               </text>
               <graphic file="1471-2164-10-293-3"/>
            </fig>
            <p>Using cytogenetic data to validate a physical map has been performed previously in Drosophila species <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. However, to our knowledge, this is the first time it has been used to anchor the physical map of an insect of agricultural importance. Our method was analogous to performing linkage analysis with the two most terminal BACs in the contigs. Using that approach, 18 of the 19 longest contigs (95%) in the catfish HICF map were validated <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Therefore, although the two approaches are not entirely comparable, our results suggest that the Hessian fly map may have a few more errors. Nevertheless, our results also clearly indicate that the Hessian fly assembly was largely valid and that most errors are associated with the contigs containing the greatest number of Qs. Moreover, they demonstrated that the Hessian fly map can be easily improved using FISH as a manual contig-editing tool.</p>
         </sec>
         <sec>
            <st>
               <p>Genome coverage</p>
            </st>
            <p>To further evaluate the coverage of the assembled contigs, FISH was used to position 70 additional contigs. In these experiments, only one BAC was used as probe, but we avoided invalid contigs by selecting those that had an average of only 0.1 &#177; 0.1 Qs per contig. These experiments brought the total number of BACs used as probes to 489 and the total number of contigs that had been FISH-mapped to 266 (Table <tblr tid="T4">4</tblr>). Of these contigs, 257 were assigned to single chromosome segments. Nine contigs hybridized to the pericentromeric heterochromatin of all four chromosomes, and could not be assigned to a single chromosome segment. The relative length of each chromosome segment was used as a measure of genome content so that the amount of coverage provided to each segment could be compared (Table <tblr tid="T4">4</tblr>). Except for the nucleolar organizing region (NOR), segment E, every chromosome segment contained at least one contig. The subtelomeric segments A, P, and Q, as well as segments B, F, and H, had a slightly greater proportion of contigs than their relative lengths. In the remaining segments, the proportion of contigs and BACs approximated relative length. Thus, the contigs appeared to be evenly distributed. Disregarding the FPC assembly errors previously discovered, and ignoring the centromeric contigs, 95,668 kb of genome length was covered by the FISH mapped contigs. We observed no evidence suggesting that any contig was exclusively associated with the E chromosomes. This coverage therefore represents approximately 60% of the total content of the Hessian fly autosomes and X chromosomes. Moreover, the heterochromatin of these chromosomes is restricted to the centromeres <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. These contigs therefore clearly appeared to be distributed in the gene rich regions of these chromosomes.</p>
            <p>The FPC assembly was developed into a publically available WebFPC archive that shows the BAC clones that were used as probes to physically anchor each contig <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Ordering the contigs that are currently grouped into chromosome segments will be one of the most immediate improvements we make to the map. The segments on the autosomes and the long arm of chromosome X1 are expected to provide sufficient resolution to order the contigs using FISH. However, morphologies of the short arm of chromosome X1 and all of chromosome X2 are problematic. We therefore expect that genetic mapping will be necessary to order the contigs in the corresponding segments.</p>
         </sec>
         <sec>
            <st>
               <p>BAC-end sequence</p>
            </st>
            <p>In order to fully utilize the physical map as a resource in genetic investigations, comparative analyses, and whole genome sequence assembly, we end sequenced the 13,614 fingerprinted BAC clones that were used to generate the physical map. This resulted in 21,814 sequenced BAC ends, 4,708 paired reads, and 13,351,753 bp of high quality bases of Hessian fly genomic DNA [GenBank Trace Archive TI numbers <ext-link ext-link-type="gen" ext-link-id="2136865139">2136865139</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="2136875614">2136875614</ext-link> and <ext-link ext-link-type="gen" ext-link-id="2136877165">2136877165</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="2136888504">2136888504</ext-link>]. If there were no overlap among the BAC ends, this sequence would represent 8.4% of the Hessian fly genome. The average number of bases sequenced per successfully sequenced BAC end was 764 &#177; 237. G/C content of all sequenced BACs was relatively low, averaging 33.4 &#177; 0.03%.</p>
            <p>A combination of a custom repeat library and RepBase database <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> was used to identify repetitive DNA sequence. Because the BACs were derived from heterogeneous strains, this analysis required parameters that limit the possibility of confusing alleles with gene families. Therefore, only repetitive sequences that were >40 bp and present in = 5 copies were selected. These criteria made identification of gene families more likely, given the low coverage of BES. Using these settings, we found that 17.91% of the BES contained repetitive elements (Table <tblr tid="T5">5</tblr>). There was, however, a 1.48% overestimate of repetitive content due to the difficulty of assigning boundaries to different repeat classes. Transposable Elements (TEs) composed only 4.53% of the repetitive DNA and 0.81% of total BES (Table <tblr tid="T5">5</tblr>). Combined, Helicases, ribosomal DNA (including tRNA, rRNA and 60S and 40S ribosomal proteins), simple sequence repeats (SSRs), and sequence of low complexity composed 15.79% of the repetitive fraction. The largest fraction of repetitive DNA (79.68%) consisted of multigene families and unclassified TEs. This observation was consistent with previous investigations of the Hessian fly salivary gland transcriptome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B29">29</abbr></abbrgrp> that discovered over 200 gene families of small putatively secreted proteins with no sequence similarities to other genes in GenBank. In the BES, the most prominent multigene families identified appeared to be tandem copies of the calponin homology domain (an actin binding domain), thrombospondin type1 repeats (which bind and activate TGF-beta), U1 small nuclear ribonucleoprotein C proteins (involved in mRNA splicing), and SMC-hinge family genes (involved in the structural maintenance of chromosomes).</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Repetitive sequence composition of Hessian fly BAC ends.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Repeat class</p>
                     </c>
                     <c ca="left">
                        <p>Repeat type</p>
                     </c>
                     <c ca="right">
                        <p>Total bp</p>
                     </c>
                     <c ca="right">
                        <p>% BES</p>
                     </c>
                     <c ca="right">
                        <p>% Total repetitive</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Class I transposable elements</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>62,774</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.47</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>2.64</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>LTRs</p>
                     </c>
                     <c ca="right">
                        <p>36,337</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>1.52</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Non-LTRs</p>
                     </c>
                     <c ca="right">
                        <p>17,197</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>0.72</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Unknown</p>
                     </c>
                     <c ca="right">
                        <p>9,240</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>0.39</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Class II transposable elements</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>
                           <b>45,599</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.34</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>1.91</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>hAT</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>67</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mos 1</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>39,806</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>1.66</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>mariner</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Other</p>
                     </c>
                     <c ca="right">
                        <p>3,664</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>0.15</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>mariner</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Other</p>
                     </c>
                     <c ca="right">
                        <p>2,062</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>0.08</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Simple sequence repeats</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>107,962</p>
                     </c>
                     <c ca="right">
                        <p>0.81</p>
                     </c>
                     <c ca="right">
                        <p>4.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Low-complexity DNA</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>239,182</p>
                     </c>
                     <c ca="right">
                        <p>1.79</p>
                     </c>
                     <c ca="right">
                        <p>10.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ribosomal</p>
                     </c>
                     <c ca="left">
                        <p>rRNA</p>
                     </c>
                     <c ca="right">
                        <p>17,249</p>
                     </c>
                     <c ca="right">
                        <p>0.13</p>
                     </c>
                     <c ca="right">
                        <p>0.72</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>tRNA</p>
                     </c>
                     <c ca="right">
                        <p>536</p>
                     </c>
                     <c ca="right">
                        <p>0.00</p>
                     </c>
                     <c ca="right">
                        <p>0.02</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>protein</p>
                     </c>
                     <c ca="right">
                        <p>3918</p>
                     </c>
                     <c ca="right">
                        <p>0.03</p>
                     </c>
                     <c ca="right">
                        <p>0.16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Helicases</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>8,862</p>
                     </c>
                     <c ca="right">
                        <p>0.07</p>
                     </c>
                     <c ca="right">
                        <p>0.37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gene families &amp; unclassified TEs</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>1,905,881</p>
                     </c>
                     <c ca="right">
                        <p>14.27</p>
                     </c>
                     <c ca="right">
                        <p>79.68</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total Repetitive</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>2,391,963</p>
                     </c>
                     <c ca="right">
                        <p>17.91</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total BES</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>13,351,753</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Within the Class I TEs, LTR retrotransposons were more prominent than the non-LTR retrotransposons (Table <tblr tid="T5">5</tblr>). The most prominent Class II TE was the <it>Mos1 mariner</it>-like DNA transposable element, which accounted for 1.7% of the repetitive fraction. The remaining Class II TEs composed 0.24% of the repetitive sequence, which had sequence similarities to the <it>Tigger2</it>, <it>Blackjack</it>, <it>Looper</it>, <it>MER45R</it>, and <it>Zaphod </it>TEs. Previous investigation found an abundance of <it>mariner</it>-like elements in Hessian fly pericentromeric heterochromatin <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Therefore, it was interesting to note that the combined proportions of TEs and low-complexity sequence in the BES (2.6%) was virtually the same as the proportion of the BACs that hybridized to pericentromeric heterochromatin in the FISH experiments that anchored the physical map to the chromosomes (2.4%). These observations were consistent with a low abundance of TEs in the euchromatin of the small Hessian fly genome, and they suggested that the BACs that hybridized to heterochromatin contained one or more TEs. This in turn suggests that seven of the eleven contigs that were initially classified as "invalid-repetitive" (Table <tblr tid="T3">3</tblr>) would have been classified as "valid" if not for the presence of one or more TEs in the BACs used as probe.</p>
            <p>SSRs, which have a wide range of applications <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, formed 0.81% of the total BES and 4.51% of the total repetitive fraction. The di- and tri-nucleotide repeats were 29.1% and 33.4% of the SSRs, respectively, and were the most frequent types of SSRs found in this analysis. The most abundant di- and tri-nucleotide motifs were (TC/GA)n and (TAA/TTA)n, which composed 49.9% of the total di-nucleotides and 43.7% of the total tri-nucleotides, respectively. The value of 'n' ranged from 10 to 56 in the (TC/GA)n motif and from 7 to 76 in the (TAA/TTA)n motif.</p>
            <p>To identify putative genic sequences in the BES, both the repeatmasked and unmasked BESs were searched against the NCBI non-redundant database using BLASTX <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Of the 20,487 total unmasked BES analyzed, 4,221 had blast hits, of which 2,770 were mapped to gene ontology (GO) terms and 1,847 were annotated. For the repeatmasked BES dataset, 3,639 had BLAST hits, 2,374 were assigned to GO categories and 1,565 were returned with annotations. Among completely sequenced genomes, mosquitoes, which belong to the same suborder (Nematocera) as the Hessian fly, had the highest percentages of the top scoring BLASTX hits (<it>Aedes aegypti</it>, 17.3%; <it>Culex quinquefasciatus</it>, 15.7%; and <it>Anopheles gambiae</it>, 10.7%). Of the total number of reads that had a blast hit, 3,386 hit a conserved protein sequence in an insect genome. In both the repeatmasked and unmasked datasets the largest GO categories comprised genes involved in transcription and cellular communication. For example, genes involved in protein binding, nucleic acid and nucleotide binding, hydrolase activity, transferase and signal transduction activities comprised the largest GO categories (86% of the total) based on the association with gene ontology terms in the molecular function category [see Additional file <supplr sid="S1">1</supplr>]. Interestingly, fewer annotated sequences were observed in the repeatmasked set for almost all categories. This observation indicated that a proportion of the predicted gene products were derived from the repetitive sequences, which when masked, resulted in lower numbers of annotations. This was also consistent with the earlier observation that multigene families composed a relatively large proportion of the repetitive BES. To examine this further, we unmasked the SSRs so that they were excluded from the repeatmasked set [see Additional file <supplr sid="S2">2</supplr>]. Again, we observed fewer annotations with that data set than with unmasked BES. Therefore, multigene families and transcripts from transposable elements probably accounted for the observed differences between the two data sets.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Gene ontology (GO) classification of Hessian fly BAC-end sequences</b>. Unmasked and repeatmasked BAC-end sequence data sets were annotated and assigned to 16 molecular function gene ontology categories. Sequences with simple sequence repeats (SSRs) were included in the repeatmasked data set. The numbers of annotated sequences were greater in the unmasked set than in the repeatmasked set.</p>
               </text>
               <file name="1471-2164-10-293-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Gene ontology (GO) classification of Hessian fly BAC-end sequences (SSRs unmasked)</b>. Unmasked and repeatmasked BAC-end sequence data sets were annotated and assigned to 16 molecular function gene ontology categories. Sequences with simple sequence repeats (SSRs) were excluded from the repeatmasked data set. The numbers of annotated sequences were greater in the unmasked set than in the repeatmasked set.</p>
               </text>
               <file name="1471-2164-10-293-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Hessian fly's small genome and the polytene chromosomes provided an excellent opportunity to determine if BAC libraries constructed from heterogeneous strains would unduly limit the mapping abilities of the HICF and FPC technologies. Although less heterogeneity would have been preferred, the results have clearly improved the Hessian fly as an experimental model. A first generation map now exists that includes 266 FPC contigs containing 4,563 BAC clones and their associated BES positioned to 26 segments of the Hessian fly genome. The BES provided new knowledge regarding Hessian fly genome organization, and the mapped BES constitutes an abundant resource for the development of physically mapped DNA markers. We expect that these improvements will be made as genetic investigations focus on the discovery of avirulence genes, the genes that are involved in plant-gall formation, and the mechanisms of chromosome elimination. In the near future, we also expect the map to facilitate the assembly of whole-genome shotgun sequences and the assignment of sequenced scaffolds to chromosome segments.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>BAC libraries</p>
            </st>
            <p>Three BAC libraries were utilized in this investigation. Each library was constructed using a separate heterogeneous source of genomic DNA. The Hf library, previously described by Liu <it>et al</it>. 2004 <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, was prepared from DNA isolated from a Kansas "Great Plains" (GP) population. To supplement the Hf library, we used clones derived from the MD_Bb library developed by the Clemson University Genomics Institute <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Herein referred to as the "CL" library, these BACs were prepared from DNA isolated from an Indiana "L" Hessian fly population. From a third library (Mde), 78 BACs were fingerprinted. Seventy-three of these clones were present in two contigs that were previously developed by chromosome walking <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. These clones therefore served as an internal control while developing contigs from the fingerprint data using FPC software <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The Mde library was constructed with DNA isolated from the Georgia derived "vH13" Hessian fly population as described previously <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Plates and filters of all of these clones are available on a cost-recovery basis by request from the Purdue Genomics Center <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>DNA fingerprinting</p>
            </st>
            <p>High-information content DNA fingerprints of BACs were obtained using the SNaPshot Primer Extension Kit (Applied Biosystems, CA) as described by Luo <it>et al</it>. (2003) <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> with little modification. Each BAC clone was grown in a separate well of a 96-well micro-plate (QIAGEN, CA) containing 150 &#956;l of 2&#215; YT medium (12.5 &#956;g/ml of chloramphenicol). BAC DNA was extracted from each well using the R.E.A.L 96-prep kit following the manufacturer's recommendations (QIAGEN, CA). A QIAvac 96 Manifold, attached to a BioRobot 3000 (QIAGEN, CA), was used to vacuum (-930 mbar) collect the lysate through the QIAfilter into new 96-well blocks. The DNA was then precipitated with iso-propanol, collected by centrifugation, and re-suspended in 40 &#956;l of distilled (dd) H<sub>2</sub>O. The DNA concentration in four wells per plate was then determined by agarose gel electrophoresis. DNA concentration ranged from 50 to 100 ng/&#956;l.</p>
            <p>To fingerprint each BAC, 1 &#956;g of each sample was transferred to a separate well of a 96-well plate. This DNA was then restricted at 37&#176;C for 3 h with 4-units of each of five restriction endonucleases (<it>Eco</it>RI, <it>Bam</it>HI, <it>Xba</it>I and <it>Xho</it>I and <it>Hae</it>III) in 50 &#956;l of a solution containing 5 mM NaCL, 10 mM Tris-Cl (pH 7.9), 1 mM MgCl<sub>2</sub>, 0.1 mM dithiothreitol, 100 &#956;g/&#956;l bovine serum albumin, 0.5 &#956;g/&#956;l DNase free RNase A, and 0.02% &#946;-mercaptoethanol. After the restriction digestions, 10 &#956;l of a solution containing 0.4 &#956;l of the SNaPshot multiplex kit, 7 &#956;l of 33 mM Tris-Cl (pH 9.0), 50 &#956;moles of NaCl, and 10 &#956;moles of MgCl<sub>2 </sub>were added directly to the solution in each well. This mixture was incubated at 65&#176;C for 1 h. The labeled DNA was then precipitated and air-dried. To prepare each sample for analysis, each labeled DNA pellet was suspended in 6 &#956;l of Hi-Di Formamide (Applied Biosystems, CA) and 3 &#956;l of ddH<sub>2</sub>O. To this solution, 0.05 &#956;l of an internal size standard (Liz 500; Applied Biosystems, CA) was added. The DNA was then denatured at 95&#176;C and the labeled fragments were separated and sized using an ABI 3730 DNA analyzer.</p>
         </sec>
         <sec>
            <st>
               <p>Fluorescence in situ hybridization (FISH)</p>
            </st>
            <p>Polytene chromosome preparations and <it>in situ </it>hybridizations were performed as previously described <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Briefly, BAC DNA (1 &#956;g) was labeled with either biotin- or digoxigenin-conjugated dUTP (Roche) by nick-translation. Hybridizations were performed overnight at 37&#176;C in a solution (10 &#956;l) containing 40&#8211;100 ng of denatured probe DNA in 10 &#956;l of hybridization solution (10% dextran sulfate, 2&#215; SSC, 50% deionized formamide, and 10 &#956;g denatured salmon sperm DNA) under a coverslip. Detection was performed using Alexa Fluor 488-conjugated anti-biotin and rhodamine-conjugated anti-digoxigenin (Molecular Probes-Invitrogen). When testing for the co-localization of BACs in the same contig, both labeled BACs (one labeled with biotin and the other labeled with digoxigenin) were hybridized simultaneously to the same polytene chromosome preparations with the expectation that the probes would co-localize if the contigs were genuine. Digital images were taken using UV optics on an ORCA-ER (Hammanmatsu) digital camera mounted on an Olympus BX51 microscope, and MetaMorph (Universal Imaging Corp.) imaging software.</p>
         </sec>
         <sec>
            <st>
               <p>Chromosome walking</p>
            </st>
            <p>Two BAC contigs were used to test the reliability of the FPC-based Hessian fly contigs. These contigs were constructed by chromosome walking from two markers flanking the Hessian fly <it>Avirulence </it>gene, <it>vH13</it>. The walk from marker 124 was previously described <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The walk from marker 134 is described above (Figure <figr fid="F3">3</figr>). Chromosome walking experiments consisted of BAC library screens using <sup>32</sup>P-dCTP-labeled probes prepared from markers and BAC-end DNA sequence. Probe labeling and the isolation of BAC-end DNA were performed as described previously <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>BAC-end Sequence Analysis</p>
            </st>
            <p>BAC clone DNA was prepared and sequenced in an automated process using a 384-well format. BAC clones were grown in 375 &#956;l of sterile TB supplemented with chloramphenicol (12.5 &#956;g/ml) for 20 hours at 30&#176;C. Alkaline lysis was used to isolate BAC DNA. Sequencing was performed in 7.5 &#956;l volumes containing 0.5 &#956;l of Big Dye Terminator v3.1 (Applied Biosystems), 1.8 &#956;l of Big Dye Terminator v1.1/v3.1 5&#215; buffer (Applied Biosystems), 3 pmoles of primer, and 100 to 250 ng of BAC DNA. Oligonucleotides T7-ZL (TAATACGACTCACTATAGGG) and BES-HR (CACTCATTAGGCACCCCA) were used to prime separate reactions from opposite ends of each BAC template. Those reactions were performed in a GeneAmp PCR System 9700 (Applied Biosystems). Using primer T7-ZL, sequencing reactions underwent 120 cycles of 96&#176;C for 10 s, 45&#176;C for 5 s, and 60&#176;C for 8 min. Using primer BES-HR, sequencing reactions underwent 120 cycles of 96&#176;C for 10 s, 45&#176;C for 5 s, and 52&#176;C for 8 min. The sequencing products were cleaned using ethanol precipitation and then resuspended in 15 &#956;l of ddH2O. Sequences were then determined using a 3730XL Genetic Analyzer (Applied Biosystems).</p>
            <p>Analysis of repetitive sequence in BES was performed using a custom repeat database combined with RepBase database version 20061006 <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The custom repeat database was constructed using RECON <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>- a <it>de novo </it>approach to identify repetitive sequences. PERL scripts were used to select repetitive sequences greater than 40 bp in length and present in 5 or more copies, which were then annotated using BLASTX <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> at e = 10<sup>-5 </sup>to the NCBI non-redundant database. The annotated repeat database was used as a custom repeat library for RepeatMasker <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. RepeatMasker version 3.1.9 was used at a default mode with WU-BLAST as the search engine (blastp version 2.0 MP-Washington University) to mask the repetitive sequences in the BESs. Another round of RepeatMasker using the RepBase update 20061006 (RepBase database version 20061006) was also run on the same set of sequences. Results from both runs were combined after manually removing the overlaps. The masked sequences were then categorized into different classes of repetitive sequences.</p>
            <p>To identify potential genic sequences and to look for differences between the repeat-masked and unmasked BESs, we used both sets of sequences for GO analysis. The two sets of sequences were used as queries to the NCBI non-redundant database using BLASTX (e = 10<sup>-5</sup>). The BLAST output in the XML format was imported into BLAST2GO (B2G) for GO analysis and functional annotation of gene or protein sequences <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The resulting annotations were converted into 'GO-Slim' format and retrieved for the three GO categories (biological process, molecular function and cell component) with an alpha score of at least 0.6 and an ontology depth level of 3.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>RA and TB performed BAC fingerprinting, and FISH, and participated in drafting the manuscript. RA performed contig assembly. NG analyzed BES and participated in drafting the manuscript. CZ performed FISH and critically reviewed the manuscript. JPF optimized BAC fingerprinting and supervised its implementation. BS performed BAC-end sequencing and critically reviewed the manuscript. JPF, MSC, and JJS conceived, initiated, and supervised the project. JPF and MSC critically reviewed the manuscript. JJS performed chromosome walking, data analysis, and drafted the manuscript. All authors read and agreed on its final version.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Sue Cambron (USDA-ARS) for material contributions and Alison Witt, Phillip San Miguel, and Rick Westerman, of the Purdue Genomics Center, for technical and bioinformatics assistance. The authors would also like to thank William M Nelson, Jeff Tomkins, Catherine Hill, Scott Jackson, Shannon Schlueter, Jessica Schlueter, and Brian Abernathy for valuable contributions and assistance. This work was supported by the USDA Cooperative State Research Education and Extension Service National Research Initiative Grant #2004-03099 and the Indiana 21<sup>st </sup>Century Research and Technology Fund Grant 042700-0207.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Grasses and gall midges: Plant defense and insect adaptation</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Mohan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nair</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lamb</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Rohfritsch</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Annu Rev Entomol</source>
            <pubdate>2003</pubdate>
            <volume>48</volume>
            <fpage>549</fpage>
            <lpage>577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.ento.48.091801.112559</pubid>
                  <pubid idtype="pmpid" link="fulltext">12460937</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Genetics of the ability of the Hessian fly, <it>Mayetiola destructor</it>, to survive on wheats having different genes for resistance</p>
            </title>
            <aug>
               <au>
                  <snm>Hatchett</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Gallun</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Ann Entomol Soc Am</source>
            <pubdate>1970</pubdate>
            <volume>63</volume>
            <issue>5</issue>
            <fpage>1400</fpage>
            <lpage>1407</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Cytology: Chromosome elimination in Cecidomyiidae</p>
            </title>
            <aug>
               <au>
                  <snm>Bantock</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1961</pubdate>
            <volume>190</volume>
            <fpage>466</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/190466a0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Experiments on chromosome elimination in the gall midge, <it>Mayetiola destructor</it></p>
            </title>
            <aug>
               <au>
                  <snm>Bantock</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>J Embryol Exp Morph</source>
            <pubdate>1970</pubdate>
            <volume>24</volume>
            <issue>2</issue>
            <fpage>257</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubid idtype="pmpid">5497456</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Animal Cytology and Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>MJD</fnm>
               </au>
            </aug>
            <source>Cambridge university press</source>
            <pubdate>1973</pubdate>
            <volume>3</volume>
            <fpage>516</fpage>
            <lpage>546</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Morphogenesis and cytology of the salivary gland of the Hessian fly, <it>Mayetiola destructor </it>(Diptera: Cecidomyiidae)</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hatchett</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Ann Entomol Soc Am</source>
            <pubdate>1987</pubdate>
            <volume>80</volume>
            <issue>4</issue>
            <fpage>475</fpage>
            <lpage>482</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Tiny genomes and endoreduplication in Strepsiptera</p>
            </title>
            <aug>
               <au>
                  <snm>Johnston</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Ross</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Bean</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Kathirithamby</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <issue>6</issue>
            <fpage>581</fpage>
            <lpage>585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.0962-1075.2004.00514.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15606806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A physically anchored genetic map and linkage to avirulence reveal recombination suppression over the proximal region of Hessian fly chromosome A2</p>
            </title>
            <aug>
               <au>
                  <snm>Behura</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Valicente</snm>
                  <fnm>FH</fnm>
               </au>
               <au>
                  <snm>SD Rider</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2004</pubdate>
            <volume>167</volume>
            <fpage>343</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470865</pubid>
                  <pubid idtype="pmpid">15166159</pubid>
                  <pubid idtype="doi">10.1534/genetics.167.1.343</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Genomic analysis of a 1 Mb region near the telomere of Hessian fly chromosome X2 and avirulence gene vH13</p>
            </title>
            <aug>
               <au>
                  <snm>Lobo</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Behura</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Aggarwal</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>M-S</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1352350</pubid>
                  <pubid idtype="pmpid" link="fulltext">16412254</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A super-family of genes coding for secreted salivary gland proteins from the Hessian fly, Mayetiola destructor</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Fellers</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hulbert</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>El-Bouhssini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>J Insect Sci</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>12</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1673/2006.06.12.1</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Analysis of transcripts and proteins expressed in the salivary glands of Hessian fly (<it>Mayetiola destructor</it>) larvae</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>M-S</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>H-X</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Scheffler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hulbert</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>J Insect Phys</source>
            <pubdate>2008</pubdate>
            <volume>54</volume>
            <fpage>1</fpage>
            <lpage>16</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jinsphys.2007.07.007</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Cytogenetics of the Hessian fly, <it>Mayetiola destructor </it>(Say): II. Inheritance and behavior of somatic and germ-line-limited chromosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hatchett</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>1988</pubdate>
            <volume>79</volume>
            <fpage>190</fpage>
            <lpage>199</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genetics of sex determination in the Hessian fly, <it>Mayetiola destructor</it></p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hatchett</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>1991</pubdate>
            <volume>82</volume>
            <issue>1</issue>
            <fpage>43</fpage>
            <lpage>52</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>High-throughput fingerprinting of bacterial artifidcial chromosomes using the SNaPshot labeling kit and sizing of restriction fragments by capillary electrophoresis</p>
            </title>
            <aug>
               <au>
                  <snm>Luo</snm>
                  <fnm>M-C</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>You</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Hsiao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ouyang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Buell</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Malandro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>McGuire</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>OD</fnm>
               </au>
               <au>
                  <snm>Dvorak</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2003</pubdate>
            <volume>82</volume>
            <fpage>378</fpage>
            <lpage>389</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0888-7543(03)00128-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">12906862</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Whole-genome validation of high-information-content fingerprinting</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Bharti</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Fuks</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Messing</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>139</volume>
            <fpage>27</fpage>
            <lpage>38</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1203355</pubid>
                  <pubid idtype="pmpid" link="fulltext">16166258</pubid>
                  <pubid idtype="doi">10.1104/pp.105.061978</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A BAC-based physical map of the Nile tilapia genome</p>
            </title>
            <aug>
               <au>
                  <snm>Katagiri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tomasino</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Wishon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Carleton</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>89</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1180826</pubid>
                  <pubid idtype="pmpid" link="fulltext">15946383</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-89</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A first generation BAC-based physical map of the channel catfish genome</p>
            </title>
            <aug>
               <au>
                  <snm>Quiniou</snm>
                  <fnm>SM-A</fnm>
               </au>
               <au>
                  <snm>Waldbieser</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>MV</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>40</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800894</pubid>
                  <pubid idtype="pmpid" link="fulltext">17284319</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-8-40</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The first generation of a BAC-based physical map of <it>Brassica rapa</it></p>
            </title>
            <aug>
               <au>
                  <snm>Mun</snm>
                  <fnm>J-H</fnm>
               </au>
               <au>
                  <snm>S-J</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>T-J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H-S</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>M-H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H-I</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <etal/>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>280</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2432078</pubid>
                  <pubid idtype="pmpid" link="fulltext">18549474</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-9-280</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>FPC: a system of building contigs from restriction fingerprinted clones</p>
            </title>
            <aug>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mott</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>523</fpage>
            <lpage>535</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Contigs built with fingerprints, markers, and FPC v4.7</p>
            </title>
            <aug>
               <au>
                  <snm>Soderlund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Humphray</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dunham</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>French</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>1772</fpage>
            <lpage>1787</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310962</pubid>
                  <pubid idtype="pmpid" link="fulltext">11076862</pubid>
                  <pubid idtype="doi">10.1101/gr.GR-1375R</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>WebFPC: Hfly</p>
            </title>
            <url>http://genome.purdue.edu/WebAGCoL/Hfly/WebFPC/</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Tao</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>Y-L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Islam-Faridi</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Scheuring</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stelly</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H-B</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>158</volume>
            <fpage>1711</fpage>
            <lpage>1724</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461754</pubid>
                  <pubid idtype="pmpid">11514457</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>BAC as tools for genome sequencing</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H-B</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Plant Physiol Biochem</source>
            <pubdate>2001</pubdate>
            <volume>39</volume>
            <fpage>195</fpage>
            <lpage>209</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0981-9428(00)01236-5</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Mapping and sequencing complex genomes: Let's get physical</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Scalabrin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>578</fpage>
            <lpage>589</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15266340</pubid>
                  <pubid idtype="doi">10.1038/nrg1404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A BAC-based physical map of the major autosomes of Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Hoskins</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Berman</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Laverty</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>George</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Ciesiolka</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Naeemuddin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Arenson</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>RG</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2271</fpage>
            <lpage>2274</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5461.2271</pubid>
                  <pubid idtype="pmpid" link="fulltext">10731150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A BAC-based physical map of the <it>Drosophila buzzatii </it>genome</p>
            </title>
            <aug>
               <au>
                  <snm>Gonz&#225;lez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nefedov</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bosdet</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Casals</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Calvete</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Delprat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mathewson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wye</snm>
                  <fnm>N</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>885</fpage>
            <lpage>892</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142479</pubid>
                  <pubid idtype="pmpid" link="fulltext">15930498</pubid>
                  <pubid idtype="doi">10.1101/gr.3263105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Cytogenetics of the Hessian fly: I. Mitotic karyotype analysis and polytene chromosome correlations</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hatchett</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>1988</pubdate>
            <volume>79</volume>
            <fpage>184</fpage>
            <lpage>189</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">19554751</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Repbase Update, a database of eukaryotic repetitive elements</p>
            </title>
            <aug>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Pavlicek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klonowski</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Walichiewicz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cytogentic and Genome Research</source>
            <pubdate>2005</pubdate>
            <volume>110</volume>
            <fpage>462</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1159/000084979</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A group of unrelated cDNAs encoding secreted proteins from Hessian fly [Mayetiola destructor (Say)] salivary glands</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Fellers</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Reese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XM</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <fpage>101</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2583.2004.00465.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14728671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p><it>Mariner </it>transposase-like sequences from the Hessian fly, <it>Mayetiola destructor</it></p>
            </title>
            <aug>
               <au>
                  <snm>Shukle</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>VW</fnm>
               </au>
            </aug>
            <source>J of Hered</source>
            <pubdate>1995</pubdate>
            <volume>86</volume>
            <fpage>364</fpage>
            <lpage>368</lpage>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Microsatellites: simple sequences with complex evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Ellegren</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>435</fpage>
            <lpage>445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15153996</pubid>
                  <pubid idtype="doi">10.1038/nrg1348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Molec Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Characterization of two genes expressed in the salivary glands of the Hessian fly [Mayetiola destructor (Say)</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Fellers</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Wilde</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Insect Biochem Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>34</volume>
            <issue>3</issue>
            <fpage>229</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ibmb.2003.10.008</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Clemson University Genomics Institute Homepage</p>
            </title>
            <url>https://www.genome.clemson.edu</url>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Purdue Genomics Center Homepage</p>
            </title>
            <url>http://www.genomics.purdue.edu</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Physical mapping of DNA sequences in the Hessian fly, <it>Mayetiola destructor</it></p>
            </title>
            <aug>
               <au>
                  <snm>Shukle</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>1995</pubdate>
            <volume>86</volume>
            <fpage>1</fpage>
            <lpage>5</lpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Automated de novo identification of repeat sequence families in sequenced genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>1269</fpage>
            <lpage>1276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186642</pubid>
                  <pubid idtype="pmpid" link="fulltext">12176934</pubid>
                  <pubid idtype="doi">10.1101/gr.88502</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Institute for Systems Biology: Repeat Masker</p>
            </title>
            <url>http://www.repeatmasker.org/</url>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research</p>
            </title>
            <aug>
               <au>
                  <snm>Conesa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>G&#246;tz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Garc&#237;a-Gom&#233;z</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Terol</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tal&#243;n</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Robles</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>18</issue>
            <fpage>3674</fpage>
            <lpage>3676</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti610</pubid>
                  <pubid idtype="pmpid" link="fulltext">16081474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
