<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-103</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>SinicView: A visualization environment for comparisons of multiple nucleotide sequence alignment tools</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Shih</snm>
               <mnm>Chun-Chieh</mnm>
               <fnm>Arthur</fnm>
               <insr iid="I1"/>
               <email>arthur@iis.sinica.edu.tw</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Lee</snm>
               <fnm>DT</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>dtlee@ieee.org</email>
            </au>
            <au id="A3">
               <snm>Lin</snm>
               <fnm>Laurent</fnm>
               <insr iid="I1"/>
               <email>laurent@iis.sinica.edu.tw</email>
            </au>
            <au id="A4">
               <snm>Peng</snm>
               <fnm>Chin-Lin</fnm>
               <insr iid="I2"/>
               <email>coolpon@gate.sinica.edu.tw</email>
            </au>
            <au id="A5">
               <snm>Chen</snm>
               <fnm>Shiang-Heng</fnm>
               <insr iid="I1"/>
               <email>shiangheng@gmail.com</email>
            </au>
            <au id="A6">
               <snm>Wu</snm>
               <fnm>Yu-Wei</fnm>
               <insr iid="I1"/>
               <email>karlon@iis.sinica.edu.tw</email>
            </au>
            <au id="A7">
               <snm>Wong</snm>
               <fnm>Chun-Yi</fnm>
               <insr iid="I1"/>
               <email>robinw@iis.sinica.edu.tw</email>
            </au>
            <au id="A8">
               <snm>Chou</snm>
               <fnm>Meng-Yuan</fnm>
               <insr iid="I1"/>
               <email>mychou@iis.sinica.edu.tw</email>
            </au>
            <au id="A9">
               <snm>Shiao</snm>
               <fnm>Tze-Chang</fnm>
               <insr iid="I1"/>
               <email>supera@iis.sinica.edu.tw</email>
            </au>
            <au id="A10">
               <snm>Hsieh</snm>
               <fnm>Mu-Fen</fnm>
               <insr iid="I1"/>
               <email>mufen@tamu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan</p>
            </ins>
            <ins id="I2">
               <p>Genomics Research Center, Academia Sinica, Taipei, 115, Taiwan</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>103</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/103</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16509994</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-103</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>02</day>
               <month>3</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>02</day>
               <month>3</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Shih et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Deluged by the rate and complexity of completed genomic sequences, the need to align longer sequences becomes more urgent, and many more tools have thus been developed. In the initial stage of genomic sequence analysis, a biologist is usually faced with the questions of how to choose the best tool to align sequences of interest and how to analyze and visualize the alignment results, and then with the question of whether poorly aligned regions produced by the tool are indeed not homologous or are just results due to inappropriate alignment tools or scoring systems used. Although several systematic evaluations of multiple sequence alignment (MSA) programs have been proposed, they may not provide a standard-bearer for most biologists because those poorly aligned regions in these evaluations are never discussed. Thus, a tool that allows cross comparison of the alignment results obtained by different tools simultaneously could help a biologist evaluate their correctness and accuracy.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In this paper, we present a versatile alignment visualization system, called SinicView, (for Sequence-aligning INnovative and Interactive Comparison VIEWer), which allows the user to efficiently compare and evaluate assorted nucleotide alignment results obtained by different tools. SinicView calculates similarity of the alignment outputs under a fixed window using the sum-of-pairs method and provides scoring profiles of each set of aligned sequences. The user can visually compare alignment results either in graphic scoring profiles or in plain text format of the aligned nucleotides along with the annotations information. We illustrate the capabilities of our visualization system by comparing alignment results obtained by MLAGAN, MAVID, and MULTIZ, respectively.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>With SinicView, users can use their own data sequences to compare various alignment tools or scoring systems and select the most suitable one to perform alignment in the initial stage of sequence analysis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>With exponentially increasing genomic sequences available in the public domain <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> comparative genomics demonstrates its power to help biologists identify novel conserved and functional regions in genomes <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Based on the comparison of cross-species genomic sequences, biologists can understand the evolutionary relationship of genomic regions among species, discover conserved regions between different genomes, such as yeast species genomes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, metazoan genomes <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, vertebrate genomes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and mammalian genomes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, discover regulatory motifs in the yeast <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and human promoters <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> or identify potential conserved non-genic sequences (CNGs) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>However, genomic sequences can be megabase long and thus the traditional sequence alignment tools based on dynamic programming would not work efficiently due to their time and space complexities. To better tackle this problem, several tools for genomic sequence alignment have been proposed, such as pairwise sequence aligners like MUMmer <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, GS-Aligner <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, Avid <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and LAGAN <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and multiple sequence alignment (MSA) programs like T-COFFEE<abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, MAFFT <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, MultiPipMaker <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, MULTIZ <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, MLAGAN <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, MAVID <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and MUSCLE <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. These alignment tools, however, are heuristics based and do not provide any indication of how far they are from an optimal solution. The comparisons of alignment tools using a set of benchmarking sequences have also been conducted in recent years <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. We found that the majority of these tools usually fail to generate consistent results especially in aligning divergent cross-species sequences. As a result, the more alignment tools there are available in the public domain, the more confusion it creates for users to decide which tool is most suitable to align their sequences.</p>
         <p>Although the comparison results in <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp> provide some evaluations of several popular alignment tools, the conclusions may not be directly applicable to users' sequences. Furthermore the user usually does not know for sure whether those poorly aligned regions produced by the alignment tools are indeed non-homologous or just due to inappropriate tools or scoring systems used. Consequently, if some homologous regions are unaligned, the estimated evolution distances of these sequences may be inaccurate and therefore the constructed phylogenetic trees may be incorrect. Facing this problem, the user may have to try different tools or scoring systems to evaluate the correctness and accuracy of alignment results in the initial stage of sequence analysis. On the other hand, new alignment tools are released continually. Users may want to compare these newly released tools with those that they are most familiar with. Thus, it is desirable and most useful to have a visualization system that provides a <it>direct </it>and efficient method and can assist users to cross compare and inspect alignment results obtained by different MSA tools especially at the initial stage of sequence analysis.</p>
         <p>In recent years, a number of visualization tools have been released in the public domain. These tools can be roughly divided into two categories: integrated genome/sequence browser and individual alignment result visualization. In the former category, such as UCSC ENCODE project <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, UCSC human genome browser <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, <it>Ensembl </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, ECR Browser <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>, users can view alignment results mapped onto the sequenced genomes. Some of these browsers also provide registered users to submit alignment results and see the conservation regions between different genomes. In the latter category, the tools are developed to visualize individual alignment results. The VISTA-related tools are among the famous ones that have been developed for several years <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. mVISTA is a set of programs for comparing DNA sequences from two or more species up to megabases long and visualize these alignments with annotation information <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. rVISTA (regulatory Vista) combines database searches for transcription factor binding sites with a comparative sequence analysis <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. GenomeVISTA compares users' sequences with several whole genome assemblies <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Phylo-VISTA analyzes alignments of multiple DNA sequences from different species while considering their phylogenetic relationships <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. In general, the VISTA family of tools provides users with a novel graphical user interface (GUI) to view alignment results from different viewpoints. In addition to the VISTA family, PipMaker <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B45">45</abbr></abbrgrp>, and zPicture <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> are also popular visualization tools for sequence or genomes alignment results. All of these tools are web-based with friendly user interfaces, and allow users to easily visualize alignment results with annotations. However, these tools are limited solely to single alignment results. The capability of simultaneously comparing multiple results from different alignment tools or different parameters of a scoring system, such as changing match rewards or mismatch penalties, is notably lacking.</p>
         <p>In this article, we present a versatile alignment visualization system, SinicView (Sequence-aligning INnovative and Interactive Comparison VIEWer), which enables users to efficiently compare and evaluate assorted alignment results obtained by different tools. SinicView for the present calculates similarity of the alignment outputs under a fixed window using the sum-of-pairs method and provides scoring profiles of each set of aligned sequences. Other scoring matrices, such as EMBOSS DNA scoring matrix <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and YASS <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, are also provided in SinicView for users to select. Besides, users can also upload their preferable scoring matrices to calculate the scoring profile curves. Users can visually compare alignment results either in graphic scoring profiles or in plain text format of the aligned nucleotides. In addition, the information about alignment gaps and sequence annotations is also presented. The real-time juxtaposition of the visualization results from different MSA programs would bring more insights into the evaluation process. With SinicView, users can use their own sequences to survey and compare various multiple alignment tools and thus to unveil their merits (and shortcomings). Moreover, the cross-tools comparison can provide users more confidence in their final alignment results especially for those poorly aligned regions.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>There are three viewing sections in SinicView: Global View, Detailed View, and Information View (including annotations and gaps.) The Global View section shows the whole percent identity plots that calculate the sum-of-pair scores based on one specified reference sequence. In the Detailed View section, the panels show the whole percent identity plots of different alignment results individually. By observing the graphical results, it is much more intuitive and straightforward to judge the consistency of the alignment results. When the sliding window is less than 100 base pairs, the Detailed View section will automatically switch from the curve-based plot to the display of the detailed alignments in a colored text format where identical characters are shown. The Information View section containing annotation and gap information is stacked beneath the Detailed View section. SinicView also provides several global comparison charts that can assist biologists to choose the best alignment result among those produced by the programs under consideration. SinicView is implemented entirely in Java language to ensure portability across major platforms and is accessible with a web browser and Internet connection. The main features of SinicView are summarized as follows:</p>
         <p>1. Visualization of the scoring distribution of alignment results in a curve-based graphic format;</p>
         <p>2. Generation of the comparison charts using stacked-bar and pie charts, which shows the distribution of the identical rates among various alignment programs for benchmarking purposes;</p>
         <p>3. Inclusion of a versatile manipulative functionality (gap-display toggling, drag-and-drop zooming/shifting, and graphic/text display toggling);</p>
         <p>4. Visualization of annotation information and display of the phylogenetic trees provided by users in which the drawing tree program uses the ATVtree <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>;</p>
         <p>5. Visualization of detailed text alignments results;</p>
         <p>6. Capability to export the visualization results to portable image files.</p>
         <p>In what follows, we will introduce the characteristics and functionality of SinicView in more detail.</p>
         <sec>
            <st>
               <p>Manipulative operations in SinicView</p>
            </st>
            <p>SinicView offers a series of manipulative and navigational controls, such as zooming, shifting, and gap/annotation toggling. As shown in Figure <figr fid="F1">1</figr>, SinicView displays the alignment results obtained by three different MSA methods. The input sequences contain the orthologous regions around the Stem Cell Leukemia (SCL) gene in five vertebrate species: human, mouse, chicken, pufferfish and zebrafish. The buttons and text-field boxes of manipulative functions are located on top of the frame. Users can manually input numerical values or click on the highlighted colored region in the Global View section that specifies the zooming or shifting factors in a drag-and-drop fashion. When the highlighted region is clicked and dragged, the equivalent of a shift action will be performed and the display region can be resized by adjusting the edge of the highlighted area.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The screenshot shows the user interface of SinicView</p>
               </caption>
               <text>
                  <p><b>The screenshot shows the user interface of SinicView. </b>The alignment result is of the SCL gene regions in human, mouse, chicken, pufferfish, and zebrafish. Three alignment results of five sequences aligned by ClustalW, MAVID, and MLAGAN are shown.</p>
               </text>
               <graphic file="1471-2105-7-103-1"/>
            </fig>
            <p>SinicView can display more than one alignment result obtained by different alignment programs (either pairwise or multiple ones.) The assorted mixed-color span under the Global View panel shows among the alignment tools used the preferred aligner, which generates comparatively better results on the spot. Each of the aligners is denoted by a pre-defined color with the "performance color" label right next to the name of the tool.</p>
         </sec>
         <sec>
            <st>
               <p>Multi-panel functionality in SinicView</p>
            </st>
            <p>In the Detailed View section, the Percent Identity Plot (PIP) panels show, from top to bottom, the similarity curves of the alignment results obtained by different programs, along with the names of the alignment tools. In the Information View section, the Gap &amp; Annotation panels (in pink and gray) display the information of annotations provided by users, and gaps of aligned sequences. The information and similarity ratios can also be displayed as the current scan-line (i.e. cursor) moves. The boxes in maroon denote the annotation area and the horizontal line represents the original sequences interleaved with inserted gaps (light gray areas.) The gap display can be toggled on or off via the checkbox on the right.</p>
            <p>Because different alignment results are usually of different lengths, it is not plausible to compare these results base-pair by base-pair. In SinicView, therefore, we let users select one of input sequences as a <it>reference </it>and then calculate the sum-of-pair scores of each base pair in the reference within a fixed window. For example, each alignment result in the PIP panels at the scan-line position corresponds to human sequence, selected as the reference in Figure <figr fid="F1">1</figr>. When the user selects different sequences as the reference, SinicView can demonstrate the variations between the PIP curves of the alignment results.</p>
         </sec>
         <sec>
            <st>
               <p>Visualization of SinicView: comparison chart and text-mode comparison</p>
            </st>
            <p>The functionality under the "Tools" menu, called "Comparison Charts", offers two types of charts for quick-and-easy evaluation of the alignment quality. The stacked bar chart, in Figure <figr fid="F2">2</figr>, illustrates the distribution of the identical rates with the threshold over 40%. The pie chart, on the other hand, displays the distribution of the identical rates from 0 to 100 percent based upon a selected alignment program. The statistics on which these charts are based can also be displayed in a tabulated text form.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The tools menu functions</p>
               </caption>
               <text>
                  <p><b>The tools menu functions. </b>Two comparison charts can be generated by SinicView: the stacked-bar chart illustrates the proportion comparison of cross alignment results and the pie chart shows the proportion of different identical rates of an individual alignment result. The complete data of the charts are tabulated on the left.</p>
               </text>
               <graphic file="1471-2105-7-103-2"/>
            </fig>
            <p>SinicView also provides a plain-text view of the alignment results in the Detailed View section when the sliding window size is less than 100 aligned base pairs. As shown in Figure <figr fid="F3">3</figr>, the plain-text alignment results replace the percent identity curves and the fully identical bases in a column are labeled in red blocks. Thus, users can check the correctness of detailed alignment results base pair by base pair.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The detailed text display of the different alignment results</p>
               </caption>
               <text>
                  <p><b>The detailed text display of the different alignment results. </b>The matched identical sequences are labeled in red blocks. Interestingly, all three results do not contain consistent matching alignments in this case.</p>
               </text>
               <graphic file="1471-2105-7-103-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Installation and execution of the standalone SinicView</p>
            </st>
            <p>The applet version can be accessed via any JRE (Java Runtime Environment)-enabled browsers with Internet connection, thus making the installation and choosing the right platform hassle-free. However, the ease of running SinicView on-the-go cannot accommodate the bandwidth requirement in case of huge amount of sequence data involved. Hence, we have also implemented a standalone application of SinicView, which is wrapped in JRE, for off-line use.</p>
            <p>The execution procedure of the standalone SinicView is quite straightforward. Upon launch, the user will be prompted three options. The first two are to read user's Phylogenetic Tree files, an option, and MSA results from the local disk.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>In what follows, we will introduce two examples to demonstrate how SinicView can assist users to analyze alignment results in the initial stage of sequence comparison. The total alignment lengths in both of the examples are few hundreds of thousands of base pairs and several millions of base pairs, respectively. The conservations of the aligned sequences are different in each example. More examples can be found in <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
         <sec>
            <st>
               <p>Example 1: SCL (Stem Cell Leukemia) gene</p>
            </st>
            <p>The Stem Cell Leukemia (SCL) gene plays a critical role in normal processes that, when disrupted, can result in leukemia. The <it>SCL </it>gene, also known as <it>tal-1</it>, encodes a basic helix-loop-helix transcription factor that is pivotal for the normal development of all hematopoietic lineages, and is highly conserved between mammals and zebrafish <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. Previous analyses of the SCL genes in five vertebrate genomes, including human, mouse, chicken, pufferfish, and zebrafish, have revealed that the SCL promoter/enhancer motifs are conserved in all five species <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The alignment and visualization tools used in their analyses included BLAST <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, PipMaker <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, and DiAlign <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Shah et al. (2004) realigned these gene regions in five species by a pairwise alignment tool, LAGAN <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and demonstrated the alignment result by Phylo-VISTA <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. In this paper, we also downloaded these sequences and realigned them by the multiple alignment tools: ClustalW, MAVID and MLAGAN. The lengths of the human, mouse, chicken, pufferfish, and zebrafish sequences are approximately 100 kb, 65 kb, 67 kb, 22 kb, and 8 kb, respectively.</p>
            <p>Figure <figr fid="F4">4(a)</figr> shows the global view of the results obtained by three alignment tools using the human sequence as the reference. Generally speaking, the highest conserved region located at 30 k bp of human sequence is all well aligned by these three tools. But the highest identical rates of the alignment by ClustalW are lower than those by either MLAGAN or MAVID. Moreover, the total quantity of the result obtained by MLAGAN is better than those by both ClustalW and MAVID while the quantity of the result obtained by ClustalW is better than those by the others, as shown in Figure <figr fid="F4">4(b)</figr>. Interestingly, when we selected the zebrafish sequence as the reference, the result obtained by ClustalW shows the highest conserved region located at around 27.5 k bp whereas those by both MAVID and MLAGAN show it at around 45.89 k bp, as shown in Figure <figr fid="F4">4(c)</figr>. The comparison reveals that the region at around 27.5 k bp in the zebrafish sequence will be assumed the homologous region by ClustalW. But according to MAVID and MLAGAN, the homologous regions are located at around 45.89 k bp rather than at 27.5 k bp. This ambiguous result may be caused by segmental duplication in the sequences and by difference in alignment strategy. In this case, more advanced or further inspections should be performed to either check the detailed alignment results in both regions or realign these sequences by using other pairwise or local alignment tools.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The comparison of different alignment results of SCL gene regions</p>
               </caption>
               <text>
                  <p><b>The comparison of different alignment results of SCL gene regions. </b>(a) The comparison of three alignment results by SinicView while using the human sequence as the reference. (b) The whole (non-equalization) and equalization stacked-bar charts generated by SinicView illustrates the proportion comparison of cross alignment results. (c) Using zebrafish as the reference, the highest conserved region (around 62%) produced by ClustalW concentrates around at 27.5 k bp. However, there are discrepancies between the result of ClustalW and those of MAVID and MLAGAN.</p>
               </text>
               <graphic file="1471-2105-7-103-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Example 2: The greater CFTR region</p>
            </st>
            <p>The cystic fibrosis transmembrane conductance regulator (CFTR) gene is responsible for the cystic fibrosis disorder that spans approximately 190 k bp of genomic DNA and consists of 27 exons <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The greater CFTR region is defined as a genomic segment of about 1.8 M bp on human chromosome 7q31.3 containing the CFTR gene and nine other genes, including TES1, CAV1, CAV2, MET, CAPZA2, ST7, WNT2, GASZ, and CORTBP2 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The comparative analysis of this region in 13 vertebrate species has been reported in Thomas et al., 2003 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> in which the alignment tool used was BlastZ on PipMaker Web server <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. In this paper, we downloaded the sequences of four mammalian species, including human, baboon, dog, and mouse, from the NIH Intramural Sequencing Center (NISC) Website <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. However, the original sequences had been updated in other genome browsers. Thus, we eventually downloaded the last versions of these sequences from the UCSC Genome Browser. The lengths of these sequences are from 1.0 M bp to 1.5 M bp. We realigned these sequences by MLAGAN, MAVID, and TBA (kernel: MULTIZ) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and the total number of bases of the final alignment results, including gaps, are approximately 12 M bp, 11 M bp, and 7.5 M bp, respectively.</p>
            <p>Figures <figr fid="F5">5(a)</figr> and <figr fid="F5">5(b)</figr> show the global PIP curves and their detailed views of three alignment results, respectively. In general, most of high identity regions are well and consistently aligned by these three programs. But those not as high identities are not reported by TBA because the kernel of this program, MULTIZ, is based on the local alignment results by BlastZ. As shown in Figure <figr fid="F5">5(c)</figr>, the stacked-bar charts show the quality and the quantity of these alignment results where the average identical rates for TBA are somewhat better than those for MLAGAN and MAVID although the total number of aligned conserved regions for MLAGAN is larger than those for the others.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The comparison of different alignment results of great CFTR gene regions</p>
               </caption>
               <text>
                  <p><b>The comparison of different alignment results of great CFTR gene regions. </b>The cross comparison of three alignment results by SinicView. (a) The whole scale PIP curves using the human one as reference. (b) The detailed view of (a). (c) Comparison of the results in the whole and equalization stacked-bar charts. (d) Comparison of the results in the pie charts.</p>
               </text>
               <graphic file="1471-2105-7-103-5"/>
            </fig>
            <p>For comparisons of these alignments from a functional viewpoint, we downloaded the annotation of the human sequence, including exons and repeats, from the Ensembl Genome Browser <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The detailed comparisons of the alignment results by different aligners demonstrated that the alignments of noncoding regions are often inconsistent. But for the coding regions, the alignment results by different aligners seem consistent and well-aligned.</p>
            <p>Figures <figr fid="F6">6(a)&#8211;(b)</figr> show the detailed alignment results at four different intervals. In Figure <figr fid="F6">6(a)</figr>, we find that some conserved regions are not aligned by TBA but identified by MLAGAN and MAVID. This region is annotated by repeats and implies that some repetitive elements were inserted into these sequences of their common ancestor. However, this conserved insertion event could not be observed by using TBA. Although the kernel of TBA, MULTIZ, is known not to align regions with repetitive elements, we still find that some other regions with repetitive elements are aligned by this program, as shown in Figure <figr fid="F6">6(b)</figr>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>The detailed comparison of Example 2</p>
               </caption>
               <text>
                  <p><b>The detailed comparison of Example 2. </b>The detailed comparison of different alignment results of great CFTR gene regions at different intervals. (a) From 786,112 bp to 836,774 bp. (b) From 1,500,792 bp to 1,523,689 bp. (c) From 1,583,342 bp to 1,621,404 bp. (d) From 1,623,603 bp to 1,644,063 bp.</p>
               </text>
               <graphic file="1471-2105-7-103-6"/>
            </fig>
            <p>Generally speaking, the regions aligned by TBA usually have higher identical rates than by others. As the frames shown in red in Figures <figr fid="F6">6(c)</figr> and <figr fid="F6">6(d)</figr>, the alignment of these regions by TBA seems superior to those by others. However, the kernel of TBA, MULTIZ, usually neglects to align the regions with low conservations. Thus, some lowly conserved regions may not be aligned by TBA.</p>
            <p>Since each alignment tool has its own advantage and reveals different alignment results, we therefore wonder whether a better alignment result can be generated by hybridization of these alignment tools.</p>
         </sec>
         <sec>
            <st>
               <p>Loading performance and platforms test</p>
            </st>
            <p>SinicView is implemented totally in Java. Theoretically, it should be portable across different operating systems (OSs) and platforms. To demonstrate interoperability on real cases, we tested the applet and application versions of SinicView on different platforms and OSs. As shown in Table <tblr tid="T1">1</tblr>, both versions of SinicView seem to perform well. Thus, users can use either the applet version or the standalone application of SinicView, according to their requirements.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The test results of the applet version and standalone application of SinicView on different platforms and OS's</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Applet</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Standalone Application</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Specification (Applet)</p>
                     </c>
                     <c ca="center">
                        <p>Status</p>
                     </c>
                     <c ca="center">
                        <p>Specification (Application)</p>
                     </c>
                     <c ca="center">
                        <p>Status</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sun OS</p>
                     </c>
                     <c ca="center">
                        <p>OS : Sun OS 5.7 Sparc</p>
                        <p>JVM : java_1.4.2_08</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>OS : Sun OS 5.7 Sparc</p>
                        <p>JVM : java_1.4.2_08</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mac OS</p>
                     </c>
                     <c ca="center">
                        <p>OS : Mac OS 10.4.2 Tiger</p>
                        <p>JVM : java_1.4.2_08</p>
                        <p>java_1.5 update 4</p>
                        <p>Browser : Safari 2.0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>OS : Mac OS Tiger 10.4.2</p>
                        <p>java_1.5 update 4</p>
                        <p>java_1.5 update 4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Linux/Unix</p>
                     </c>
                     <c ca="center">
                        <p>OS : Linux Fedora Core 3</p>
                        <p>JVM : java_1.4.2_08 Browser :</p>
                        <p>Mozilla Firefox 1.0.2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>OS : Linux Fedora Core 3</p>
                        <p>JVM : java_1.4.2_08</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Windows</p>
                     </c>
                     <c ca="center">
                        <p>OS : Windows XP Service Pack 2</p>
                        <p>JVM : java_1.4.2_08</p>
                        <p>java_1.5 update 4</p>
                        <p>Browser : Internet Explorer 6.0</p>
                        <p>Mozilla Firefox 1.0.4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>OS : Windows XP Service Pack 2</p>
                        <p>JVM : java_1.4.2_08</p>
                        <p>java_1.5 update 4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>OK</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Besides, we also tested the loading performance of SinicView. Because the performance of an applet on the Web is strongly dependent on the network bandwidth and traffic, the estimation of loading time may not be a fair comparison. Thus, in this part we only estimated the loading performance of the standalone application of SinicView.</p>
            <p>In general, the loading performance of a Java application is dependent on the memory heap size. The default values of the initial heap size and the maximum size of a Java Virtual Machine (java_1.4.2 version or higher) are 4 M (mega) bytes and 64 M bytes, respectively. These values can be adjusted by the following command in the terminal mode:</p>
            <p>java -Xms64m -Xmx128m -jar SinicView.jar,</p>
            <p>where the parameters Xms64m and Xmx128m represent that the initial heap size is 64 M bytes and the maximum size is 128 M bytes, respectively. Thus, we used different input data sizes, initial heap sizes, and the maximum sizes to estimate the loading time of SinicView. As shown in Table <tblr tid="T2">2</tblr>, using the default maximum heap size, 64 M bytes, the standalone SinicView can handle up to approximately 11 M bytes alignment data. If the maximum size is set up to 256 M bytes, the loading ability of input data size could be over several dozens of mega bytes. Moreover, Table <tblr tid="T2">2</tblr> shows that the maximum data size is dependent on the maximum heap size and the loading times are linearly dependent on the sizes of input data. All performance test results were benchmarked on a 3 GHz Pentium4 PC with 1 GB RAM.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The loading performance of standalone SinicView The loading time of standalone SinicView by different sizes of input data and initial and maximum memory heap sizes. The default value for the initial JVM heap size is 4 M bytes; maximum is 64 M bytes. For the maximum 64 M byte heap size, the standalone SinicView can handle up to approximately 11 M byte alignment data. The maximum value of the input data size is linear in the maximum heap size. We observe that the initial heap memory size has little impact on the loading time. This result was benchmarked on a 3 GHz Pentium4 PC with 1 GB RAM.</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Input data size (bytes)</p>
                     </c>
                     <c cspan="5" ca="right">
                        <p>Loading Time (sec) Java Application Virtual Machine Memory Heap Size, Initial/Max (M Bytes)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>64 MB/64 MB</p>
                     </c>
                     <c ca="center">
                        <p>128 MB/128 MB</p>
                     </c>
                     <c ca="center">
                        <p>64 MB/256 MB</p>
                     </c>
                     <c ca="center">
                        <p>128 Mb/256 MB</p>
                     </c>
                     <c ca="center">
                        <p>256 MB/256 MB</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.5 M</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1 M</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5 M</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10 M</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                     <c ca="center">
                        <p>53</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20 M</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>104</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                     <c ca="center">
                        <p>105</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40 M</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>214</p>
                     </c>
                     <c ca="center">
                        <p>212</p>
                     </c>
                     <c ca="center">
                        <p>212</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>NA: Not available.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Repetitive elements in sequence alignments</p>
            </st>
            <p>The eukaryotic genome is usually characterized by the presence of repetitive DNA consisting of nucleotide sequences of various lengths and compositions that occur from a few times to thousands of times in the genome either in tandem or in a dispersed fashion<abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. The repetitive fractions can be classified into two types of repeated families: localized and dispersed <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. Localized repetitive sequences usually occur as tandem arrays and they are called tandem repetitive DNA. Dispersed repetitive sequences are dispersed throughout the genome. In addition, there are moderately repetitive sequences, which are usually transposable elements or processed pseudogenes and are usually dispersed over the genome. <it>Alu </it>is the largest family of interspersed mobile elements (~300 bp) and propagated to more than one million copies in primate genomes. This type of repeat has been inserted into these genomes within the last 65 million year period <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Because this type of repetitive elements only appears in the primate genomes, when we align homologous sequences of primate and non-primate genomic sequences, these <it>Alu </it>inserted regions should not be aligned. However, other interspersed elements may possibly have been inserted into the ancestral sequence of mammalians. The regions of these repeats may be able to align together between the sequences of different mammalians, as shown in Example 2. However, these regions in the alignment results by different aligners are inconsistent. Since these repetitive elements in sequences could be detected by RepeatMasker <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, the poorly aligned regions may have to be checked whether they belong to repetitive elements.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative approach for alignment validity</p>
            </st>
            <p>As the comparison results using SinicView show, the alignments of sequences using different MSA tools are inconsistent. We begin to wonder whether the computational results obtained by different tools may in fact lead to different findings. For identification of alignment correlation, a need for additional checks of alignment validity by using different tools and scoring systems has been recognized in the literature <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Thus, a cross comparison approach along with visualization could provide an efficient and easy way for general users to verify and validate the alignment results as to whether the aligned regions are reasonable and whether those poorly aligned regions are indeed non-homologous.</p>
         </sec>
         <sec>
            <st>
               <p>How to decide on a "good" alignment result</p>
            </st>
            <p>Except evaluation of the alignment quality by comparison charts in SinicView, how to decide on a good alignment with biological meanings may need much more experiences and knowledge. Sometimes, this judgment depends also on what kind of the biological problems users want to study. Here, we suggest some general rules for users to judge the alignments by biological meanings.</p>
            <p>In the coding regions, a triplet of adjacent nucleotides constitutes a codon. Usually, the first two nucleotides are identical between the two sequences and allow the third one to be either identical or different. Thus, when the partial alignment results reveal the two-out-of-three regularity for each triplet, it may imply that the aligned regions are potential coding regions. This alignment result should be more biologically meaningful than those without the two-out-of-three regularity.</p>
            <p>From molecular evolutionary viewpoint, nature prefers inserting or deleting considerable consecutive nucleotides together to interspersed individual nucleotides <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Thus, an alignment with consecutive gaps would be better than those with interspersed gaps.</p>
            <p>If one of the alignment sequences has been annotated, the information is definitely useful for users to judge the alignment results by different aligners.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative environment to promote new alignment tools</p>
            </st>
            <p>It is not easy to promote newly developed tools because users usually cannot directly compare the new tools with the traditional ones. With SinicView, users can compare the alignment results obtained by different tools and select an appropriate one for further analysis. Thus, if the new tool can align more regions than those by the old ones and can also indicate their statistical significances, it will be welcomed and better received by the community. We would like to make SinicView available to the community of computational biologists. In addition to helping the user find a most appropriate alignment tool to use, SinicView may also be used to check whether previously obtained alignment results by different tools are worth a re-investigation, and see if this revisit of alignment results would lead to different conclusions.</p>
         </sec>
         <sec>
            <st>
               <p>Further possible enhancements for SinicView</p>
            </st>
            <p>The capability of fine-tuning parameters relevant to the alignment process will be made available in a user-friendly interface. Furthermore, the ability to allow plug-ins of more alignment programs, in addition to the currently pre-selected ones, such as ClustalW, MAVID, MLAGAN, and GS-Aligner, will inevitably broaden the usage of SinicView. The issue of the compatibility of the input and output formats for each alignment tool also needs to be resolved. For example, both MAVID and MLAGAN require the phylogenetic tree data as input, but ClustalW does not. The ordering of the outputs of these aforementioned tools is usually switched without notice. Thus, to be able to work under a unified comparison framework requires further processing of these outputs. Besides, identifying a standard-bearer mechanism is still a challenge in entrusting existing alignment programs. So far, we have used the "sum-of-pairs" method to define the "identical rate" in each alignment result. In the future, we may provide other criteria for users to use to measure their alignment results, in addition to what have been already provided in SinicView.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Deluged by the increasing number of completed genomic sequences, biologists have encountered a challenge of aligning more and much longer sequences from divergent species. Thus, the need to align longer sequences, like mega base-pair sequences or even genome-scale sequences, and evaluate the alignment results becomes more urgent. In this paper, we have presented a visualization tool for comparison of multiple sequence alignment programs. With a standard simple protocol for the input/output format, it is quite easy for users to upload their own alignment programs to SinicView. The performance of SinicView depends on the system's internal memory. In a 64 M RAM JAVA environment, SinicView can load and visualize several mega bases alignment results. Users can easily perform sequence alignment by employing multiple alignment tools and visualize the results on the fly by SinicView. More information can be found at <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: 1. Development of Novel Large-scale Sequence Alignment and Visualization Tools and Their Applications to Bioinformatics</p>
         <p>2. Development of a web-based personalized research environment for study of computational and evolutionary genomics</p>
         <p>Project home page: <url>http://biocomp.iis.sinica.edu.tw</url></p>
         <p>Operating system(s): Window XP, Sun OS 5.7 Sparc, Mac OS 10.4.2 Tiger, and Linux Fedora Core 3</p>
         <p>Programming language: Java</p>
         <p>Other requirements: Java 1.4.2 or higher</p>
         <p>License: Any restrictions to use by non-academics: free downloads and usage for academics only.</p>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations</p>
         </st>
         <p><b>SinicView</b>: Sequence-aligning INnovative and Interactive Comparison VIEWer</p>
         <p><b>JRE</b>: Java Runtime Environment</p>
         <p><b>SCL</b>: Stem Cell Leukemia</p>
         <p><b>CFTR</b>: Cystic Fibrosis Transmembrane Conductance Regulator</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>Arthur Chun-Chieh Shih and D.T. Lee contributed the original idea, developed the system organization, and drafted the paper. Laurent Lin supervised the system implementation and also drafted some parts of the paper. Chin-Lin Peng, Yu-Wei Wu, Chun-Yi Wong, Meng-Yuan Chou, and Tze-Chang Shiao implemented the codes. Shiang-Heng Chen and Mu-Fen Hsieh implemented some partial codes before leaving their positions.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Dr. Feng-Chin Chen and Dr. Huai-Kuang Tsai for valuable discussions and Mr. Hung-Yi Chen for his assistance in organizing some alignment results. We also thank the anonymous reviewers for their comments and suggestions that help improve the presentation of this paper. This work was supported by the National Science Council of Taiwan under the grants No. NSC-92-3112-B-001-018-Y, NSC-92-3112-B-001-021-Y, NSC-93-3112-B-001-018-Y, NSC93-3112-B-001-023-Y, NSC-94-2213-E-001-029, and NSC 93-2752-E-002-005-PAE, and by the Institute of Information Science, and the Genomics Research Center of Academia Sinica in Taiwan.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The sequence of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Mural</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanatides</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ballew</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Huson</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kodira</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>XH</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Skupski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gabor Miklos</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Broder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Nadeau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>McKusick</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Zinder</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Slayman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hunkapiller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bolanos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dew</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Fasulo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Flanigan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Florea</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hannenhalli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kravitz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mobarry</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Reinert</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Remington</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Abu-Threideh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Beasley</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Biddick</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bonazzi</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Brandon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cargill</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chandramouliswaran</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Charlab</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chaturvedi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Di Francesco</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Eilbeck</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Evangelista</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gabrielian</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Gan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Guan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Heiman</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Ji</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Ke</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Lei</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Merkulov</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Milshina</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Naik</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Narayan</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Neelam</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nusskern</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rusch</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Shue</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wides</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhong</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zhong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baumhueter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Spier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cravchik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Woodage</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ali</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Awe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baden</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Barnstead</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barrow</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Beeson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Busam</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Carver</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Center</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Curry</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Danaher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Davenport</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Desilets</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dietz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Doup</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ferriera</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Garg</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gluecksmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Haynes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haynes</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Heiner</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hladun</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hostin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Houck</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Howland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ibegwam</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kalush</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kline</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koduru</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Love</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McCawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McIntosh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McMullen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Moy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pfannkoch</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pratts</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Puri</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Qureshi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reardon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rodriguez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Romblad</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ruhfel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sitter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Smallwood</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Strong</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Suh</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tint</snm>
                  <fnm>NN</fnm>
               </au>
               <au>
                  <snm>Tse</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vech</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wetter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Windsor</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Winn-Deen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zaveri</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zaveri</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sjolander</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Karlak</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kejariwal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lazareva</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hatton</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Narechania</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Diemer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Muruganujan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bafna</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Istrail</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lippert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Walenz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yooseph</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Basu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baxendale</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blick</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Caminha</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carnes-Stine</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Caulk</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Coyne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dahlke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mays</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dombroski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ely</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Esparham</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fosler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gire</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Glanowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Glasser</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Glodek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gorokhov</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gropman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heil</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hoover</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kasha</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kagan</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kraft</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Levitsky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Majoros</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>McDaniel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Newman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nodell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Peck</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rowe</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sanders</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sprague</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stockwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zandieh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <issue>5507</issue>
            <fpage>1304</fpage>
            <lpage>1351</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1058040</pubid>
                  <pubid idtype="pmpid" link="fulltext">11181995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Initial sequencing and analysis of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dewar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Funke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Heaford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Howland</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lehoczky</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>LeVine</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McEwan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McKernan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meldrim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mesirov</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Miranda</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rosetti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Santos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sheridan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sougnez</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Stojanovic</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wyman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sulston</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Burton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Deadman</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Deloukas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dunham</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dunham</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>French</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grafham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gregory</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Humphray</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lloyd</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McMurray</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mercer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Milne</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mullikin</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Plumb</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ross</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shownkeen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sims</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>McPherson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Mardis</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Pepin</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Chissoe</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Wendl</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Delehaunty</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Miner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Delehaunty</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kramer</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Minx</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Hawkins</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Branscomb</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Predki</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wenning</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Slezak</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Doggett</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lucas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Elkin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Uberbacher</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Frazier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Muzny</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Bouck</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Sodergren</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Worley</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Rives</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Gorrell</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Metzker</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Kucherlapati</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fujiyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yada</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Toyoda</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kawagoe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Totoki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Heilig</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Saurin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Artiguenave</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brottier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bruls</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Pelletier</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Doucette-Stamm</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rubenfield</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Dubois</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rosenthal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Platzer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nyakatura</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Taudien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rump</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rowen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Madan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Federspiel</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Abola</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Proctor</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Schmutz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dickson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grimwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Kaul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kawasaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Minoshima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Athanasiou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ramser</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reinhardt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>de la Bastide</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dedhia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Blocker</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hornischer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nordsiek</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Eichler</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Galagan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Harmon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hokamp</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kaspryzk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Kitts</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Korf</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kulp</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lancet</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>McLysaght</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Mulder</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pollara</snm>
                  <fnm>VJ</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Stupka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Szustakowski</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wallis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Yeh</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Guyer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Felsenfeld</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wetterstrand</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Patrinos</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Morgan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Szustakowki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Catanese</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Osoegawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shizuya</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <issue>6822</issue>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Antonarakis</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barlow</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bloom</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Botcherby</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Burton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chiaromonte</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Delehaunty</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Deri</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dermitzakis</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Dewey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dickens</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dodge</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Elnitski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Emes</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Eswara</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Eyras</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Felsenfeld</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fewell</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Flicek</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Foley</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Frankel</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Gage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Glusman</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gnerre</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Goodstadt</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grafham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Graves</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Gregory</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Guyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hlavina</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Holzer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hua</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jaffe</sn