<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-6-S4-S22</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>NemaFootPrinter: a web based software for the identification of conserved non-coding genome sequence regions between <it>C. elegans </it>and <it>C. briggsae</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Rambaldi</snm>
               <fnm>Davide</fnm>
               <insr iid="I1"/>
               <email>davide.rambaldi@ifom-ieo-campus.it</email>
            </au>
            <au id="A2">
               <snm>Guffanti</snm>
               <fnm>Alessandro</fnm>
               <insr iid="I1"/>
               <email>alessandro.guffanti@ifom-ieo-campus.it</email>
            </au>
            <au id="A3">
               <snm>Morandi</snm>
               <fnm>Paolo</fnm>
               <insr iid="I1"/>
               <email>paolo.morandi@ifom-ieo-campus.it</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Cassata</snm>
               <fnm>Giuseppe</fnm>
               <insr iid="I1"/>
               <email>giuseppe.cassata@ifom-ieo-campus.it</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>IFOM-FIRC Institute of Molecular Oncology Foundation, Milan, Italy</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <supplement>
            <title>
               <p>Italian Society of Bioinformatics (BITS): Annual Meeting 2005</p>
            </title>
            <editor>Rita Casadio, Alessandro Guffanti, Manuela Helmer-Citterich, Giancarlo Mauri, Luciano Milanesi, Graziano Pesole, Cecilia Saccone and Giorgio Valle</editor>
            <note>Research articles</note>
         </supplement>
         <conference>
            <title>
               <p>Italian Society of Bioinformatics (BITS): Annual Meeting 2005</p>
            </title>
            <location>Milan, Italy</location>
            <date-range>17&#8211;19 March 2005</date-range>
            <url>http://bioinformatics.it/</url>
         </conference>
         <issn>1471-2105</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>Suppl 4</issue>
         <fpage>S22</fpage>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16351749</pubid>
               <pubid idtype="doi">10.1186/1471-2105-6-S4-S22</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>1</day>
               <month>12</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Rambaldi et al; licensee BioMed Central Ltd</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>NemaFootPrinter (Nematode Transcription Factor Scan Through Philogenetic Footprinting) is a web-based software for interactive identification of conserved, non-exonic DNA segments in the genomes of <it>C. elegans </it>and <it>C. briggsae</it>. It has been implemented according to the following project specifications:</p>
               <p>a) Automated identification of orthologous gene pairs.</p>
               <p>b) Interactive selection of the boundaries of the genes to be compared.</p>
               <p>c) Pairwise sequence comparison with a range of different methods.</p>
               <p>d) Identification of putative transcription factor binding sites on conserved, non-exonic DNA segments.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Starting from a <it>C. elegans </it>or <it>C. briggsae </it>gene name or identifier, the software identifies the putative ortholog (if any), based on information derived from public nematode genome annotation databases. The investigator can then retrieve the genome DNA sequences of the two orthologous genes; visualize graphically the genes' intron/exon structure and the surrounding DNA regions; select, through an interactive graphical user interface, subsequences of the two gene regions. Using a bioinformatics toolbox (Blast2seq, Dotmatcher, Ssearch and connection to the rVista database) the investigator is able at the end of the procedure to identify and analyze significant sequences similarities, detecting the presence of transcription factor binding sites corresponding to the conserved segments. The software automatically masks exons.</p>
            </sec>
            <sec>
               <st>
                  <p>Discussion</p>
               </st>
               <p>This software is intended as a practical and intuitive tool for the researchers interested in the identification of non-exonic conserved sequence segments between <it>C. elegans </it>and <it>C. briggsae</it>. These sequences may contain regulatory transcriptional elements since they are conserved between two related, but rapidly evolving genomes. This software also highlights the power of genome annotation databases when they are conceived as an open resource and the possibilities offered by seamless integration of different web services via the http protocol.</p>
               <p><b>Availability</b>: the program is freely available at <url>http://bio.ifom-firc.it/NTFootPrinter</url></p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Comparative genomics is a powerful bioinformatics methodology for the identification of conserved genomic DNA segments between two related organisms <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Alignment of DNA sequences from different species provides an effective tool to decode genomic information, based on the assumption that functional sequences tend to diverge at a slower rate than non-functional sequences. By comparing the genomic sequences of species at different evolutionary distances, it is possible, besides identifying coding sequences, to recognize conserved non-coding sequences with a potential regulatory function, and determine which sequences are unique for a given species. This procedure is called Phylogenetic Footprinting <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Alignment algorithms optimize these comparisons so that the regions, that diverge slowly, can be anchored together and highlighted against a background of more rapidly evolving DNA, that is devoid of any functional constraints <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. On a broader view, the identification of non-exonic Conserved Sequence Elements tags associated with human disease-related genes may open new venues for the interpretation of experimental data <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Although <it>C. elegans </it>and <it>C. briggsae </it>are almost identical in morphology and development <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, their genomes have diverged. Several estimates suggest that separation of the two species occurred 23&#8211;40 million years ago <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Conservation of DNA sequences is confined largely to protein-coding regions and short flanking sequences; functional conservation between these two species has also been demonstrated by rescue experiments of mutant phenotypes via DNA-mediated transformation <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>We developed an interactive web-based and user friendly software to help the researchers in the identification of conserved non-coding sequence regions between the genomes of the two nematodes <it>C. elegans </it>and <it>C. briggsae</it>, starting from a bio-computational project focused on identification of conserved segments in a single pair of orthologous genes.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>The program developed here is a research tool; hence the design has been sometimes bound to the functionality, in order to optimize the speed and easiness of interaction between all the components. A careful planning of all the required modules has been achieved, however we have used system analysis and design techniques for the most complex parts of this development project.</p>
         <p>Documentation has been targeted both at the user with a web page <url>http://bio.ifom-firc.it/NTFootPrinter/howto.html</url>, and to the programmer/maintainer of the software, with internal documentation on the scripts. We always relied on feedback from the interested scientist in designing the user interface and ameliorating the software functionality. From the programming language point of view, we adopted standard solutions that were fit to the problem (bioinformatics development).</p>
         <p>The following components have been used to develop the web application:</p>
         <p>1) A local mirror of Wormbase <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> database under mySQL. The genome annotation information in Wormbase is maintained in General Feature Format (GFF). GFF is a text-based format for the transfer of genome information, allowing genome researchers to develop tools and have them tested without having to maintain a complete feature-finding system. Documentation on this format is available at <url>http://www.sanger.ac.uk/Software/formats/GFF/GFF_Spec.shtml</url></p>
         <p>2) An EnsMart table to search orthologs. EnsMart <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> is a data retrieval tool that generates lists of biological objects (e.g. genes, SNPs) from data held in the Ensembl database. EnsMart uses the BioMart system <url>http://www.ebi.ac.uk/biomart/</url>.</p>
         <p>3) A Perl web interface used to retrieve gene and sequences, mask exons, launch analysis tools, and generate gene images (GD libraries). Other web based technologies are implemented inside the Perl code: cursor coordinates capture and client-side image maps are implemented in html and JavaScript.</p>
         <p>4) A graphical user interface implemented in Java (Swing Applet) for interactive selection of sub-sequences. Using HTTP POST and GET method, the Applet is able to communicate with other elements of the system.</p>
         <p>5) A collection of locally compiled C++ software: <b>dotmatcher </b>and <b>extractseq </b>(EMBOSS package), <b>blast two sequences </b>(NCBI), <b>ssearch </b>(part of the FASTA program suite written by William Pearson), <b>blastz</b>.</p>
         <p>6) An User Agent LWP connection (Perl library) to send subsequences and blastz alignment to a transcription factor database web server (<b>rVista</b>)</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>The main functional flow of the software can be summarized as follows (Figure <figr fid="F1">1</figr>): Starting from a <it>C. elegans </it>or <it>C. briggsae </it>gene name or identifier, the software identifies the putative ortholog (if any), based on information derived from one of the EnsMart datamart tables <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In this first step, the user has the opportunity to start from either an identifier ('gene model') or from a 'common gene name', following the <it>CGC </it>(<it>Caenorhabditis Genetics Center</it>) genetic nomenclature <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Web Interface Scheme</p>
            </caption>
            <text>
               <p><b>Web Interface Scheme. NemaFootPrinter scheme: </b>starting from gene name submission ('Start analysis' on the top-left of the scheme), gene name and organism are submitted to the GENEFINDER script (red-border box) that verifies if the given gene has an ortholog and displays the clone name. GENEFINDER also allows not-interactive selection of <it>n </it>base pairs upstream and downstream of the given genes. After gene-name retrieval the user can choose a display mode (green-border boxes): <b>TEXT-MODE </b>allows non-interactive selection of subsequences; the <b>SLIDER-MODE </b>uses an Applet Java to select interactively sub-sequences; the <b>FRAME-MODE </b>combines the slider and the result page on two horizontal frames. On the <b>RESULTS </b>page (blue-border boxes) users can find images of genes structure and boundaries generated on the fly, links to the sequences (FASTA format) and links to a series of bioinformatics tools (yellow-border boxes): <b>BLAST 2 sequences </b>is used for a first screening of the two sequences for similarities; <b>Dotmatcher </b>generates a dot plot and associate an image for direct graphical visualization of regions of similarity. By clicking on a point into the Dotmatcher image and extending the selection for <it>n </it>base pairs, it is possible to align small regions with the <it>Smith and Waterman </it>algorithm. One can then send Fasta files and Blastz alignments to the <b>rVista </b>server (the two subsequences and the relative <b>Blastz </b>alignment generated on the local server). Through this public database it is possible to identify the transcriptional regulatory elements, if any, associated with conserved subsequences.</p>
            </text>
            <graphic file="1471-2105-6-S4-S22-1"/>
         </fig>
         <p>The user can select a display mode:</p>
         <p>1. <b>TEXTMODE</b>: html output only</p>
         <p>2. <b>SLIDER MODE</b>: graphical visualization with a Java Applet (Figure <figr fid="F2">2</figr>)</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Select subsequences</p>
            </caption>
            <text>
               <p><b>Select subsequences. </b>CBrothersSlider.java is a small application written in Java to interactively display gene structures with intron/exon structure and to select subsequences. The interface display clone identifiers (A) and gene images generated "On-the fly" (B). Shifting the sliders (C) or submitting directly chromosome coordinates (D), the user is able to select a subsequence. A small control panel (E) can be used to select only the region on the left of the gene (<it>left </it>checkbox) or on the right of the gene (<it>right </it>checkbox); if both the checkboxes are selected, the application selects <ul>only</ul> the gene sequence. After sequence manipulation, using the Submit button (F), user can post the selected subsequence coordinates to the main script that generates new FASTA files and display the Results page.</p>
            </text>
            <graphic file="1471-2105-6-S4-S22-2"/>
         </fig>
         <p>3. <b>FRAME MODE</b>: both results and slider on the same web page using frames</p>
         <p>The software retrieves the sequences of the two orthologous genes from the local database. At this step, <ul>exons of the two orthologous genes are masked</ul>, since similarities between conserved coding regions are not interesting for our purpose. After sequence retrieval, the software generates <it>'On-the fly' </it>an image of the gene structures with associated intron / exon structures.</p>
         <p>A Java applet (Figure <figr fid="F2">2</figr>) displays gene structures and neighbourhoods, the user can select subsequences. The same operation can be performed in text mode. After this step, the user can start sequence analysis from the results page.</p>
         <p>On the results page the investigator can identify and analyze sequence similarities with a number of tools:</p>
         <p>&#8226;<b> Pairwise Blast</b><abbrgrp><abbr bid="B12"><b>12</b></abbr></abbrgrp><b>: </b>While the standard BLAST program is widely used to search for homologous sequences in nucleotide and protein databases, it is necessary to compare only two sequences to ascertain their similarity or common features. In such cases, searching the entire database would be unnecessarily time-consuming. 'BLAST 2 Sequences' utilizes the BLAST algorithm for pairwise DNA-DNA or protein-protein sequence comparison.</p>
         <p>&#8226;<b> Dotmatcher</b><abbrgrp><abbr bid="B13"><b>13</b></abbr></abbrgrp><b>:</b> Dotplot is a graphical representation of the regions of similarity between two sequences. The two sequences are placed along the axes of a rectangular matrix and (subject to threshold conditions) wherever there is equality between the sequences a dot is placed on the image. Where the two sequences have substantial regions of similarity, many dots align to form diagonal lines. It is therefore possible to glance at local regions of similarity, as these will show diagonal lines (Figure <figr fid="F3">3</figr>). In this version of Dotplot the user can control <b>window size</b>, <b>threshold </b>and <b>which strand to align</b>: a small subroutine in the web page named '<it>strand-helper</it>' helps in choosing the best configuration (align plus strand against plus, minus strand against minus, plus against minus, etc.). Even if the software selects a default strand configuration, the user can manually choose another strand mode. On the web page, text boxes give the exact position (relative to the effective sequence length) of the cursor on the X and Y-axis; this helps the user in choosing the sequence stretch of interest.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>dotmatcher</p>
            </caption>
            <text>
               <p><b>dotmatcher. </b>A Dotplot image: <it>C. elegans </it>and <it>C. briggsae </it>sequences are placed on the axes. The gene structure generated on the fly help the Investigator to orient itself in the rectangular image. The top of the image shows parameters given by the investigator (windowsize and threshold). This web images are clickable by the user, the single click is extended in both directions for <it>n </it>base pairs. Both segments (one for <it>C. elegans </it>and one for <it>C. briggsae</it>) are sent to the Ssearch control page, in order to align segments with the Smith and Waterman algorithm.</p>
            </text>
            <graphic file="1471-2105-6-S4-S22-3"/>
         </fig>
         <p>&#8226; <b>Ssearch </b><abbrgrp><abbr bid="B14"><b>14</b></abbr></abbrgrp><b>:</b> Ssearch uses Pearson's implementation of the method of Smith and Waterman <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> to search for similarities between one sequence (the query) and any group of sequences of the same type (nucleic acid or protein). After the Smith-Waterman score for a pairwise alignment is determined, Ssearch uses a simple linear regression against the natural log of the search set sequence length to calculate a normalized z-score for the sequence pair <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The distribution of the z-scores tends to closely approximate an extreme-value distribution; using this distribution, the program can estimate the number of sequences that would be expected to produce a z-score greater than or equal to the z-score obtained in the search. This is reported as the E() score. When all of the search set sequences have been compared to the query, the list of best scores is printed. In our implementation Ssearch is used for aligning two subsections isolated from the Dotplot output.</p>
         <p>&#8226; <b>BlastZ</b><abbrgrp><abbr bid="B17"><b>17</b></abbr></abbrgrp><b>and rVista </b><abbrgrp><abbr bid="B18"><b>18</b></abbr></abbrgrp><b>: BlastZ </b>computes local alignments for sequences of any length based on the assumption that the input sequences are related and share blocks of high conservation that are separated by regions that lack similarity and vary in length. Regions of homology are displayed collinear only to the reference sequence, while the order and orientation of the conserved elements is not necessarily the same in the second sequence.</p>
         <p>Identifying transcriptional regulatory elements represents a significant challenge in annotating genomes. Our bioinformatics procedure has been transparently connected trough the http protocol to a computational tool, <b>rVista</b>. rVista is aimed at high-throughput discovery of <it>cis</it>-regulatory elements, combining clustering of predicted transcription factor binding sites (TFBSs) and maximizing the identification of functional sites.</p>
         <p>A continuous exchange of ideas and information about this software and its interface occurred between the software developers and the nematode researchers. This user feedback has hence been fundamental in tailoring the graphical interface in functions of his needs.</p>
         <p>A PAIRWISE alignment performed with <b>blast2seq </b>(blast two sequences), a heuristic algorithm (BLAST), is used for fast comparison of two sequences.</p>
         <p><b>Dotmatcher </b>uses a simple but slow algorithm; for this reason we have chosen this as a tool to verify the alignments identified in the first step. When two regions of similarity have been found, this similarity can be quantified with the <b>Smith and Waterman algorithm </b>(best local alignment). This is the most sensitive method available for pairwise sequence comparison, but it works slowly, therefore it is more appropriate for in depth analyses than primary searches.</p>
         <p>The last component of the toolbox is a transparent connection through the http layer to a database search tool (<b>rVista</b>) focused on the identification of transcription factor binding sites related to conserved sequence elements. Continuous update of the transcription factors for vertebrates and, in particular, for nematodes is thus guaranteed. Local execution of <b>blastz </b>algorithm permits a fast genome sequence alignment, while the remote server (<b>rVista</b>) performs transcription factor binding site analysis.</p>
         <p>Finally, in order to test the quality of the software, we performed the following experiment: Natarajan and colleagues <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> have recently determined enhancer elements in the <it>C. elegans </it>gene encoding for a beta catenin homologue: <it>bar-1</it>. By classical promoter analysis (creating transgenic lines bearing deletion constructs of the promoter region) they were able to identify two different Cis acting elements <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. In a second step, by alignment, they show that these enhancers/elements are conserved in <it>C. briggsae</it>. By simply using our software we were able to confirm all the elements described in this work without any molecular biology experimentation needed (data not shown). With the help of NemaFootPrinter, from now on, researchers will "first" use our software and "then" test the results by creating just a few transgenic strains in order to just "test" the results and not to "identify" the regions by tedious <it>in vivo </it>experiments.</p>
         <sec>
            <st>
               <p>Other tools for performing comparative genomics</p>
            </st>
            <p>CisHorto, another tool for transcription-factor identification <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, uses Position Weight Matrices and a user-provided ungapped multiple alignment to predict new transcription factors binding sites. Our software adopts a simpler strategy for the identification of putative transcription factor sites, based on sequence similarity and exon masking to highlight similarities between non-coding regions.</p>
            <p>CSTminer <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> is an user-friendly tool for generic identification of coding and noncoding conserved sequence tags through cross-species genome comparison, that uses an original algorithm to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a "coding potential score". We focused our development specifically on nematode genetics, leaving to the final user a high degree of interactivity and exploration for the identification of conserved sequence regions.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Genome annotation databases such as Wormbase <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> or EnsEMBL <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> have a fundamental role in modern biological research and they also offer a platform to bioinformatics development. We have presented a simple web-based software aimed to identify conserved functional segments outside exons (putative new gene expression control elements) through comparative genomics between the nematodes <it>C. elegans </it>and <it>C. briggsae </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. With this project we have highlighted that a specific bioinformatics project can be realized by a sound integration of genome database mirrors, local development and transparent integration with remote resources <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Where speed and robustness were needed, we relied on local mirroring of databases and development of software modules, but when other resources already solved the task we integrated seamlessly calls to remote services through the http protocol. As a result, a new resource, aimed at solving a specific biological problem is now freely available at <url>http://bio.ifom-firc.it/NTFootPrinter/howto.html</url>. We have also demonstrated that usability and functionality in bioinformatics development can be achieved only through a strong and continued feedback from the scientist/user.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>
            <b>Project name: </b>
            <it>NemaFootPrinter</it>
         </p>
         <p>
            <b>Project home page: </b>
            <url>http://bio.ifom-firc.it/NTFootPrinter/index.html</url>
         </p>
         <p><b>server side</b>: UNIX type platforms</p>
         <p><b>client side</b>: Any operating system</p>
         <p><b>Programming language: </b>SQL, Perl, Java</p>
         <p>
            <b>Other requirements:</b>
         </p>
         <p>The web-based application was tested and is compatible with the more common Internet browser. For the Slider Applet the user must have a Java Virtual Machine installed and configured on the client. User without Java can use the TEXT MODE to analyze genes. Even the older <it>text</it>-<it>only </it>browsers like 'lynx' are compatibles with the software (obviously text-only browser must use the TEXT MODE display). For more compatibility information check the help page: <url>http://bio.ifom-firc.it/NTFootPrinter/slider_help.html</url></p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>All authors contributed to the development of NemaFootPrinter.</p>
         <p>DR wrote most parts of the NemaFootPrinter core, the CGI scripts, the Java applet and the database handlers. AG was responsible of the main bioinformatics programming strategy.</p>
         <p>PM was respnsible of biological applied aspect.</p>
         <p>GC made the scientific supervision and interface design. All authors drafted the manuscript and approved the final version.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors wish to thank all the teams behind the public resources used for this development work: Wormbase, EnsEMBL, EnsMart and rVista. DR is supported by an AIRC (Italian Association for Cancer Research) fellowship in the framework of the AIRC Bioinformatic Center Grant (BICG). AG is supported by FIRC (Italian Foundation for Cancer Research). NemaFootPrinter is a project developed with Perl, MySQL and others open-source software, for these reasons we wish also thank all the Open Source community for their support to the Bioinformatics research. We also acknowledge the members of the Cassata lab for comments on the manuscript.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Bioinformatics for the 'bench biologist': how to find regulatory regions in genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Nardone</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>DU</fnm>
               </au>
               <au>
                  <snm>Ansel</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Immunol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>768</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ni0804-768</pubid>
                  <pubid idtype="pmpid" link="fulltext">15282556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Algorithms for phylogenetic footprinting</p>
            </title>
            <aug>
               <au>
                  <snm>Blanchette</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <fpage>211</fpage>
            <lpage>223</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/10665270252935421</pubid>
                  <pubid idtype="pmpid" link="fulltext">12015878</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Of mice and men: phylogenetic footprinting aids the discovery of regulatory elements</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Biol</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193683</pubid>
                  <pubid idtype="pmpid" link="fulltext">12814519</pubid>
                  <pubid idtype="doi">10.1186/1475-4924-2-11</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Benchmarking tools for the alignment of functional noncoding DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Stoye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Celniker</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>6</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">344529</pubid>
                  <pubid idtype="pmpid" link="fulltext">14736341</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-6</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>DG-CST (Disease Gene Conserved Sequence Tags), a database of human-mouse conserved elements associated to disease genes</p>
            </title>
            <aug>
               <au>
                  <snm>Boccia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Petrillo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>di Bernardo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Guffanti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mignone</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Confalonieri</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luzi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Paolella</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ballabio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Banfi</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D505</fpage>
            <lpage>510</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539965</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608249</pubid>
                  <pubid idtype="doi">10.1093/nar/gki011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Reproductive patterns and attempts at reciprocal crossing of Rhabditis elegans Maupas, 1900, and Rhabditis briggsae Dougherty and Nigon, 1949 (Nematoda: Rhabditidae)</p>
            </title>
            <aug>
               <au>
                  <snm>Nigon</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>J Exp Zool</source>
            <pubdate>1949</pubdate>
            <volume>112</volume>
            <fpage>485</fpage>
            <lpage>503</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jez.1401120307</pubid>
                  <pubid idtype="pmpid">15404785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Analysis of the constancy of DNA sequences during development and evolution of the nematode Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Emmons</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Klass</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1979</pubdate>
            <volume>76</volume>
            <fpage>1333</fpage>
            <lpage>1337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383245</pubid>
                  <pubid idtype="pmpid">286315</pubid>
                  <pubid idtype="doi">10.1073/pnas.76.3.1333</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Conservation of function and expression of unc-119 from two Caenorhabditis species despite divergence of non-coding DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Maduro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pilgrim</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1996</pubdate>
            <volume>183</volume>
            <fpage>77</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(96)00491-X</pubid>
                  <pubid idtype="pmpid">8996090</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>WormBase: a comprehensive data resource for Caenorhabditis biology and genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Antoshechkin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bastiani</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bieri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bradnam</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Canaran</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>CK</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database</issue>
            <fpage>D383</fpage>
            <lpage>389</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540020</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608221</pubid>
                  <pubid idtype="doi">10.1093/nar/gki066</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>EnsMart: a generic system for fast and flexible access to biological data</p>
            </title>
            <aug>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rocca-Serra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>160</fpage>
            <lpage>169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314293</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707178</pubid>
                  <pubid idtype="doi">10.1101/gr.1645104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A uniform genetic nomenclature for the nematode Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Horvitz</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hodgkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Herman</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1979</pubdate>
            <volume>175</volume>
            <fpage>129</fpage>
            <lpage>133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00425528</pubid>
                  <pubid idtype="pmpid">292825</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1999</pubdate>
            <volume>174</volume>
            <fpage>247</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1097(99)00149-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10339815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>EMBOSS: the European Molecular Biology Open Software Suite</p>
            </title>
            <aug>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bleasby</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>276</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02024-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">10827456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Improved tools for biological sequence comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <fpage>2444</fpage>
            <lpage>2448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">280013</pubid>
                  <pubid idtype="pmpid" link="fulltext">3162770</pubid>
                  <pubid idtype="doi">10.1073/pnas.85.8.2444</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Identification of common molecular subsequences</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1981</pubdate>
            <volume>147</volume>
            <fpage>195</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(81)90087-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">7265238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Comparison of methods for searching protein sequence databases</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1995</pubdate>
            <volume>4</volume>
            <fpage>1145</fpage>
            <lpage>1160</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7549879</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Human-mouse alignments with BLASTZ</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>103</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430961</pubid>
                  <pubid idtype="pmpid" link="fulltext">12529312</pubid>
                  <pubid idtype="doi">10.1101/gr.809403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>rVISTA 2.0: evolutionary analysis of transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W217</fpage>
            <lpage>221</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441521</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215384</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identification of evolutionarily conserved promoter elements and amino acids required for function of the C. elegans beta-catenin homolog BAR-1</p>
            </title>
            <aug>
               <au>
                  <snm>Natarajan</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Szyleyko</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Eisenmann</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2004</pubdate>
            <volume>272</volume>
            <fpage>536</fpage>
            <lpage>557</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2004.05.027</pubid>
                  <pubid idtype="pmpid" link="fulltext">15282167</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>CisOrtho: a program pipeline for genome-wide identification of transcription factor target genes using phylogenetic footprinting</p>
            </title>
            <aug>
               <au>
                  <snm>Bigelow</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Wenick</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hobert</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>27</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">406492</pubid>
                  <pubid idtype="pmpid" link="fulltext">15113408</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-27</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Castrignano</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Canali</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Grillo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liuni</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mignone</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W624</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441624</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215464</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>WormBase: network access to the genome and biology of Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sternberg</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>82</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29781</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125056</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.82</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Ensembl 2005</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database</issue>
            <fpage>D447</fpage>
            <lpage>453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540092</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608235</pubid>
                  <pubid idtype="doi">10.1093/nar/gki138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Coghlan</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <fpage>E45</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">261899</pubid>
                  <pubid idtype="pmpid" link="fulltext">14624247</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0000045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Data acquisition, data storage, and data presentation in a modern genetics laboratory</p>
            </title>
            <aug>
               <au>
                  <snm>Geraghty</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Fortelny</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Guthrie</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Irving</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pham</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Daza</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stonehocker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Vu</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>Rev Immunogenet</source>
            <pubdate>2000</pubdate>
            <volume>2</volume>
            <fpage>532</fpage>
            <lpage>540</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12361094</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
