<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-96</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Annotation of expressed sequence tags for the East African cichlid fish <it>Astatotilapia burtoni </it>and evolutionary analyses of cichlid ORFs</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Salzburger</snm>
               <fnm>Walter</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>walter.salzburger@unibas.ch</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Renn</snm>
               <mi>CP</mi>
               <fnm>Susan</fnm>
               <insr iid="I3"/>
               <email>renns@reed.edu</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Steinke</snm>
               <fnm>Dirk</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>dsteinke@uoguelph.ca</email>
            </au>
            <au id="A4">
               <snm>Braasch</snm>
               <fnm>Ingo</fnm>
               <insr iid="I1"/>
               <insr iid="I5"/>
               <email>ingo.braasch@biozentrum.uni-wuerzburg.de</email>
            </au>
            <au id="A5">
               <snm>Hofmann</snm>
               <mi>A</mi>
               <fnm>Hans</fnm>
               <insr iid="I6"/>
               <email>hans@mail.utexas.edu</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Meyer</snm>
               <fnm>Axel</fnm>
               <insr iid="I1"/>
               <email>axel.meyer@uni-konstanz.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Lehrstuhl f&#252;r Zoologie und Evolutionsbiologie, Department of Biology, University of Konstanz, 78467 Konstanz, Germany</p>
            </ins>
            <ins id="I2">
               <p>Zoological Institute, University of Basel, 4051, Switzerland</p>
            </ins>
            <ins id="I3">
               <p>Department of Biology, Reed College, Portland, Oregon 97202, USA</p>
            </ins>
            <ins id="I4">
               <p>Guelph Centre for DNA Barcoding, Biodiversity Institute of Ontario, University of Guelph, Guelph, Ontario N1G 2W1, Canada</p>
            </ins>
            <ins id="I5">
               <p>Physiological Chemistry I, Biozentrum, University of W&#252;rzburg, 97074 W&#252;rzburg, Germany</p>
            </ins>
            <ins id="I6">
               <p>Section of Integrative Biology, University of Texas at Austin, Austin, Texas 78712, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>96</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/96</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18298844</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-96</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>25</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>25</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Salzburger et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The cichlid fishes in general, and the exceptionally diverse East African haplochromine cichlids in particular, are famous examples of adaptive radiation and explosive speciation. Here we report the collection and annotation of more than 12,000 expressed sequence tags (ESTs) generated from three different cDNA libraries obtained from the East African haplochromine cichlid species <it>Astatotilapia burtoni </it>and <it>Metriaclima zebra</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We first annotated more than 12,000 newly generated cichlid ESTs using the Gene Ontology classification system. For evolutionary analyses, we combined these ESTs with all available sequence data for haplochromine cichlids, which resulted in a total of more than 45,000 ESTs. The ESTs represent a broad range of molecular functions and biological processes. We compared the haplochromine ESTs to sequence data from those available for other fish model systems such as pufferfish (<it>Takifugu rubripes </it>and <it>Tetraodon nigroviridis</it>), trout, and zebrafish. We characterized genes that show a faster or slower rate of base substitutions in haplochromine cichlids compared to other fish species, as this is indicative of a relaxed or reinforced selection regime. Four of these genes showed the signature of positive selection as revealed by calculating K<sub>a</sub>/K<sub>s </sub>ratios.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>About 22% of the surveyed ESTs were found to have cichlid specific rate differences suggesting that these genes might play a role in lineage specific characteristics of cichlids. We also conclude that the four genes with a K<sub>a</sub>/K<sub>s </sub>ratio greater than one appear as good candidate genes for further work on the genetic basis of evolutionary success of haplochromine cichlid fishes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The exceptionally diverse species flocks of cichlid fishes in the East African Great Lakes Tanganyika, Malawi and Victoria are prime examples for adaptive radiations and explosive speciation <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. More than 2,000 cichlid species have evolved in the last few million years in the rivers and lakes of East Africa <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Together with an additional ~1,000 species that are found in other parts of Africa, in South and Central America, in Madagascar, and in India, the family Cichlidae represents one of the most species-rich families of vertebrates. In addition to their unparalleled species-richness, cichlids are famous for their ecological, morphological and behavioral diversity <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B7">7</abbr></abbrgrp>, for their propensity for rapid speciation <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, for their capacity for sympatric speciation <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, and for the formation of parallel characters in independently evolved species flocks <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. For these reasons, the cichlid fishes are an excellent model system to study basic dynamics of evolution, adaptation and speciation. However, while the phylogenetic relationships between the main cichlid lineages are largely established and some of the cichlids' evolutionary innovations have been identified <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr><abbr bid="B7">7</abbr><abbr bid="B13">13</abbr></abbrgrp>, little is known about the genomic and transcriptional basis of the evolutionary success of the cichlids.</p>
         <p>The cichlid model system provides many advantages for evolutionary genomic research. The hundreds of closely related yet morphologically diverse species in East Africa's cichlid species flocks are even more powerful than a 'mutagenic screen' (to which these species assemblages have been compared <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B12">12</abbr></abbrgrp>) in that they represent combinations of alleles that confer a selective advantage under various ecological pressures. Because of the possibility to produce viable crosses between different cichlid species in the lab <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, these alleles can be tied to particular phenotypic traits by means of classical genetic experiments <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. The close relatedness of the different species allows the design of primer sets for the amplification of particular genomic DNA regions such as candidate gene loci, microsatellites, or SNPs, which are applicable to a wide range of species <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. The same is true for expression profiling with cDNA microarrays that, once developed for one species, can be used for any East African cichlid species <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
         <p>A variety of genomic resources have already been established for East African cichlid species. Genetic maps are available for the Nile tilapia <it>Oreochromis niloticus </it><abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> and the Lake Malawi species <it>Metriaclima zebra </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. BAC libraries have been constructed for <it>O. niloticus </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and <it>M. zebra </it>(available at the Hubbard Center for Genome Studies), for the Lake Victoria haplochromine <it>Paralabidochromis chilotes </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and for <it>Astatotilapia burtoni </it>from Lake Tanganyika and surrounding rivers <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. cDNA microarrays are available for <it>A. burtoni </it><abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and for Lake Victoria haplochromines <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. Also, EST sequencing projects have been initiated <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and a BLAST server for cichlid resources has been established <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Recently, the National Institute of Health (NIH) has committed to sequencing four cichlid genomes. A detailed description of genomic resources developed for cichlid fishes is available at <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         <p>Expressed sequence tags (ESTs) derived from the partial sequencing of cDNA clones provide an economical approach to identify large numbers of genes that can be used for comparative genomic and gene expression studies as well as for the detection of splice variants <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Furthermore, EST projects facilitate genome annotation and are therefore often applied in addition to genome sequencing projects. Due to the large amount of data available in public databases, ESTs emerge as important resources for comparative genome-wide surveys both among closely and more distantly related taxa <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. A series of software applications have been developed to date to perform such EST-based analyses <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Since ESTs reflect the coding portions of a genome, they can also be used to test for different evolutionary rates in particular genes when comparing different lineages, and to detect genes that have undergone positive selection <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. It is generally assumed that genes with a statistically significant increase in substitution rates have experienced relaxed functional constraints, while genes, which have not undergone accelerated substitution rates, have experienced purifying selection and, thus, could not accumulate substitutions at random. Positive Darwinian selection, on the other hand, is a phenomenon where selective pressure is favoring change. Natural selection is commonly thought of as a process of editing genetic change so that only a small number of mutational events are retained in natural populations. Under positive selection, the retention of mutations is much closer to the rate at which mutations occur.</p>
         <p>Here we report the collection and annotation of more than 12,000 ESTs generated from two different cDNA libraries obtained from the East African cichlid species <it>Astatotilapia burtoni</it>, as well as a smaller cDNA library from the Lake Malawi species <it>Metriaclima zebra. Astatotilapia burtoni </it>has long been used as a model system to study cichlid spawning behavior <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>, social interactions <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>, neural and behavioral plasticity <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>, endocrinology <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, the visual system <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, as well as cichlid development and embryogenesis <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. In addition, the phylogenetic position of <it>A. burtoni </it>makes this species an ideal model system for comparative genomic research <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. <it>Astatotilapia burtoni</it>, which belongs to the most species-rich lineage of cichlids, the haplochromines, was shown to be a sister group to both the Lake Victoria region superflock (~600 species) and the species flock of Lake Malawi (~1,000 species) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. Three highly specialized haplochromine species from two species assemblages, <it>Paralabidochromis chilotes </it>and <it>Ptyochromis sp. </it>"redtail sheller" from Lake Victoria and <it>Metriaclima zebra </it>from Lake Malawi, have already been established as genomic models <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B26">26</abbr><abbr bid="B28">28</abbr><abbr bid="B30">30</abbr></abbrgrp>. Important insight into cichlid (genome) evolution will be afforded by the comparison of their genomes to that of <it>A. burtoni</it>, which has a more generalist life style and is likely to resemble the ancestral lineage that seeded the cichlid adaptive radiations in these two lakes <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B7">7</abbr></abbrgrp>.</p>
         <p>For EST sequencing, we utilized a cDNA library from <it>A. burtoni </it>brain tissue ('<it>brain</it>') that was used for the construction of a cDNA microarray <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and a newly generated normalized cDNA library constructed from different <it>A. burtoni </it>tissues at different developmental stages ('<it>pinky</it>'). We annotated the ESTs on the basis of similarity searches with BLAST and using the structured vocabulary provided by the Gene Ontology Consortium <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, based on molecular studies of gene function in various model organisms <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. For evolutionary analyses, we combined our newly generated ESTs with all available sequence data for haplochromine cichlids <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and a previously constructed library from skin tissue of the Lake Malawi species <it>Metriaclima zebra </it>(W. Salzburger, H. A. Hofmann &amp; A. Meyer; unpublished data), which resulted in a total of more than 45,000 ESTs. We then compared the haplochromine ESTs to sequence data from two pufferfish species (<it>Takifugu rubripes </it>and <it>Tetraodon nigroviridis</it>), trout, and zebrafish, and identified those ESTs with cichlid specific differences in evolutionary rates with EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>The 14,592 initial sequences were trimmed of vector and low-quality sequences and filtered for minimum length (200 bp cut-off), identifying 12,070 high-quality ESTs (Table <tblr tid="T1">1</tblr>). More than 11,000 of these ESTs (from 13,056 initial sequences) are derived from two different <it>Astatotilapia burtoni </it>cDNA libraries &#8211; one made from brain tissue ('<it>brain</it>'), the other one from different tissues ('<it>pinky</it>') including brain, muscle, skin and fin. The overall quality as measured by sequencing success rate and read-length was better in the '<it>pinky</it>' library. Also, there was much less redundancy in the '<it>pinky</it>' library (16% <it>versus </it>30%), which might be the consequence of the normalization step applied to this library or the use of different source tissues.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Expressed sequence tag (EST) summary</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="left">
                     <p>Total sequences</p>
                  </c>
                  <c ca="center">
                     <p>13,056</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>High quality sequences</p>
                  </c>
                  <c ca="center">
                     <p>12,070 (between 200 and 1,564 bp)</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Brain library (<it>A. burtoni</it>) ('brain')</p>
                  </c>
                  <c ca="center">
                     <p>4,570</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Mixed tissue library (<it>A. burtoni</it>) ('pinky')</p>
                  </c>
                  <c ca="center">
                     <p>6,541</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Skin library (<it>P. zebra</it>)</p>
                  </c>
                  <c ca="center">
                     <p>959</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>A total of 8,636 <it>A. burtoni </it>sequences assembled into EST contigs have an open reading frame (ORF) of at least 400 bp. Of these, 1,219 (14%) had matches in the <it>Takifugu </it>database and 7,417 (86%) had no matches when an expected value threshold (e-value) of &lt; 1 &#215; 10<sup>-50 </sup>was used. 2,902 (34%) had matches in the <it>Takifugu </it>database with an expected value threshold of &lt; 1 &#215; 10<sup>-15 </sup>and 3,460 (40%) had matches with an expected value of &lt; 1 &#215; 10<sup>-5</sup>. Similar proportions were retrieved with other databases (Fig. <figr fid="F1">1</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>The proportion of assembled haplochromine cichlid sequences with and without BLAST matches compared to three databases (<it>Takifugu rubripes</it>, <it>Danio rerio</it>, and <it>Oncorhynchus mykiss</it>)</p>
            </caption>
            <text>
               <p><b>The proportion of assembled haplochromine cichlid sequences with and without BLAST matches compared to three databases (<it>Takifugu rubripes</it>, <it>Danio rerio</it>, and <it>Oncorhynchus mykiss</it>)</b>. The pie charts indicate the relative number of BLAST hits (blue) <it>versus </it>the percentage fraction, for which no BLAST hit was retrieved (red) for three different e-values (&lt; 10<sup>-50</sup>, &lt; 10<sup>-15</sup>, and &lt;10<sup>-5</sup>, respectively).</p>
            </text>
            <graphic file="1471-2164-9-96-1"/>
         </fig>
         <p>Among the 8,363 <it>A. burtoni </it>assembled sequences, 2,977 could be annotated according to Gene Ontology (GO) terms. Additional files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr> use the generic GO slim subset of terms (<abbrgrp><abbr bid="B54">54</abbr></abbrgrp>; Generic GO slim; Mundodi and Ireland; downloaded 04/06/2007) that have been developed to provide a useful summary of GO annotation for comparison of genomes, microarrays, or cDNA collections when a broad overview of the ontology content is required. 2,692 ESTs could be assigned to genes listed in the molecular function ontology, 2,532 to genes listed in the biological process ontology, and 2,293 to genes listed in the cellular components ontology, when using an e-value of &lt; 1 &#215; 10<sup>-12</sup>. Additional files <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, and <supplr sid="S6">6</supplr> provide more detail of the specific fine-grained terms. Because a single <it>A. burtoni </it>assembled sequence may be annotated in all three ontologies and according to multiple ontology terms, a total of 27,451 annotations have been applied (10,926 among biological process, 9,414 among molecular function, and 7,111 among cellular component).</p>
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <text>
               <p><b>Gene ontology table (generic GO slim subset for molecular function)</b>. Hierarchical classification of the GO slim subset for molecular function. Indented terms are children of parent terms listed above. For each term, the number of <it>A. burtoni </it>assembled sequences that match genes to which Gene Ontology annotations have been assigned at, or below, this general level is given. Note that genes may be assigned to more than one term and child terms may have more than one parent term. For parent terms, the total number of <it>A. burtoni </it>assembled sequences is given in parentheses. Match means that the annotation derives from a gene that was the "best hit" for the <it>A. burtoni </it>sequence at and e-value &lt; 10<sup>-12</sup>.</p>
            </text>
            <file name="1471-2164-9-96-S1.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p><b>Gene ontology table (generic GO slim subset for biological process)</b>. Hierarchical classification of the GO slim subset for biological process. Indented terms are children of parent terms listed above. Genes may be assigned to more than one term. For each term, the number of <it>A. burtoni </it>assembled sequences that match genes to which Gene Ontology annotations have been assigned at, or below, this general level is given. Note that genes may be assigned to more than one term and child terms may have more than one parent term. For parent terms, the total number of <it>A. burtoni </it>assembled sequences is given in parentheses. Match means that the annotation derives from a gene that was the "best hit" for the <it>A. burtoni </it>sequence at and e-value &lt; 10<sup>-12</sup>.</p>
            </text>
            <file name="1471-2164-9-96-S2.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <text>
               <p><b>Gene ontology table (generic GO slim subset for cellular component)</b>. Hierarchical classification of the GO slim subset for cellular component. Indented terms are children of parent terms listed above. Genes may be assigned to more than one term. For each term, the number of <it>A. burtoni </it>assembled sequences that match genes to which Gene Ontology annotations have been assigned at, or below, this general level is given. Note that genes may be assigned to more than one term and child terms may have more than one parent term. For parent terms, the total number of <it>A. burtoni </it>assembled sequences is given in parentheses. Match means that the annotation derives from a gene that was the "best hit" for the <it>A. burtoni </it>sequence at and e-value &lt; 10<sup>-12</sup>.</p>
            </text>
            <file name="1471-2164-9-96-S3.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional file 4</p>
            </title>
            <text>
               <p><b>Directed acyclic graph (DAG) of the cichlid specific Gene ontology (GO) slim for molecular function</b>. The graph shows the cichlid specific GO slim for molecular function. Molecular function terms were selected for inclusion in the ontologies such that leaf nodes include approximately 20 annotated genes. Circle size represents relative number of genes annotated to each parent node.</p>
            </text>
            <file name="1471-2164-9-96-S4.JPEG">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <text>
               <p><b>Directed acyclic graph (DAG) of the cichlid specific Gene ontology (GO) slim for biological process</b>. The graph shows the cichlid specific GO slim for biological process. Biological process terms were selected for inclusion in the ontologies such that leaf nodes include approximately 20 annotated genes. Circle size represents relative number of genes annotated to each parent node.</p>
            </text>
            <file name="1471-2164-9-96-S5.JPEG">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional file 6</p>
            </title>
            <text>
               <p><b>Directed acyclic graph (DAG) of the cichlid specific Gene ontology (GO) slim for cellular component</b>. The graph shows the cichlid specific GO slim for cellular component. Cellular component terms were selected for inclusion in the ontologies such that leaf nodes include approximately 20 annotated genes. Circle size represents relative number of genes annotated to each parent node.</p>
            </text>
            <file name="1471-2164-9-96-S6.JPEG">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>For the comparative evolutionary analyses, we combined our newly generated ESTs with previously published data from <it>Paralabidochromis chilotes </it>and <it>P. sp. </it>"redtail sheller" <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and about 1,000 sequences obtained from a <it>Metriaclima zebra </it>skin cDNA library (W. Salzburger, H. A. Hofmann &amp; A. Meyer; unpublished data). When using this set of haplochromine cichlid ESTs as reference, we identified 759 open reading frames that are present in all six databases used for comparative analyses (haplochromine cichlids, <it>Danio rerio</it>, <it>Homo sapiens</it>, <it>Oncorhynchus mykiss</it>, <it>Takifugu rubripes</it>, and <it>Tetraodon nigroviridis</it>).</p>
         <p>In order to identify sequences that evolve significantly more rapidly or more slowly in the haplochromine cichlid, we applied the triangle method implemented in EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> to calculate the p-distance for each of these 759 ORFs in all fish species relative to the human ortholog. There were 22 cases in which more than one haplochromine sequence was found. In these cases, we used the longest sequence for further analyses. The relative p-distances for three fish species were then mapped in ternary diagrams. An example of such a ternary diagram is shown in Fig. <figr fid="F2">2a</figr>, in this case showing the relative p-distances of cichlid, <it>Takifugu rubripe</it>s, and <it>Danio rerio </it>amino acid sequences with respect to the homologous <it>Homo sapiens </it>genes. Figure <figr fid="F2">2b</figr> depicts a diagram with <it>Oncorhynchus mykiss </it>amino acid sequence divergence instead of haplochromine cichlid. The ternary diagrams show that in all combinations most genes are clustered around the center of the respective triangle, which indicates that, in general, the p-distances relative to the human outgroup are similar in all fish species.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Ternary representation of relative distances of ORFs of three fish species compared to their human orthologs</p>
            </caption>
            <text>
               <p><b>Ternary representation of relative distances of ORFs of three fish species compared to their human orthologs</b>. (<it>a</it>) Haplochromine cichlid, <it>Danio rerio</it>, and <it>Takifugu rubripes</it>, (<it>b</it>) <it>Danio rerio</it>, <it>Oncorhynchus mykiss</it>, and <it>Takifugu rubripes</it>. Each dot represents a single ORF, the position of the dot within the ternary diagram indicates the relative distance of this ORF in each of the three fish species compared to the orthologous ORF in human. We were interested in identifying those ORFs that show a faster or slower rate of molecular evolution in the haplochromine cichlids.</p>
            </text>
            <graphic file="1471-2164-9-96-2"/>
         </fig>
         <p>When compared to the green-spotted pufferfish (<it>Tetraodon nigroviridis</it>) and fugu (<it>Takifugu rubripes</it>) (always with human as outgroup), 49 gene fragments appeared to have a significantly faster rate of evolution in haplochromine cichlids, and 213 had a slower rate. In the comparison including zebrafish and fugu, 52 genes were found to have evolved faster and 185 genes slower in cichlids. When trout and zebrafish were used, 69 genes were faster and 139 genes evolved slower. In a comparison including trout and fugu, 68 genes appeared to have a faster rate in haplochromines, and 132 had a slower rate. In total 69 genes were found to have evolved faster, and 213 genes appeared to have evolved with a significantly slower mutation rate in haplochromines compared to other fish species. Altogether, about 22% of the surveyed ESTs were found to have haplochromine specific rate differences in at least one of the comparisons suggesting that these genes might play a role in lineage specific features of haplochromine cichlids. A set of 170 cichlid genes appeared in all comparisons. Forty-eight cichlid genes were found to have a higher rate of amino-acid substitution compared to the other fish species included in this study, while 122 cichlid genes were found to have a slower rate. Cichlid sequences that match <it>Danio rerio</it>, <it>Takifugu rubripes</it>, <it>Tetraodon nigroviridis</it>, and <it>Oncorhynchus mykiss </it>genes and have a significantly higher or lower p-distance compared to the other fish genes relative to the human outgroup are listed in Additional files <supplr sid="S7">7</supplr> and <supplr sid="S8">8</supplr>, respectively.</p>
         <suppl id="S7">
            <title>
               <p>Additional file 7</p>
            </title>
            <text>
               <p><b>ESTs with higher p-distances</b>. The table shows ESTs where the p-distance between <it>Homo sapiens </it>and haplochromine cichlid amino acid sequences is significantly higher as compared to other fish species (<it>Danio rerio, Takifugu rubripes, Tetraodon nigroviridis </it>and <it>Oncorhynchus mykiss</it>). Annotation means that the <it>Homo sapiens </it>gene was "best hit" for the cichlid sequence (and e-value &lt; 10<sup>-50</sup>).</p>
            </text>
            <file name="1471-2164-9-96-S7.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional file 8</p>
            </title>
            <text>
               <p><b>ESTs with smaller p-distances</b>. The table shows ESTs where the p-distance between <it>Homo sapiens </it>and haplochromine cichlid amino acid sequences is significantly smaller as compared to other fish species (<it>Danio rerio</it>, <it>Takifugu rubripes</it>, <it>Tetraodon nigroviridis</it>, and <it>Oncorhynchus mykiss</it>). Annotation means that the <it>Homo sapiens </it>gene was "best hit" for the Cichlid sequence (and e-value &lt; 10<sup>-50</sup>).</p>
            </text>
            <file name="1471-2164-9-96-S8.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>A histogram of the abundance of amino acid sequence divergences of all five fish species with respect to homologous human genes is depicted in Fig. <figr fid="F3">3</figr>. The p-distances appear normally distributed. With 0.211, cichlids show the lowest average distance followed by <it>Oncorhynchus mykiss </it>(0.216), <it>Danio rerio </it>(0.239), <it>Takifugu rubripes </it>(0.242), and <it>Tetraodon nigroviridis </it>(0.258). The average distance of all five fish species to <it>Homo sapiens </it>is 0.233. We also used the 482 redundant sequences that were found in all three large haplochromine cichlid EST datasets (<it>P. chilotes </it>and <it>P. sp. </it>"redtail sheller" <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>; <it>Astatotilapia burtoni</it>, this study) to calculate mean pairwise p-distances. Within these three cichlid species, we found a mean p-distance of 0.14 between <it>A. burtoni </it>and <it>P. chilotes</it>, 0.17 between <it>A. burtoni </it>and <it>P. sp. </it>"redtail sheller", and 0.08 between the two Lake Victoria species <it>P. chilotes </it>and <it>P. sp. </it>"redtail sheller".</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Histogram of the abundance of amino acid sequence divergences of all five fish species (haplochromine cichlid, <it>Danio rerio, Takifugu rubripes, Tetraodon nigroviridis, and Oncorhynchus mykiss</it>) with respect to human genes</p>
            </caption>
            <text>
               <p><b>Histogram of the abundance of amino acid sequence divergences of all five fish species (haplochromine cichlid, <it>Danio rerio, Takifugu rubripes, Tetraodon nigroviridis, and Oncorhynchus mykiss</it>) with respect to human genes</b>. P-distances have been calculated for a set of 759 ORFs found in all five fish species and plotted in categories of 0.1.</p>
            </text>
            <graphic file="1471-2164-9-96-3"/>
         </fig>
         <p>We then calculated K<sub>a</sub>/K<sub>s </sub>ratios for all genes with a higher or slower rate of base substitution in cichlids. K<sub>a</sub>/K<sub>s </sub>ratios greater than one, which are indicative of positive selection in that gene, were found in four genes that evolve more slowly in cichlids compared to the other fish species. The highest K<sub>a</sub>/K<sub>s </sub>ratio (3.77) was found in the neuroendocrine <it>convertase subtilisin/kexin type 1 </it>that is responsible for processing large precursor proteins into mature peptide hormones <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. In <it>claudin 3</it>, a member of the claudin family involved in the formation of tight junctions in various tissues <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, the K<sub>a</sub>/K<sub>s </sub>ratio was 1.55. A K<sub>a</sub>/K<sub>s </sub>ratio of 1.30 was observed in the catalyzing enzyme <it>glutathione peroxidase 3</it>, and a ratio of 1.19 was found in <it>m&#233;nage a trois 1 </it>(MNAT1), which is a member of the CDK7-cyclin H complex that functions in cell cycle progression <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, basal transcription, and DNA repair.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Expressed sequence tags are important genomic resources and their numbers in public databases such as GenBank are rapidly increasing. Full-length cDNA and EST sequencing projects typically accompany genome sequencing projects, as these data are essential for the recognition and annotation of genes, the characterization of the transcriptome, the identification of intron-exon boundaries and the detection of splice variants in eukaryotes, etc.<it/><abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. In addition, the standardized procedure of cDNA library construction and normalization, and the comparably low costs of large-scale DNA sequencing facilitate EST projects in organisms for which the whole genome sequencing has not (yet) been completed. Thus, EST sequencing projects outnumber genome-sequencing projects &#8211; particularly in groups with larger genome sizes such as plants and vertebrates &#8211; leading to a large body of sequence data available for comparative analyses. Large-scale EST analyses have been used in many other contexts, such as primary gene expression assays <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr></abbrgrp>, the estimation of the total number of genes in an organism <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>, cDNA microarray annotations <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>, or the construction of genetic linkage maps <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>. Expressed sequence tags can further be used for phylogenomics <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B69">69</abbr></abbrgrp>, and for the identification of microRNAs <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
         <p>Despite their many advantages, there are also some problems associated with ESTs. For example, EST sequences typically cover only parts of a gene, so that two sequences of the same gene might not necessarily overlap. That only fragments of a gene are available also leads to problems with homology-based analyses such as BLAST. Then, EST sequences often contain the untranslated regions (UTRs) that are present in mRNAs but do not translate into amino acids. Finally, it is often difficult to figure out the proper reading frame, particularly in shorter ESTs, which impedes certain analyses. A combination of multiple EST projects (as we have done here) helps to alleviate some of the shortcomings inherent in EST data.</p>
         <p>We have sequenced, annotated and conducted evolutionary analysis of ESTs of haplochromine cichlids for several reasons. First, this large set of sequence data for cichlid ORFs provides insight into the genome of a representative of haplochromine cichlids, which are a main model system for the study of adaptive evolution and explosive speciation <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Second, we wanted to extend the existing genomic resources for <it>Astatotilapia burtoni </it>such as a genomic BAC library <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> by establishing cDNA libraries from different tissues. Furthermore, these cDNA libraries provide the basis for annotated cDNA microarrays that are being used for expression analyses in a variety of cichlid species <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B28">28</abbr><abbr bid="B71">71</abbr></abbrgrp>. Finally, we were interested in identifying genes with a different evolutionary rate in the rapidly radiating cichlid lineage compared to other fish species, as well as in identifying genes that show the signature of adaptive evolution in cichlids.</p>
         <p>Of the two <it>A. burtoni </it>cDNA libraries that were used for EST sequencing, the normalized mixed tissue library ('<it>pinky</it>') was of better quality. Not only were there much fewer redundant sequences as compared to the <it>brain </it>library, which was mainly due to the normalization step, but also the average insert size was larger and the average read length was longer. Altogether, about 85% of the sequenced cDNA clones led to high-quality ESTs of a length of >200 bp (86% in <it>pinky</it>, and 85% in <it>brain</it>). In the BLAST searches against <it>Takifugu rubripes</it>, <it>Tetraodon nigroviridis</it>, and <it>Danio rerio</it>, between 14% (when compared to <it>T. rubripes</it>; e-value &#8804; 10<sup>-50</sup>) and 43% (when compared to <it>D. rerio</it>; e-value &#8804; 10<sup>-5</sup>) of the <it>A. burtoni </it>ESTs led to hits (Fig. <figr fid="F1">1</figr>). This lies well within the range of other EST sequencing projects <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B65">65</abbr><abbr bid="B72">72</abbr></abbrgrp>.</p>
         <p>About 8,600 <it>A. burtoni </it>ORFs (or 75% of the high quality ESTs) were longer than 400 bp, and about 3,000 sequences could unambiguously be annotated and classified following the vocabulary provided by the Gene Ontology Consortium [Additional files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>, <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>]. According to the Gene Ontology classification, it appears that a broad range of genes involved in functions, processes and compartments are represented in our EST set. This cichlid specific GO slim offers several advantages. First, it offers a rapid visual interpretation of gene subsets. Second, because the cichlid specific slim is built from those sequences used to build a cDNA microarray, it offers maximal power when testing for over- or under-representation of gene lists while reducing the need for correction for multiple hypothesis testing. Finally, it allows for a less experimenter-biased interpretation of microarray results, or other genomics analyses in a manner that can be easily compared between experiments.</p>
         <p>One of our main goals was to characterize genes in haplochromine cichlids that show a faster or slower rate of base substitutions in cichlids compared to other fish species, as this is indicative of a relaxed or reinforced selection regime, respectively <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. To this end, we combined our newly generated ESTs with previously published sequences for Lake Victoria haplochromine cichlids <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and about 1,000 sequences obtained from a <it>Metriaclima zebra </it>skin cDNA library, which resulted in a total of about 45,000 ORFs. By means of homology searches against human, the two pufferfishes, trout, and zebrafish using local BLAST, we identified a set of 759 ORFs that are present in all species and that show a sufficient degree of homology (e-value &#8804; 10<sup>-50</sup>) for further analyses with EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The number of genes with a cichlid-specific faster or slower rate of molecular evolution (always with human as outgroup) varied when different fish taxa were used in addition to the cichlid ORFs. However, we found a set of 170 genes (48 "faster" and 122 "slower"; Additional files <supplr sid="S7">7</supplr>, <supplr sid="S8">8</supplr>) that appeared in all comparisons and are, thus, good candidates for playing an important role in the evolution of (haplochromine) cichlid fishes.</p>
         <p>When characterizing these genes further, by means of calculating K<sub>a</sub>/K<sub>s </sub>ratios, we found that four genes (or 2.35% of all deviating genes) showed the signature of adaptive evolution in the haplochromine lineage. The highest K<sub>a</sub>/K<sub>s </sub>ratio (3.77) was found in the neuroendocrine <it>convertase subtilisin/kexin type 1</it>, followed by <it>claudin 3</it>, (1.55), <it>glutathione peroxidase 3 </it>(1.50), and <it>m&#233;nage a trois 1 </it>(1.19). All gene fragments that show a K<sub>a</sub>/K<sub>s </sub>> 1 are found among the more slowly evolving genes. These genes are now candidate genes for further investigations. The gene with the highest K<sub>a</sub>/K<sub>s </sub>ratio appears particularly interesting. It is known that neuroendocrine factors, such as gonadotropin releasing hormone (GnRH), are involved in regulation of reproduction and behavior in <it>A. burtoni </it><abbrgrp><abbr bid="B56">56</abbr><abbr bid="B73">73</abbr></abbrgrp>.</p>
         <p>In order to generate hypotheses regarding possible mechanisms by which the rapidly or slowly evolving cichlid genes might contribute to the process of adaptive radiation, we made use of the GO term annotations and cichlid specific slim. Over- and under-represented terms were identified among the annotations for the rapidly and slowly evolving cichlid genes (Table <tblr tid="T2">2</tblr>). Among the 759 ORFs for which p-distances were calculated, over 6,000 total annotations were applied to 647, 675, and 619 ORFs according to biological process, molecular function, and cellular component respectively. Therefore the majority of the 122 slowly evolving and 48 rapidly evolving genes could be classified bioinformatically.</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Gene Ontology terms which are over- or under-represented among the rapidly or slowly evolving cichlid ORFs. Hypergeometic p-values are reported uncorrected for multiple testing. The number of ORFs of deviating evolutionary rate (#) relative to the number of core set ORFs (total) is given.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c ca="left">
                     <p>
                        <b>Representation</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>GO-ID</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>p-value</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>#</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>total</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Description</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>biological process</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>42 with higher p-distance (647 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>none</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0050896</p>
                  </c>
                  <c ca="left">
                     <p>0.0161</p>
                  </c>
                  <c ca="right">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>86</p>
                  </c>
                  <c ca="left">
                     <p>response to stimulus</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>GO:0009987</p>
                  </c>
                  <c ca="left">
                     <p>0.0439</p>
                  </c>
                  <c ca="right">
                     <p>12</p>
                  </c>
                  <c ca="right">
                     <p>273</p>
                  </c>
                  <c ca="left">
                     <p>cellular process</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>molecular function</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>44 with higher p-distance (675 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>none</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Cellular component</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>40 with higher p-distance (619 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0015629</p>
                  </c>
                  <c ca="left">
                     <p>0.0327</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>39</p>
                  </c>
                  <c ca="left">
                     <p>actin cytoskeleton</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>none</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>biological process</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>103 with lower p-distance (647 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0009987</p>
                  </c>
                  <c ca="left">
                     <p>0.0024</p>
                  </c>
                  <c ca="right">
                     <p>57</p>
                  </c>
                  <c ca="right">
                     <p>273</p>
                  </c>
                  <c ca="left">
                     <p>cellular process</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007243</p>
                  </c>
                  <c ca="left">
                     <p>0.0052</p>
                  </c>
                  <c ca="right">
                     <p>8</p>
                  </c>
                  <c ca="right">
                     <p>19</p>
                  </c>
                  <c ca="left">
                     <p>protein kinase cascade</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007155</p>
                  </c>
                  <c ca="left">
                     <p>0.0205</p>
                  </c>
                  <c ca="right">
                     <p>7</p>
                  </c>
                  <c ca="right">
                     <p>19</p>
                  </c>
                  <c ca="left">
                     <p>cell adhesion</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0040007</p>
                  </c>
                  <c ca="left">
                     <p>0.0208</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>15</p>
                  </c>
                  <c ca="left">
                     <p>growth</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007154</p>
                  </c>
                  <c ca="left">
                     <p>0.0230</p>
                  </c>
                  <c ca="right">
                     <p>25</p>
                  </c>
                  <c ca="right">
                     <p>109</p>
                  </c>
                  <c ca="left">
                     <p>cell communication</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007267</p>
                  </c>
                  <c ca="left">
                     <p>0.0071</p>
                  </c>
                  <c ca="right">
                     <p>7</p>
                  </c>
                  <c ca="right">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>cell-cell signaling</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0016477</p>
                  </c>
                  <c ca="left">
                     <p>0.0290</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="right">
                     <p>12</p>
                  </c>
                  <c ca="left">
                     <p>cell migration</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0040008</p>
                  </c>
                  <c ca="left">
                     <p>0.0290</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="right">
                     <p>12</p>
                  </c>
                  <c ca="left">
                     <p>regulation of growth</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007409</p>
                  </c>
                  <c ca="left">
                     <p>0.0308</p>
                  </c>
                  <c ca="right">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>axonogenesis</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007610</p>
                  </c>
                  <c ca="left">
                     <p>0.0308</p>
                  </c>
                  <c ca="right">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>behavior</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0015674</p>
                  </c>
                  <c ca="left">
                     <p>0.0308</p>
                  </c>
                  <c ca="right">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>di-, tri-valent inorganic cation transport</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0019752</p>
                  </c>
                  <c ca="left">
                     <p>0.0376</p>
                  </c>
                  <c ca="right">
                     <p>10</p>
                  </c>
                  <c ca="right">
                     <p>35</p>
                  </c>
                  <c ca="left">
                     <p>carboxylic acid metabolic process</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007067</p>
                  </c>
                  <c ca="left">
                     <p>0.0402</p>
                  </c>
                  <c ca="right">
                     <p>4</p>
                  </c>
                  <c ca="right">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>mitosis</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0007417</p>
                  </c>
                  <c ca="left">
                     <p>0.0402</p>
                  </c>
                  <c ca="right">
                     <p>4</p>
                  </c>
                  <c ca="right">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>central nervous system development</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0008152</p>
                  </c>
                  <c ca="left">
                     <p>0.0016</p>
                  </c>
                  <c ca="right">
                     <p>63</p>
                  </c>
                  <c ca="right">
                     <p>477</p>
                  </c>
                  <c ca="left">
                     <p>metabolic process</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0046907</p>
                  </c>
                  <c ca="left">
                     <p>0.0180</p>
                  </c>
                  <c ca="right">
                     <p>2</p>
                  </c>
                  <c ca="right">
                     <p>44</p>
                  </c>
                  <c ca="left">
                     <p>intracellular transport</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0045045</p>
                  </c>
                  <c ca="left">
                     <p>0.0295</p>
                  </c>
                  <c ca="right">
                     <p>0</p>
                  </c>
                  <c ca="right">
                     <p>20</p>
                  </c>
                  <c ca="left">
                     <p>secretory pathway</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0009117</p>
                  </c>
                  <c ca="left">
                     <p>0.0421</p>
                  </c>
                  <c ca="right">
                     <p>0</p>
                  </c>
                  <c ca="right">
                     <p>18</p>
                  </c>
                  <c ca="left">
                     <p>nucleotide metabolic process</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>molecular function</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>110 with lower p-distance (675 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0004930</p>
                  </c>
                  <c ca="left">
                     <p>0.0157</p>
                  </c>
                  <c ca="right">
                     <p>4</p>
                  </c>
                  <c ca="right">
                     <p>7</p>
                  </c>
                  <c ca="left">
                     <p>G-protein</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0003774</p>
                  </c>
                  <c ca="left">
                     <p>0.0233</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>15</p>
                  </c>
                  <c ca="left">
                     <p>motor activity</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005262</p>
                  </c>
                  <c ca="left">
                     <p>0.0264</p>
                  </c>
                  <c ca="right">
                     <p>2</p>
                  </c>
                  <c ca="right">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>calcium channel</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0008047</p>
                  </c>
                  <c ca="left">
                     <p>0.0324</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>enzyme activator activity</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005509</p>
                  </c>
                  <c ca="left">
                     <p>0.0333</p>
                  </c>
                  <c ca="right">
                     <p>12</p>
                  </c>
                  <c ca="right">
                     <p>43</p>
                  </c>
                  <c ca="left">
                     <p>calcium ion binding</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0019899</p>
                  </c>
                  <c ca="left">
                     <p>0.0435</p>
                  </c>
                  <c ca="right">
                     <p>4</p>
                  </c>
                  <c ca="right">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>enzyme binding</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005525</p>
                  </c>
                  <c ca="left">
                     <p>0.0116</p>
                  </c>
                  <c ca="right">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>36</p>
                  </c>
                  <c ca="left">
                     <p>GTP binding</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005198</p>
                  </c>
                  <c ca="left">
                     <p>0.0407</p>
                  </c>
                  <c ca="right">
                     <p>8</p>
                  </c>
                  <c ca="right">
                     <p>85</p>
                  </c>
                  <c ca="left">
                     <p>structural molecule activity</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0051082</p>
                  </c>
                  <c ca="left">
                     <p>0.0467</p>
                  </c>
                  <c ca="right">
                     <p>0</p>
                  </c>
                  <c ca="right">
                     <p>17</p>
                  </c>
                  <c ca="left">
                     <p>unfolded protein binding</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0003743</p>
                  </c>
                  <c ca="left">
                     <p>0.0467</p>
                  </c>
                  <c ca="right">
                     <p>0</p>
                  </c>
                  <c ca="right">
                     <p>17</p>
                  </c>
                  <c ca="left">
                     <p>translation initiation factor activity</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0003924</p>
                  </c>
                  <c ca="left">
                     <p>0.0481</p>
                  </c>
                  <c ca="right">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>27</p>
                  </c>
                  <c ca="left">
                     <p>GTPase activity</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0003676</p>
                  </c>
                  <c ca="left">
                     <p>0.0483</p>
                  </c>
                  <c ca="right">
                     <p>17</p>
                  </c>
                  <c ca="right">
                     <p>147</p>
                  </c>
                  <c ca="left">
                     <p>nucleic acid binding</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>cellular component</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="3" ca="left">
                     <p>
                        <b>97 with lower p-distance (619 annotated)</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0016021</p>
                  </c>
                  <c ca="left">
                     <p>0.0096</p>
                  </c>
                  <c ca="right">
                     <p>22</p>
                  </c>
                  <c ca="right">
                     <p>88</p>
                  </c>
                  <c ca="left">
                     <p>integral to membrane</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0015630</p>
                  </c>
                  <c ca="left">
                     <p>0.0388</p>
                  </c>
                  <c ca="right">
                     <p>5</p>
                  </c>
                  <c ca="right">
                     <p>13</p>
                  </c>
                  <c ca="left">
                     <p>microtubule cytoskeleton</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005625</p>
                  </c>
                  <c ca="left">
                     <p>0.0479</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>18</p>
                  </c>
                  <c ca="left">
                     <p>soluble fraction</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>over</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005615</p>
                  </c>
                  <c ca="left">
                     <p>0.0479</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>18</p>
                  </c>
                  <c ca="left">
                     <p>extracellular space</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0032991</p>
                  </c>
                  <c ca="left">
                     <p>0.0001</p>
                  </c>
                  <c ca="right">
                     <p>19</p>
                  </c>
                  <c ca="right">
                     <p>222</p>
                  </c>
                  <c ca="left">
                     <p>macromolecular complex</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0043234</p>
                  </c>
                  <c ca="left">
                     <p>0.0015</p>
                  </c>
                  <c ca="right">
                     <p>18</p>
                  </c>
                  <c ca="right">
                     <p>195</p>
                  </c>
                  <c ca="left">
                     <p>protein complex</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0043226</p>
                  </c>
                  <c ca="left">
                     <p>0.0089</p>
                  </c>
                  <c ca="right">
                     <p>56</p>
                  </c>
                  <c ca="right">
                     <p>425</p>
                  </c>
                  <c ca="left">
                     <p>organelle</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0030529</p>
                  </c>
                  <c ca="left">
                     <p>0.0139</p>
                  </c>
                  <c ca="right">
                     <p>4</p>
                  </c>
                  <c ca="right">
                     <p>65</p>
                  </c>
                  <c ca="left">
                     <p>ribonucleoprotein complex</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005829</p>
                  </c>
                  <c ca="left">
                     <p>0.0267</p>
                  </c>
                  <c ca="right">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>51</p>
                  </c>
                  <c ca="left">
                     <p>cytosol</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>under</p>
                  </c>
                  <c ca="left">
                     <p>GO:0005739</p>
                  </c>
                  <c ca="left">
                     <p>0.0311</p>
                  </c>
                  <c ca="right">
                     <p>6</p>
                  </c>
                  <c ca="right">
                     <p>75</p>
                  </c>
                  <c ca="left">
                     <p>mitochondrion</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>There was a relatively even distribution of rapidly evolving genes across all GO categories. Only three terms, "response to stimulus", "cellular process" and "actin cytoskeleton" deviated significantly from the distribution expected by chance alone. The most significant disproportionate under-representation for the rapidly evolving genes was the category of response to stimulus for which only 1 of the 86 possible annotated ORFs was included on the list.</p>
         <p>The distribution across GO categories was highly non-uniform for the slowly evolving genes. Many categories from each ontology were represented by significantly more or fewer ORFs than would be expected by chance. Among those terms over-represented we found several relating to cellular processes such as protein kinase cascade, mitosis, and cell signaling as well as growth and cell adhesion, while metabolic process was under-represented along with the secretory pathway category.</p>
         <p>The GO analysis highlights the possible categories of genes that may play an important role in the evolution of the haplochromine cichlid fishes. This analysis presents hypotheses to be tested through focused experimental or sequence analysis. An interesting contrast in GO analysis results was observed between the rapidly evolving genes that showed little tendency to derive from a particular class and slowly evolving genes that were more structured in their distribution. The lack of structure to the distribution of rapidly evolving genes may reflect the possibility that specialization among cichlids occurs along diverse biological pathways rather than a repeated divergence of a given biological process or molecular function. The GO categories that are over-represented among slowly evolving genes could represent genes whose functions are important for phenotypic plasticity or other traits linked to the successful adaptive radiation, while those categories that are under-represented by slowly evolving genes represent categories that are not as tightly constrained.</p>
         <p>Our p-distance comparisons between the five fish species and human (as outgroup) also revealed that cichlids show the lowest average p-distance compared to <it>Homo sapiens </it>(Fig. <figr fid="F3">3</figr>). This might be an artifact that is due to the use of the haplochromine cichlid sequence as query for all BLAST searches. Alternatively, as we also found 122 slowly evolving genes in haplochromine cichlids, there might be a tendency in haplochromines to retain ancestral forms and functions. The pairwise average p-distance comparisons between the three cichlid species <it>Paralabidochromis chilotes, Ptyochromis sp. </it>"redtail sheller", and <it>Astatotilapia burtoni </it>revealed that the coalescence time between the two Lake Victoria species (0.08) is about half compared to their coalescence time with <it>A. burtoni </it>(0.14 and 0.17, respectively), which is in concordance to the phylogenetic relationships between these three taxa <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Here we report the sequencing and annotation of more than 11,000 ESTs from the East African haplochromine cichlid <it>Astatotilapia burtoni</it>. Our EST set comprises a broad range of genes involved in functions, processes and compartments. By combining the <it>A. burtoni </it>ESTs with publicly available ORFs from two Lake Victoria haplochromines and subsequent comparisons to other fish model systems, we identify a set of 170 genes with haplochromine-specific differences in evolutionary rates. These genes appear as good candidates for playing an important role in the evolution of the exceptional diversity found in (haplochromine) cichlids. Interestingly, genes that were more slowly evolving in the cichlid lineage were not evenly distributed across Gene Ontology categories; classes that are over-represented could represent genes whose functions are important for successful adaptive radiation. We also identify four genes with a K<sub>a</sub>/K<sub>s </sub>ratio greater than one, which are, hence, likely to have undergone positive selection in haplochromines. The <it>A. burtoni </it>ESTs provide novel insights into the genome of haplochromine cichlids and will serve as valuable resource for researchers working in the field of (cichlid) evolutionary genomics, particularly in the light of the forthcoming sequencing of four cichlid genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Fishes</p>
            </st>
            <p><it>Astatotilapia burtoni </it>were kept at Stanford, and at the Tierforschungsanlage of the University of Konstanz under standard conditions (12 h light, 12 h dark; 26&#176;C). For RNA isolation, fishes were sacrificed after anesthetization with MS 222 (Sigma).</p>
         </sec>
         <sec>
            <st>
               <p>Pinky cDNA Library Construction</p>
            </st>
            <p>For the preparation of the pinky cDNA library, total RNA was isolated from the following tissues of adult <it>A. burtoni</it>: brain, caudal fin, anal fin (male), lips, muscle, ovary (female), and skin. Additionally, we isolated total RNA from a juvenile individual (about 30 days after fertilization). Total RNA was isolated by guanidine thiocyanate/phenol-chlorophorm-isoamyl alcohol extraction and lithium-chloride precipitation. The different RNA samples were pooled and cDNA was synthesized using the SMART PCR cDNA Synthesis Kit (Clontech) following the manufacturer's protocol. Amplified cDNA was purified using the QIAquick PCR Purification Kit (Qiagen) and concentrated by ethanol precipitation. The pellet was dissolved in 10 &#956;l H<sub>2</sub>O. For normalization, three microliters of purified cDNA were mixed with 1 &#956;l hybridization buffer (200 mM HEPES-HCl, pH 8.0; 2 M NaCl) and incubated at 95&#176;C for 5 minutes and at 70&#176;C overnight. Then, 1 &#956;l of DNAse buffer (500 mM Tris-HCl, pH 8.0; 50 mM MgCl<sub>2</sub>, 10 mM DTT) and 0.5 &#956;l of DSN enzyme (duplex-specific nuclease; Evrogen, Russia) were added, and the mix was incubated at 65&#176;C for 20 minutes. The normalization reaction was terminated by adding 1 &#956;l 50 mM EDTA and incubation at 95&#176;C for 7 minutes. Normalized cDNA was PCR amplified (20 cycles) and cloned into pAL 16 vectors.</p>
         </sec>
         <sec>
            <st>
               <p>Brain cDNA Library Construction</p>
            </st>
            <p>A full-length, directional (EcoRI &#8211; XhoI) cDNA library was constructed in Lambda ZapII phage vector (Stratagene) with mRNA from <it>A. burtoni </it>brains (both sexes at all stages of development and social condition were included). Construction of this library has previously been described in <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. For cDNA sequencing, we used 2 &#956;l of purified PCR products, which were also used for the construction of a cDNA microarray <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>DNA-sequencing and Sequence Analysis</p>
            </st>
            <p>For sequencing of the normalized pinky cDNA library we used purified plasmid DNA from 1 ml colonies that were grown overnight. Plasmid DNA was directly sequenced using T7 primers and the BigDye Termination Reaction Kit v3.0 (Applied Biosystems) on ABI 3730 and ABI 3100 automated capillary DNA sequencers (Applied Biosystems). Sequences of the brain cDNA library were determined on an ABI 3100 DNA sequencer after cycle sequencing reactions from purified PCR products that were available from the construction of a cDNA microarray <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> using the primer CSVP3 (5'-AAGCGCGCAATTAACCCTCACTA-3') and the BigDye Termination Reaction Kit v3.0 (Applied Biosystems).</p>
            <p>Base-calling and quality trimming were performed with phred <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> using a quality score > 20. Vectors were trimmed with Sequencher 4.2.2 (Genecodes). Those ESTs having a total length of >200 bp after quality and vector trimming were considered "high-quality ESTs". Screens for possible contaminations were conducted by blastn searches against the <it>E. coli </it>genome, and the EST_human, EST_mouse and EST_others databases (downloaded in March 2005). Sequences have been deposited in GenBank under accession numbers <ext-link ext-link-type="gen" ext-link-id="CN468542">CN468542</ext-link> &#8211; <ext-link ext-link-type="gen" ext-link-id="CN472211">CN472211</ext-link> (brain library) and <ext-link ext-link-type="gen" ext-link-id="DY625779">DY625779</ext-link> &#8211; <ext-link ext-link-type="gen" ext-link-id="DY632420">DY632420</ext-link> (pinky library).</p>
         </sec>
         <sec>
            <st>
               <p>Annotation of <it>A. burtoni </it>ESTs</p>
            </st>
            <p>High quality <it>A. burtoni </it>ESTs were screened by tblastx searches against protein data from <it>Danio rerio </it>(Zebrafish Sequencing Group at the Sanger Institute), <it>Homo sapiens </it>(GenBank) and <it>Takifugu rubripes </it>(JGI Fugu v3.0) as well as ESTs from <it>Oncorhynchus mykiss </it>and <it>Tetraodon nigroviridis </it>(GenBank) using the standard vertebrate code for translation into amino acids. The expected value thresholds (e-values) were set to &lt; 1 &#215; 10<sup>-5</sup>, &lt; 1 &#215; 10<sup>-15</sup>, and &lt; 1 &#215; 10<sup>-50</sup>. The proper open reading frame for <it>A. burtoni </it>ESTs was determined with EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, based on the results from these BLAST searches.</p>
            <p>For functional annotation of <it>A. burtoni </it>ESTs, we followed the vocabulary provided by the Gene Ontology Consortium using the GO database <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Gene Ontology terms were applied to the cichlid assembled sequences by BLAST comparison to the Gene Ontology database (release 200704), which represents protein sequence for all contributed genes for which at least one GO annotation has been applied based on experimental evidence rather than only inferred electronic annotation of sequence. All GO annotations at any confidence level were then transferred from the single best-hit gene using e-value &lt; 10<sup>-12 </sup>as a threshold. The collection of GO terms used was "slimmed" in order to produce useful summaries of the annotations.</p>
            <p>This cichlid specific slim [Additional files <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>] is based upon statistical consideration for analysis of microarray results. The leaf most nodes have been selected for which 20 or more <it>A. burtoni </it>assembled sequences were annotated with this term. Parent nodes were retained only when an additional 20 <it>A. burtoni </it>assembled sequences were included. To assess the enrichment of particular classes of genes among the genes showing deviating rate of molecular evolution, Gene Ontology annotation terms were tested for significant over- and under-representation in either the higher or lower p-distance list using a hypergeometric test implemented in the BINGO plugin <abbrgrp><abbr bid="B76">76</abbr></abbrgrp> for Cytoscape <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>. Due to the exploratory nature of this analysis and controversial application of correction techniques <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>, reported p-values are not corrected for multiple testing. Only the representation for the leaf most node is reported except in cases when a larger, parent node showed increased significance. The directed acyclic graphs (DAGs) were created using hierarchical visualization in Cytoscape and manually adjusted to facilitate comprehension.</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary Analyses</p>
            </st>
            <p>For evolutionary analyses of ESTs from haplochromine cichlids, we combined our newly generated high-quality ESTs from <it>A. burtoni </it>with previously published ESTs from <it>Paralabidochromis chilotes </it>and <it>Ptyochromis sp. </it>"redtail sheller" <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and with about 1,000 ESTs obtained from a cDNA library made from <it>Metriaclima zebra </it>skin tissue (W. Salzburger, H. A. Hofmann &amp; A. Meyer, unpublished). The combined dataset, including more than 45,000 ESTs, was BLASTed against protein data from <it>Danio rerio</it>, <it>Homo sapiens </it>and <it>Takifugu rubripes </it>as well as ESTs from <it>Oncorhynchus mykiss </it>and <it>Tetraodon nigroviridis </it>(see above for source of data) using the translated BLAST routine and the standard vertebrate code. This was done to identify a set of ORFs present in all datasets under study. BLAST searches were performed with an e-value of &lt; 1 &#215; 10<sup>-50 </sup>in order to achieve high levels of confidence in the similarity searches. The cichlid query sequences and the best hits from every single BLAST search against the different databases were imported into EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
            <p>In order to identify coding sequences showing a deviating rate of molecular evolution in haplochromine cichlids compared to other fish lineages we applied the triangle method implemented in EverEST. In this approach, the query sequences are aligned to their best BLAST hits in two ingroup and one outgroup taxa using the T-Coffee algorithm <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> as implemented in EverEST <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. This reveals multiple sequence alignments consisting of four taxa. Then, uncorrected pairwise p-distances are calculated for all taxon pairs in each alignment, which are used to construct neighbor-joining trees and, after rooting with the outgroup sequences, for a global ternary representation. A relative rate test was applied to each of the orthologous groups. We applied the nonparametric rate test developed by Tajima <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, and compared the genes with their human and their fish orthologs in order to identify higher or lower substitution rates.</p>
            <p>For these analyses, we used the human sequences as outgroup since tetrapods are valid outgroup taxa for teleost fish and the human genome is the most complete and best annotated genome among those. In addition to our haplochromine cichlid query sequences, we used different sets of ingroup taxa in order to minimize biasing effects due to sparse taxon sampling. We used the following combinations of taxa for our evolutionary rate analyses using 759 ORFs that have been found in all datasets: (human, (haplochromine cichlid, <it>Danio rerio</it>, <it>Takifugu rubripes</it>)) (Fig. <figr fid="F2">2a</figr>), (human, (haplochromine cichlid, <it>Danio rerio</it>, <it>Tetraodon nigroviridis</it>)) (not shown), (human, (haplochromine cichlid, <it>Danio rerio</it>, <it>Oncorhynchus mykiss</it>)) (not shown). As a control, we also analyzed a data set without the cichlid-query sequences for the same set of ORFs (human, (<it>Danio rerio</it>, <it>Oncorhynchus mykiss</it>, <it>Takifugu rubripes</it>)) (Fig. <figr fid="F2">2b</figr>). We note that this approach might lead to an underestimation of the number of faster evolving genes, as genes that accumulated too many mutations are likely not to be chosen in the stringent initial BLAST searches. We would also like to point out that some of the observed rate differences might have accumulated on the evolutionary lineage leading to the cichlids but before the cichlids have evolved as a group.</p>
            <p>For orthologous groups, where the p-distance in the haplochromine cichlids were significantly (p &lt; 0.05) higher or lower compared to other fish, the ratio of the number of nonsynonymous substitutions per nonsynonymous site (K<sub>a</sub>) to the number of synonymous substitutions per synonymous site (K<sub>s</sub>) was calculated based on a likelihood approach <abbrgrp><abbr bid="B81">81</abbr></abbrgrp> to evaluate the selective forces acting on those proteins. The K<sub>a</sub>/K<sub>s </sub>ratio is an indicator of the form of sequence evolution, with K<sub>a</sub>/K<sub>s </sub>>> 1 providing strong evidence that positive selection has acted to change the protein sequence.</p>
            <p>We also constructed a histogram of amino acid sequence divergence of all five fish datasets with respect to homologous human sequences. We finally used the redundant sequences in the three datasets <it>P. chilotes</it>, <it>P. sp. </it>"redtail sheller", and <it>A. burtoni </it>to calculate pairwise average p-distances.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>DAG, directed acyclic graph; EST, expressed sequence tag; GO, gene ontology; ORF, open reading frame</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>WS, HAH and AM designed the study. WS and HAH were involved in library construction; WS and IB carried out the molecular work; WS, DS, SCPR, and IB performed the analyses. All authors contributed to the preparation of the manuscript. They read and approved the final version.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank E. Hespeler for technical assistance in the laboratory, P. Jantzen for assistance with GO figures and tables, and R. D. Fernald, in whose laboratory the brain cDNA library was constructed; WS was supported by a Marie Curie Fellowship of the EU, and grants from the Landesstiftung-Baden W&#252;rttemberg gGmbH and the Center for Junior Research Fellows, University of Konstanz; SCPR was supported by an NIH-NRSA grant; HAH was supported by a NIH-NIGMS grant GM068763, the Bauer Center for Genomics Research at Harvard University and the Institute for Cellular and Molecular Biology at the University of Texas, Austin; AM was supported by the Deutsche Forschungsgemeinschaft (DFG) and the University of Konstanz.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Adaptive evolution and explosive speciation: the cichlid fish model</p>
            </title>
            <aug>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Nature Reviews Genetics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>288</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1316</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131652</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The species flocks of East African cichlid fishes: recent advances in molecular phylogenetics and population genetics</p>
            </title>
            <aug>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Naturwissenschaften</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>277</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00114-004-0528-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">15241604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>African Cichlid Fishes: Model systems for evolutionary biology</p>
            </title>
            <aug>
               <au>
                  <snm>Kornfield</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Annu Rev Ecol Syst</source>
            <pubdate>2000</pubdate>
            <volume>31</volume>
            <fpage>163</fpage>
            <lpage>196</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ecolsys.31.1.163</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Out of Tanganyika: Genesis, explosive speciation, key-innovations and phylogeography of the haplochromine cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Mack</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Verheyen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>17</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">554777</pubid>
                  <pubid idtype="pmpid" link="fulltext">15723698</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-5-17</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Origin of the superflock of cichlid fishes from Lake Victoria, East Africa</p>
            </title>
            <aug>
               <au>
                  <snm>Verheyen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Snoeks</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <fpage>325</fpage>
            <lpage>329</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1080699</pubid>
                  <pubid idtype="pmpid" link="fulltext">12649486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Age of cichlids: new dates for ancient lake fish radiations</p>
            </title>
            <aug>
               <au>
                  <snm>Genner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Seehausen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Lunt</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Joyce</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>GF</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1269</fpage>
            <lpage>1282</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm050</pubid>
                  <pubid idtype="pmpid" link="fulltext">17369195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The cichlid fishes of the Great Lakes of Africa: Their biology and Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Fryer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Iles</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <publisher>Edinburgh: Oliver &amp; Boyd</publisher>
            <pubdate>1972</pubdate>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Sympatric speciation in Nicaraguan crater lake cichlid fish</p>
            </title>
            <aug>
               <au>
                  <snm>Barluenga</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stolting</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Muschick</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>439</volume>
            <fpage>719</fpage>
            <lpage>723</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04325</pubid>
                  <pubid idtype="pmpid" link="fulltext">16467837</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Sympatric speciation suggested by monophyly of crater lake cichlids</p>
            </title>
            <aug>
               <au>
                  <snm>Schliewen</snm>
                  <fnm>UK</fnm>
               </au>
               <au>
                  <snm>Tautz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Paabo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>368</volume>
            <fpage>629</fpage>
            <lpage>632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/368629a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8145848</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Similar morphologies of cichlid fish in lakes Tanganyika and Malawi are due to convergence</p>
            </title>
            <aug>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Conroy</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>McKaye</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Stauffer</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>1993</pubdate>
            <volume>2</volume>
            <fpage>158</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/mpev.1993.1016</pubid>
                  <pubid idtype="pmpid" link="fulltext">8025722</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Cichlids of the Rift Lakes</p>
            </title>
            <aug>
               <au>
                  <snm>Stiassny</snm>
                  <fnm>MLJ</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Scientific American</source>
            <pubdate>1999</pubdate>
            <volume>280</volume>
            <fpage>64</fpage>
            <lpage>69</lpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Phylogenetic relationships and evolutionary processes in East African cichlids</p>
            </title>
            <aug>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends in Ecology and Evolution</source>
            <pubdate>1993</pubdate>
            <volume>8</volume>
            <fpage>279</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0169-5347(93)90255-N</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Evolutionary strategies and morphological innovations: cichlid pharyngeal jaws</p>
            </title>
            <aug>
               <au>
                  <snm>Liem</snm>
                  <fnm>KF</fnm>
               </au>
            </aug>
            <source>Systematic Zoology</source>
            <pubdate>1973</pubdate>
            <volume>22</volume>
            <fpage>425</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2412950</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Interspecific fertile hybrids of haplochromine Cichlidae (Teleostei) and their possible importance for speciation</p>
            </title>
            <aug>
               <au>
                  <snm>Crapon de Caprona</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Fritzsch</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Netherlands Journal of Zoology</source>
            <pubdate>1984</pubdate>
            <volume>34</volume>
            <fpage>503</fpage>
            <lpage>538</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Genetic architecture sets limits on transgressive segregation in hybrid cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Albertson</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Evolution Int J Org Evolution</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>686</fpage>
            <lpage>690</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15856710</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Genome mapping of the orange blotch colour pattern in cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Streelman</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Albertson</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Mol Ecol</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>2465</fpage>
            <lpage>2471</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-294X.2003.01920.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12919484</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Directional selection has shaped the oral jaws of Lake Malawi cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Albertson</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Streelman</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>5252</fpage>
            <lpage>5257</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154331</pubid>
                  <pubid idtype="pmpid" link="fulltext">12704237</pubid>
                  <pubid idtype="doi">10.1073/pnas.0930235100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Integration and evolution of the cichlid mandible: the molecular basis of alternate feeding strategies</p>
            </title>
            <aug>
               <au>
                  <snm>Albertson</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Streelman</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Yelick</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>16287</fpage>
            <lpage>16292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283439</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251275</pubid>
                  <pubid idtype="doi">10.1073/pnas.0506649102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The evolution of the pro-domain of bone morphogenetic protein 4 (<it>Bmp4</it>) in an explosively speciated lineage of East African cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Terai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Morikawa</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1628</fpage>
            <lpage>1632</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12200490</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The evolution of genes for pigmentation in African cichlid fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Sugie</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Terai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ota</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2004</pubdate>
            <volume>343</volume>
            <fpage>337</fpage>
            <lpage>346</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2004.09.019</pubid>
                  <pubid idtype="pmpid" link="fulltext">15588588</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Cone opsin genes of african cichlid fishes: tuning spectral sensitivity by differential gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Carleton</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>1540</fpage>
            <lpage>1550</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11470845</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Biologically meaningful expression profiling across species using heterologous hybridization to a cDNA microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Renn</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Aubin-Horth</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>42</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">471549</pubid>
                  <pubid idtype="pmpid" link="fulltext">15238158</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-42</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A Second Generation Genetic Linkage Map of Tilapia (Oreochromis spp.)</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>BY</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Streelman</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Carleton</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Hulata</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Slettan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Terai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>170</volume>
            <fpage>237</fpage>
            <lpage>244</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449707</pubid>
                  <pubid idtype="pmpid" link="fulltext">15716505</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.035022</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A genetic linkage map of a cichlid fish, the tilapia (Oreochromis niloticus)</p>
            </title>
            <aug>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Sobolewska</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Penman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McAndrew</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1998</pubdate>
            <volume>148</volume>
            <fpage>1225</fpage>
            <lpage>1232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460020</pubid>
                  <pubid idtype="pmpid" link="fulltext">9539437</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Construction and characterization of BAC libraries for three fish species; rainbow trout, carp and tilapia</p>
            </title>
            <aug>
               <au>
                  <snm>Katagiri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Asakawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Minagawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hirono</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Aoki</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Anim Genet</source>
            <pubdate>2001</pubdate>
            <volume>32</volume>
            <fpage>200</fpage>
            <lpage>204</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2052.2001.00764.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11531698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Construction of a BAC library for Haplochromis chilotes, a cichlid fish from Lake Victoria</p>
            </title>
            <aug>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fujiyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Genes Genet Syst</source>
            <pubdate>2003</pubdate>
            <volume>78</volume>
            <fpage>103</fpage>
            <lpage>105</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1266/ggs.78.103</pubid>
                  <pubid idtype="pmpid" link="fulltext">12655142</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A BAC library of the East African haplochromine cichlid fish Astatotilapia burtoni</p>
            </title>
            <aug>
               <au>
                  <snm>Lang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miyake</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Braasch</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tinnemore</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Siegel</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Amemiya</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Exp Zoolog B Mol Dev Evol</source>
            <pubdate>2006</pubdate>
            <volume>306B</volume>
            <fpage>35</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/jez.b.21068</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>cimp1, a novel actin family metalloproteinase gene from East African cichlids, is differentially expressed between species during growth</p>
            </title>
            <aug>
               <au>
                  <snm>Kijimoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fujimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nakazawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kuratani</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>1649</fpage>
            <lpage>1660</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi159</pubid>
                  <pubid idtype="pmpid" link="fulltext">15858202</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>magp4 gene may contribute to the diversification of cichlid morphs and their speciation</p>
            </title>
            <aug>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kijimoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fujimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nakazawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ikeo</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2006</pubdate>
            <volume>373</volume>
            <fpage>126</fpage>
            <lpage>133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2006.01.016</pubid>
                  <pubid idtype="pmpid" link="fulltext">16517097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Extensive analysis of ORF sequences from two different cichlid species in Lake Victoria provides molecular evidence for a recent radiation event of the Victoria species flock: identity of EST sequences between Haplochromis chilotes and Haplochromis sp. "Redtailsheller"</p>
            </title>
            <aug>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shin-i</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Horiike</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tateno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2004</pubdate>
            <volume>343</volume>
            <fpage>263</fpage>
            <lpage>269</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2004.09.013</pubid>
                  <pubid idtype="pmpid" link="fulltext">15588581</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <url>http://cichlid.biosci.utexas.edu/html/cichlid_genomics.html</url>
         </bibl>
         <bibl id="B32">
            <url>http://www.cichlidgenome.org</url>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A Primer of Genome Science</p>
            </title>
            <aug>
               <au>
                  <snm>Gibson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <publisher>Sunderland, MA: Sinauer Associates, Inc</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B34">
            <title>
               <p>It's the genes! EST access to human genome content</p>
            </title>
            <aug>
               <au>
                  <snm>Gerhold</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caskey</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>1996</pubdate>
            <volume>18</volume>
            <fpage>973</fpage>
            <lpage>981</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.950181207</pubid>
                  <pubid idtype="pmpid">8976154</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Many genes in fish have species-specific asymmetric rates of molecular evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Steinke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Braasch</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>20</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1413527</pubid>
                  <pubid idtype="pmpid" link="fulltext">16466575</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-20</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Higher teleostean relationships revealed from genome-wide phylogenetic analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Steinke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2006</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B37">
            <title>
               <p>EverEST &#8211; a phylogenomic EST database approach</p>
            </title>
            <aug>
               <au>
                  <snm>Steinke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>PhyloInformatics</source>
            <pubdate>2004</pubdate>
            <volume>6</volume>
            <fpage>1</fpage>
            <lpage>4</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>prot4EST: translating expressed sequence tags from neglected genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Wasmuth</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Blaxter</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>187</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">543579</pubid>
                  <pubid idtype="pmpid" link="fulltext">15571632</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>galaxieEST: addressing EST identity through automated phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Rajashekar</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Ursing</snm>
                  <fnm>BM</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>87</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">459213</pubid>
                  <pubid idtype="pmpid" link="fulltext">15236648</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-87</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>'Egg-dummies' as natural releasers in mouth-breeding cichlids</p>
            </title>
            <aug>
               <au>
                  <snm>Wickler</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1962</pubdate>
            <volume>194</volume>
            <fpage>1092</fpage>
            <lpage>1093</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/1941092a0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Zur Stammesgeschichte funktionell korrelierter Organ- und Verhaltensmerkmale: Ei-Attrappen und Maulbr&#252;ten bei afrikanischen Cichliden</p>
            </title>
            <aug>
               <au>
                  <snm>Wickler</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Zeitschrift f&#252;r Tierpsychologie</source>
            <pubdate>1962</pubdate>
            <volume>19</volume>
            <fpage>129</fpage>
            <lpage>164</lpage>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Haplochromis burtoni (Cichlidae) Ablaichen</p>
            </title>
            <aug>
               <au>
                  <snm>Wickler</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Encyclopedia Cinematographica</source>
            <publisher>G&#246;ttingen: Institut f&#252;r den wissenschaftlichen Film</publisher>
            <pubdate>1969</pubdate>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Olfactory communication in a cichlid fish, Haplochromis burtoni</p>
            </title>
            <aug>
               <au>
                  <snm>Crapon de Caprona</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Zeitschrift f&#252;r Tierpsychologie</source>
            <pubdate>1980</pubdate>
            <volume>52</volume>
            <fpage>113</fpage>
            <lpage>134</lpage>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Fish can infer social rank by observation alone</p>
            </title>
            <aug>
               <au>
                  <snm>Grosenick</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Clement</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Fernald</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>445</volume>
            <fpage>429</fpage>
            <lpage>432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05511</pubid>
                  <pubid idtype="pmpid" link="fulltext">17251980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>What cichlids tell us about the social regulation of brain and behavior</p>
            </title>
            <aug>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Fernald</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Journal of Aquariculture and Aquatic Sciences</source>
            <pubdate>2001</pubdate>
            <volume>9</volume>
            <fpage>1</fpage>
            <lpage>15</lpage>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Functional genomics of neural and behavioral plasticity</p>
            </title>
            <aug>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>J Neurobiol</source>
            <pubdate>2003</pubdate>
            <volume>54</volume>
            <fpage>272</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/neu.10172</pubid>
                  <pubid idtype="pmpid" link="fulltext">12486709</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Gonadotropin-releasing hormone receptor in the teleost Haplochromis burtoni: structure, location, and function</p>
            </title>
            <aug>
               <au>
                  <snm>Robison</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Illing</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Troskie</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Morley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Millar</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Fernald</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Endocrinology</source>
            <pubdate>2001</pubdate>
            <volume>142</volume>
            <fpage>1737</fpage>
            <lpage>1743</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1210/en.142.5.1737</pubid>
                  <pubid idtype="pmpid" link="fulltext">11316736</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The development of the crystalline lens is sensitive to visual input in the African cichlid fish, Haplochromis burtoni</p>
            </title>
            <aug>
               <au>
                  <snm>Kroger</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Fernald</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Vision Res</source>
            <pubdate>2001</pubdate>
            <volume>41</volume>
            <fpage>549</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0042-6989(00)00283-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">11226501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The embryogenesis of rod photoreceptors in the teleost fish retina, Haplochromis burtoni</p>
            </title>
            <aug>
               <au>
                  <snm>Hagedorn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mack</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fernald</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Brain Res Dev Brain Res</source>
            <pubdate>1998</pubdate>
            <volume>108</volume>
            <fpage>217</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0165-3806(98)00051-0</pubid>
                  <pubid idtype="pmpid">9693798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Monophyletic origin of Lake Victoria cichlid fishes suggested by mitochondrial DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Basasibwaki</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1990</pubdate>
            <volume>347</volume>
            <fpage>550</fpage>
            <lpage>553</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/347550a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">2215680</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Phylogeny of the Lake Tanganyika cichlid species flock and its relationship to the Central and East African haplochromine cichlid fish faunas</p>
            </title>
            <aug>
               <au>
                  <snm>Salzburger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baric</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Verheyen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sturmbauer</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2002</pubdate>
            <volume>51</volume>
            <fpage>113</fpage>
            <lpage>135</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/106351502753475907</pubid>
                  <pubid idtype="pmpid" link="fulltext">11943095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Creating the gene ontology resource: design and implementation</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>TGO</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1425</fpage>
            <lpage>1433</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311077</pubid>
                  <pubid idtype="pmpid" link="fulltext">11483584</pubid>
                  <pubid idtype="doi">10.1101/gr.180801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <url>http://www.geneontology.org/GO.slims.shtml</url>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Neuroendocrine-specific expression of the human prohormone convertase 1 gene. Hormonal regulation of transcription through distinct cAMP response elements</p>
            </title>
            <aug>
               <au>
                  <snm>Jansen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ayoubi</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Meulemans</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Van de Ven</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>15391</fpage>
            <lpage>15397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.270.19.11222</pubid>
                  <pubid idtype="pmpid" link="fulltext">7797529</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Gonadotropin-releasing hormone signaling in behavioral pasticity</p>
            </title>
            <aug>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>Current Opinion in Neurobiology</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>343</fpage>
            <lpage>350</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.conb.2006.05.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16697636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Claudin multigene family encoding four-transmembrane domain protein components of tight junction strands</p>
            </title>
            <aug>
               <au>
                  <snm>Morita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Furuse</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fujimoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsukita</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>511</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15167</pubid>
                  <pubid idtype="pmpid" link="fulltext">9892664</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.2.511</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>MTA1 interacts with MAT1, a cyclin-dependent kinase-activating kinase complex ring finger factor, and regulates estrogen receptor transactivation functions</p>
            </title>
            <aug>
               <au>
                  <snm>Talukder</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Mishra</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Mandal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Balasenthil</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mehta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sahin</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Barnes</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <fpage>11676</fpage>
            <lpage>11685</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M209570200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12527756</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Strengths and weaknesses of EST-based prediction of tissue-specific alternative splicing</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zink</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Korn</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>72</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">521684</pubid>
                  <pubid idtype="pmpid" link="fulltext">15453915</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-72</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Identification and mapping of human cDNAs homologous to Drosophila mutant genes through EST database searching</p>
            </title>
            <aug>
               <au>
                  <snm>Banfi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Borsani</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bernard</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Guffanti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rubboli</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Marchitiello</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Giglio</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Coluccia</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zollo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zuffardi</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Ballabio</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1996</pubdate>
            <volume>13</volume>
            <fpage>167</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0696-167</pubid>
                  <pubid idtype="pmpid" link="fulltext">8640222</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Analysis of EST-driven gene annotation in human genomic sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>LC</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Searls</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Overton</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>362</fpage>
            <lpage>376</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9548972</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Exhaustive mining of EST libraries for genes differentially expressed in normal and tumour tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Specht</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Beckmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dahl</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pilarsky</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Hinzmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rosenthal</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>4251</fpage>
            <lpage>4260</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148701</pubid>
                  <pubid idtype="pmpid" link="fulltext">10518618</pubid>
                  <pubid idtype="doi">10.1093/nar/27.21.4251</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries</p>
            </title>
            <aug>
               <au>
                  <snm>Habermann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bebin</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Herklotz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Volkmer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eckelt</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pehlke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Epperlein</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Schackert</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Wiebe</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522874</pubid>
                  <pubid idtype="pmpid" link="fulltext">15345051</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-9-r67</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Analysis of expressed sequence tags indicates 35,000 human genes</p>
            </title>
            <aug>
               <au>
                  <snm>Ewing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>232</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/76115</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835644</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee</p>
            </title>
            <aug>
               <au>
                  <snm>Whitfield</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Band</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Bonaldo</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pardinas</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>555</fpage>
            <lpage>566</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187514</pubid>
                  <pubid idtype="pmpid" link="fulltext">11932240</pubid>
                  <pubid idtype="doi">10.1101/gr.5302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>A Comprehensive EST Linkage Map for Tiger Salamander and Mexican Axolotl: Enabling Gene Mapping and Comparative Genomics in Ambystoma</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Kump</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Parichy</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Generation of a high-density rat EST map</p>
            </title>
            <aug>
               <au>
                  <snm>Scheetz</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Nishimura</snm>
                  <fnm>DY</fnm>
               </au>
               <au>
                  <snm>McClain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Birkett</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gardiner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Butters</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kwitek-Black</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jacob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Casavant</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Sheffield</snm>
                  <fnm>VC</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>497</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311028</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230173</pubid>
                  <pubid idtype="doi">10.1101/gr.GR-1516R</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Genetic linkage maps of the red flour beetle, Tribolium castaneum, based on bacterial artificial chromosomes and expressed sequence tags</p>
            </title>
            <aug>
               <au>
                  <snm>Lorenzen</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Doyungan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Savard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Snow</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Crumly</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Shippy</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Beeman</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>170</volume>
            <fpage>741</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1450394</pubid>
                  <pubid idtype="pmpid" link="fulltext">15834150</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.032227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lartillot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>1246</fpage>
            <lpage>1253</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi111</pubid>
                  <pubid idtype="pmpid" link="fulltext">15703236</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Identification and characterization of new plant microRNAs using EST analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>XP</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>QL</fnm>
               </au>
               <au>
                  <snm>Cobb</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Cell Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>336</fpage>
            <lpage>360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.cr.7290302</pubid>
                  <pubid idtype="pmpid" link="fulltext">15916721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Masculinized dominant females in a cooperatively breeding species</p>
            </title>
            <aug>
               <au>
                  <snm>Aubin-Horth</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Desjardins</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Martei</snm>
                  <fnm>YM</fnm>
               </au>
               <au>
                  <snm>Balshine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>Mol Ecol</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <fpage>1349</fpage>
            <lpage>1358</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-294X.2007.03249.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17391260</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Development and application of a salmonid EST database and cDNA microarray: data mining and interspecific hybridization characteristics</p>
            </title>
            <aug>
               <au>
                  <snm>Rise</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>von Schalburg</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Mawer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Devlin</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Kuipers</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Busby</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beetz-Sargent</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Alberto</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shukin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zeznik</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Smailus</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schein</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Butterfield</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Stott</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Koop</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>478</fpage>
            <lpage>490</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353236</pubid>
                  <pubid idtype="pmpid" link="fulltext">14962987</pubid>
                  <pubid idtype="doi">10.1101/gr.1687304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Somatostatin regulates aggressive behavior in an African cichlid fish</p>
            </title>
            <aug>
               <au>
                  <snm>Trainor</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>HA</fnm>
               </au>
            </aug>
            <source>Endocrinology</source>
            <pubdate>2006</pubdate>
            <volume>147</volume>
            <fpage>5119</fpage>
            <lpage>5125</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1210/en.2006-0511</pubid>
                  <pubid idtype="pmpid" link="fulltext">16887916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <url>http://www.genome.washington.edu/UWGC/analysistools/Phred.cfm</url>
         </bibl>
         <bibl id="B75">
            <url>http://www.geneontology.org</url>
         </bibl>
         <bibl id="B76">
            <title>
               <p>BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks</p>
            </title>
            <aug>
               <au>
                  <snm>Maere</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Heymans</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuiper</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3448</fpage>
            <lpage>3449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti551</pubid>
                  <pubid idtype="pmpid" link="fulltext">15972284</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <url>http://www.cytoscape.org/</url>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Resampling-based multiple testing for microarray data analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Ge</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Speet</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Test</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>1</fpage>
            <lpage>77</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF02595811</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>T-Coffee: A novel method for fast and accurate multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Simple methods for testing the molecular evolutionary clock hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Tajima</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1993</pubdate>
            <volume>135</volume>
            <fpage>599</fpage>
            <lpage>607</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205659</pubid>
                  <pubid idtype="pmpid" link="fulltext">8244016</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>568</fpage>
            <lpage>573</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9580986</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
